|
i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic. before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks. honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old. what i'm working on right now:
|
|
|
Agentic AI Workflows |
RAG Enterprise Search |
|
Voice AI Platform |
LLM Fine-Tuning LoRA |
|
RLHF LLM Optimization |
Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
|
Ai Safety And Alignment Engineering
|
Real Time Model Serving With Gpus
|
Multi Agent Ai Orchestration Patterns
|
💬 Commented on issue: 0.9.5 -> 0.9.6 Model-attached skills are injected int in open-webui/open-webui (2026-06-04)
💬 Commented on YOLOE Visual Prompt based Classification in ultralytics/ultralytics (2026-06-04)
💬 Commented on Chinese characters display garbled in sweepai/sweep (2026-06-04)
💬 Commented on Feature Request: Implement Adaptive PFlash (Self-Tuning Pref in ggml-org/llama.cpp (2026-06-04)
💬 Commented on Transformer Engine plugin fails to check weight exists for L in Lightning-AI/pytorch-lightning (2026-06-04)
💬 Commented on Performance/caching issue: tokenizer fails to reset has_spec in explosion/spaCy (2026-06-04)
💬 Commented on PXI: dumps raw resource IDs instead of actionable links in r in Arize-ai/phoenix (2026-06-04)
💬 Commented on Your project is now listed on CodeGuilds in modelcontextprotocol/servers (2026-06-04)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 Fine-Tuning and Customization of Open-Source LLMs for Domain-Specific Tasks
🔬 Retrieval-Augmented Generation (RAG) in Production LLM Systems
🔬 Graph RAG and Knowledge Graphs for LLMs
🔬 LLM Fine-Tuning at Scale with LoRA
🔬 AI Safety and Alignment Engineering
🔬 Edge AI and TinyML
📌 Prompt Template Engine with Variable Injection — Production Pattern (Python) (2026-06-04)
📌 Agent Tool Registry with Dynamic Discovery — Production Pattern (Python) (2026-06-04)
📌 Agent Tool Registry with Dynamic Discovery — Production Pattern (Python) (2026-06-02)
🤖 Profile auto-updated on 2026-06-04 20:13 UTC


