AI Engineer with 5+ years of experience building production-grade AI systems, specializing in LLMs, RAG pipelines, and agentic / multi-agent architectures.
Currently working on national-scale AI systems serving millions of users, focusing on:
- Agentic systems & tool-calling
- RAG & retrieval pipelines (including structured data)
- Observability, evaluation, and guardrails
- Scalable backend systems (FastAPI, async, microservices)
I enjoy turning AI prototypes into reliable, production-ready systems.
- Agentic & Multi-Agent Systems (LangGraph-based workflows)
- RAG Pipelines (text + structured data like tables)
- LLM Evaluation Systems (LLM-as-judge, regression testing)
- AI Observability & Guardrails (logging, tracing, validation)
- High-performance inference systems (vLLM, batching, caching)
- Scalable APIs (FastAPI, async, microservices)
- Designed multi-agent architecture combining RAG + tool-calling
- Implemented guardrails, evaluation pipelines, and observability
- Optimized for latency, cost, and scalability (sub-2s responses)
- Built autonomous agent for search, ranking, and comparison of listings
- Combined web search, structured extraction, and LLM reasoning
- Designed ranking + filtering logic based on real-world criteria