I'm an AI engineer building systems that reason, plan, and learn. Background in electrical engineering and full-stack dev.
- ๐ Exploring Interpretability in AI, Vision-Language-Action (VLA) models, and World Models
- ๐ ๏ธ Experience in Agentic AI, Deep Learning, Reinforcement Learning, and LLM applications
- ๐ฌ Building end-to-end systems: from autonomous research pipelines to AI-powered stock analysis
- โก Fun fact: Built an autonomous research agent that discovers papers, runs experiments in Docker sandboxes, and writes weekly reports!
๐ฌ Research Agent
Autonomous Research Pipeline | Claude API | Docker
Autonomous research system that discovers papers from arXiv and Semantic Scholar, analyzes them with AI, runs experiments in Docker sandboxes or Modal cloud GPUs, and generates weekly narrative reports.
- ๐ง Built a 5-stage pipeline: Ingestion โ Synthesis โ Experiments โ Analysis โ Reporting
- ๐ Maintains a knowledge graph (NetworkX) with hybrid retrieval (BM25 + vector RRF)
- ๐ Experiment safety gates: AST validation, Bandit scan, auto-fix, human approval, Docker isolation
- ๐ Statistical analysis with confidence intervals, t-tests, and Cohen's d
- ๐ฅ๏ธ Streamlit web UI, Typer CLI, and APScheduler daemon for automated research cycles
Tech Stack: Python โข Claude API โข ChromaDB โข Docker โข Modal โข Streamlit โข NetworkX โข SQLite
๐ StockX
AI-Powered Stock Analysis | ReAct Reasoning | LLMs
Local AI-powered stock analysis app with a ReAct reasoning loop, technical and fundamental analysis, portfolio P&L tracking, sector heatmaps, and real-time alerts.
- ๐ค ReAct reasoning loop with multi-provider LLM fallover (NVIDIA NIM โ Claude โ GPT)
- ๐พ Persistent ChromaDB vector memory with JSONL fallback
- ๐ Scored recommendations (STRONG BUY to AVOID), sector screening, and earnings calendar
- ๐ฐ Financial news aggregation with sentiment scoring
- ๐จ Fintech-style dark GUI built with PyQt and matplotlib charting
Tech Stack: Python โข PyQt โข ChromaDB โข yfinance โข LLMs โข matplotlib
Deep Q-Network (DQN) | PyTorch
Built an intelligent Tetris agent using Deep Reinforcement Learning with custom reward shaping and GA-optimized features.
- ๐ง Implemented GPU-accelerated DQN with experience replay and target networks
- ๐ Achieved 3ร improvement in average lines cleared vs. baseline
- ๐ฏ Engineered board-state features (aggregate height, holes, bumpiness) with delta-based rewards
- ๐ Trained for 1.5M timesteps with automated evaluation pipeline
Tech Stack: Python โข PyTorch โข Reinforcement Learning
Retrieval-Augmented Generation | FastAPI | LLMs
End-to-end RAG system for querying research papers with semantic search and grounded responses.
- ๐ Implemented PDF parsing, semantic chunking, and vector-based retrieval
- ๐ค Built LLM-powered Q&A with source citations to reduce hallucinations
- โก Designed FastAPI web interface + CLI for low-latency question answering
- ๐พ Integrated ChromaDB for efficient vector storage and similarity search
Tech Stack: Python โข LangChain โข ChromaDB โข FastAPI โข Vector Embeddings
Northeastern University | Master of Science in Artificial Intelligence ๐ San Jose, CA | Sep 2025 โ May 2027
APJ Abdul Kalam Technological University | Bachelor of Technology in Electrical & Electronics Engineering ๐ Kerala, India | Oct 2020 โ May 2024
