Skip to content
View HarshilMaks's full-sized avatar

Highlights

  • Pro

Block or report HarshilMaks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
.github/profile/README.md

Harshil Makhija

Machine Learning Practitioner & Backend Systems Engineer

I work at the convergence of AI systems, backend engineering, and cloud infrastructure, building practical, scalable solutions that move from idea to deployed product. My work spans LLM agents and RAG architectures to high-performance APIs that reliably serve ML models in production. Designing clean systems to ship iteratively and scale with intent. Everything I build is rooted in engineering fundamentals and guided by a product-focused mindset.


Professional Interests

Deep Learning • LLMOps • RAG Pipelines • Computer Vision
Backend APIs • Cloud-Native Architecture • System Design
Multi-Agent Systems • Model Serving • Vector Search


Core Stack

Languages: Python, TypeScript, C++, SQL AI & ML: PyTorch, Hugging Face Transformers, OpenCV, LangChain, LlamaIndex, RAG, Agentic AI Backend & Databases: FastAPI, NestJS, API Design, PostgreSQL, Redis, Milvus & Pinecone Cloud & Infrastructure: AWS (EC2, S3, EKS, Lambda, Bedrock), GCP (GKE, BigQuery) DevOps & MLOps: Docker, Terraform, GitHub Actions, MLflow, Airflow, Prefect, Weights & Biases Observability & Systems: Prometheus, Grafana, OpenTelemetry, Sentry, Microservices, gRPC, Event-Driven Architecture


Professional Ethos

I enjoy building systems to production state: architecting services, developing ML workflows, and deploying everything through cloud-native pipelines. So my main focus remains consistent across projects:

  • Reliability: Systems that hold up under real usage
  • Scalability: Infrastructure that grows without friction
  • Maintainability: Clear, modular, testable code
  • Performance: Faster inference, optimised retrieval, efficient design
  • Impact: Practical solutions powered by modern AI

Whether it's a microservice architecture or a vector-search pipeline, I aim for clarity, robustness, and long-term maintainability.


Connect

Email: harshilmakhija@outlook.com
LinkedIn: https://www.linkedin.com/in/harshil-makhija-500909353/
X / Twitter: https://twitter.com/MakhijaHarshil

Always learning. Always building.

Pinned Loading

  1. transformer transformer Public

    The Transformer architecture from the “Attention Is All You Need” paper. Engineered for real‑world prototyping, it transparently exposes core attention mechanisms and model workflows

    Python

  2. tenancy-service tenancy-service Public

    A multi-tenant organization management service with 8 layer logic structure designed for comprehensive lifecycle control and enterprise-level governance.

    Python 5

  3. ghost_architect_gemma3 ghost_architect_gemma3 Public

    Google's Gemma-3-12B LLM fine-tuned using QLoRA + DoRA + rsLoRA optimized to be Visual-to-Database Compiler

    HTML

  4. TemporalSentimentTrader TemporalSentimentTrader Public

    An intelligent stock analysis platform that combines social sentiment and real-time market data to generate short-term predictions and actionable insights into market behavior.

    Python