Das-Chinmay/README.md
LinkedIn Gmail GitHub Portfolio


🧑‍💻 About Me

name: Chinmay Satish Das
location: Riverside, CA (open to relocation)
education: M.S. Computer Science @ UC Riverside
focus:
  - Data Engineering
  - Software Engineering
  - AI / ML Systems
currently_working_on:
  - Agentic RAG pipeline with GraphRAG + Pinecone
  - CTR prediction across 40M+ ad impressions
open_to: Full-time SWE · Data Engineering · AI/ML roles
fun_fact: Built a full-stack ETF compliance platform in one day using only free tools

🛠️ Tech Stack

👨‍💻 Languages

Python TypeScript SQL Scala JavaScript

⚙️ Backend & Frameworks

FastAPI Node.js React SQLAlchemy

🗄️ Databases & Storage

PostgreSQL Snowflake Redis Elasticsearch Neo4j Pinecone

📊 Data Engineering

Apache Spark PySpark dbt Apache Kafka Airflow Databricks Hadoop

🤖 AI / ML

LangChain OpenAI scikit-learn LightGBM XGBoost MLflow

☁️ Cloud & DevOps

AWS Azure Docker Kubernetes GitHub Actions


🚀 Featured Projects

🏦 ETF Ops Platform

Full-stack ETF compliance and operations platform with SEC filing tracking, AI-powered disclosure review, live EDGAR filings, an audit log, and an exception queue.

Stack: Python · FastAPI · PostgreSQL · JS · Gemini API

Live Demo GitHub

Built end-to-end in 1 day on free infrastructure

🧠 Agentic RAG Pipeline

GraphRAG system using Neo4j knowledge graphs + Pinecone for semantic retrieval across 50K+ engineering documents.

Stack: LangChain · OpenAI · Neo4j · Pinecone

  • 87% answer accuracy
  • Sub-2s latency
  • Multi-step agentic workflows with tool-use

📈 CTR Prediction: 40M+ Impressions

Click-through rate prediction using gradient-boosted trees with behavioral and contextual feature engineering.

Stack: LightGBM · XGBoost · MLflow · scikit-learn

  • 12% AUC improvement
  • Full MLflow experiment tracking
  • Statistical validation across 40M+ records

🔒 ML Intrusion Detection System

Machine-learning-based network intrusion detection using Random Forest and ensemble methods on network telemetry data.

Stack: Python · scikit-learn · Jupyter

GitHub


📊 GitHub Stats

GitHub Streak


💼 Experience

| Role | Company | Period |
| --- | --- | --- |
| 🖥️ Data Engineer Intern | California Air Resources Board | Sep – Dec 2025 |
| 🤖 Software Developer Intern | Afro Alternatives LLC | Jun – Sep 2025 |
| 📚 Student Assistant, Big Data Systems | UC Riverside | Jan – Apr 2025 |
| 🔬 Software Developer Intern | DRDO, India | May – Jul 2023 |

💬 Let's build something together

LinkedIn Email

Open to full-time roles in Software Engineering · Data Engineering · AI/ML

Pinned

  1. Attribute-Based-Encryption-for-Heath-Care-data (Public)

    Attribute Based Encryption for Protecting Healthcare Data

    Python 1

  2. Codeshield (Public)

    Python

  3. etf-ops (Public)

    HTML

  4. Machine-Learning-Based-Intrusion-Detection-System (Public)

    This project demonstrates the effectiveness of machine learning in intrusion detection. The Random Forest model emerged as the best-performing model, achieving the highest accuracy while handling i…

    Jupyter Notebook