Parshwa1504/README.md

Hi, I'm Parshwa Gandhi 👋

Data Analyst & Data Engineer · MS Computer Science @ George Mason University


🧠 About Me

I sit in the overlap between Data Analytics and Data Engineering — I build end-to-end pipelines that move data from messy reality into dashboards executives actually trust, and I write the SQL that answers what the business is actually asking.

  • 🎓 Graduating MS Computer Science from George Mason University in May 2026
  • 💼 Previously: ML Product Engineer Intern @ Aorbis · Software Developer Intern @ Xcellence-IT
  • 🌎 Based in the Washington DC area — open to relocation
  • 📬 Reach me at pgandhi6@gmu.edu

📊 By the Numbers

🚀  19 projects shipped across analytics & engineering
📈  10M+ records processed across AWS and GCP
⚡  50% latency reduction on LLM inference pipelines
💰  $500K+ in quantified business insights delivered

🛠️ Tech Stack

  • Languages: Python, SQL, R, JavaScript, Bash
  • Data Engineering & Orchestration: Apache Airflow, dbt, Apache Spark, Apache Kafka, Mage AI
  • Databases & Warehouses: Snowflake, BigQuery, Redshift, PostgreSQL, MySQL, MongoDB
  • Analytics & BI: Tableau, Power BI, Looker Studio, Pandas, NumPy, Excel
  • Cloud & DevOps: AWS, GCP, Azure, Docker, Git, Jenkins


🚀 Featured Projects

🏗️ Data Engineering

| Project | Stack | Highlight |
| --- | --- | --- |
| Stock Market ETL & Predictive Pipeline | Airflow · dbt · Snowflake · Parquet | Cloud ETL ingesting data for 100+ companies · ML predictions at 70% accuracy |
| Geospatial Data Pipeline for Taxi Analytics | GCP · BigQuery · Mage AI · Looker Studio | ELT pipeline processing 10M+ trips · 40% query speedup |

📊 Data Analytics

| Project | Stack | Highlight |
| --- | --- | --- |
| Customer Churn Analysis | SQL · Tableau · Python | Found the $130K churn driver in 7,043 telecom customers |
| Bank Loan Portfolio Analysis | MySQL · Tableau | Exposed a 5x risk gap across grades in a $435.8M, 38K-application portfolio |
| A/B Test Statistical Analysis | Python · Statistical Analysis | Caught an underpowered test (25.9% statistical power) on 290K sessions · prevented a costly launch |
| E-Commerce Cohort & RFM Analysis | SQL · Power BI | Segmented 96K customers across $19.7M revenue · Quantified $356K uplift opportunity |
| HR Analytics Employee Attrition | Python · SQL · SQLite · scikit-learn · Tableau · pandas · Logistic Regression | 75% model accuracy with 77% recall · Identified overtime as the #1 attrition driver · Surfaced targeted retention actions projected to save $330K–$660K annually |
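
For readers unfamiliar with what an "underpowered" A/B test means, here is a minimal sketch of a power calculation for a two-sided two-proportion z-test, using the normal approximation and only the Python standard library. It is not the code from the A/B test project: the function name `two_proportion_power`, the conversion rates, and the even split of sessions are all hypothetical, and the numbers are not meant to reproduce the 25.9% figure.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_power(p_control, p_treatment, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-proportion z-test.

    Uses the normal approximation with a pooled standard error under the
    null hypothesis and an unpooled standard error under the alternative.
    """
    z = NormalDist()
    z_crit = z.inv_cdf(1 - alpha / 2)

    # Standard error under H0 (both groups share the pooled rate).
    p_bar = (p_control + p_treatment) / 2
    se_null = sqrt(2 * p_bar * (1 - p_bar) / n_per_group)

    # Standard error under H1 (each group keeps its own rate).
    se_alt = sqrt(
        p_control * (1 - p_control) / n_per_group
        + p_treatment * (1 - p_treatment) / n_per_group
    )

    delta = abs(p_treatment - p_control)

    # Probability that the observed difference lands in either rejection tail.
    upper_tail = 1 - z.cdf((z_crit * se_null - delta) / se_alt)
    lower_tail = z.cdf((-z_crit * se_null - delta) / se_alt)
    return upper_tail + lower_tail


if __name__ == "__main__":
    # Hypothetical inputs: illustrative rates and an assumed even split.
    power = two_proportion_power(0.120, 0.122, n_per_group=145_000)
    print(f"Estimated power: {power:.1%}")
```

A common convention is to aim for at least 80% power before trusting a null result. When the estimate comes out well below that, a "no significant difference" outcome says little about whether the change actually works.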

🤖 AI & Machine Learning

| Project | Stack | Highlight |
| --- | --- | --- |
| Credit Card Fraud Detection | CatBoost · XGBoost · LightGBM | Ensemble model with 80%+ detection accuracy on real-time risk scoring |
| Driver Drowsiness Detection System | CNN · OpenCV · Keras | Real-time alert system trained on 7,000+ eye-state images |
| AI Code Review and Security Auditor Agent | Python · LLMs · NLP · Security · AI | Detects critical vulnerabilities such as SQL injection and XSS and suggests automated remediations |

📂 See all projects → github.com/Parshwa1504?tab=repositories


📫 Let's Connect

If you've been burned by a flaky pipeline at 2 AM, or watched a team argue for an hour about what a number means, we should talk.

Thanks for stopping by! ⭐ any project that catches your eye.

Pinned

  1. Stock-Market-ETL-Predictive-Analytics-Pipeline (Public)

    Python 1

  2. AB-Test-Ecommerce-Analysis (Public)

    A/B test analysis of an e-commerce landing page redesign using hypothesis testing, power analysis, and segment analysis in Python.

    Jupyter Notebook 1

  3. Bank-Loan-Analysis (Public)

    Bank loan performance analysis using SQL, Python, Excel and Tableau — 38K records, MTD/MoM KPI tracking, good vs bad loan segmentation, and interactive dashboards.

    Jupyter Notebook 1

  4. Customer-Churn-Analysis (Public)

    End-to-end customer churn analysis using SQL, Python EDA, Risk Segmentation, and Logistic Regression to identify at-risk customers and drive retention strategy.

    Jupyter Notebook 1

  5. Ecommerce-Cohort-Revenue-Analysis (Public)

    End-to-end e-commerce analysis of 96,470 orders using SQL, Python, and Power BI, covering RFM segmentation, revenue trends, customer acquisition cohort analysis, and an interactive dashboard.

    Jupyter Notebook 1

  6. HR-Analytics-Employee-Attrition (Public)

    HR analytics project analyzing employee attrition across 1,470 employees using SQL, Python, and Tableau, with a logistic regression model, RFM-style segmentation, and identification of top at-risk employ…

    Jupyter Notebook 1