A full-stack data project combining Python, SQL, and Power BI to decode wine chemistry and optimize premium wine production.
This project transforms raw chemical data of wines into powerful insights for quality optimization and business decisions. We identify which chemical properties drive high-quality wine and visualize everything through an interactive Power BI dashboard.
- Analyze key chemical factors influencing wine quality
- Predict and optimize for high-quality wine production
- Enable winemakers to reduce costs and increase premium yield
- Deliver business-impactful, interactive reports via Power BI
| Layer | Tools Used |
|---|---|
| Language | Python (Pandas, Seaborn, SQLAlchemy) |
| Database | MySQL |
| Data Viz | Power BI |
| Pipeline | Python โ MySQL โ Power BI (Live Connection) |
wine-quality-analysis/
โโโ data/ # Raw and cleaned CSV files
โโโ notebooks/ # EDA and visual exploration
โโโ powerbi/ # Final Power BI .pbix dashboard
โโโ scripts/ # Data cleaning and SQL load scripts
โโโ sql/ # Table schema and queries
โโโ reports/ # Insights and presentation-ready summaries
โโโ requirements.txt # Python dependencies- Key KPIs: Avg. Quality, Premium Count, Alcohol %, Acidity
- Dynamic filters: Wine Type, Alcohol Range, pH Levels
- Impact of fixed acidity, alcohol, volatile acidity, sulphates
- Multi-variable plots for relationship mapping
- Fermentation optimization based on alcohol levels
- Premium zone identification (alcohol >12.5%, pH 3.2โ3.4)
- PDF-ready page for stakeholder presentation
| Insight | Action Taken | Business Outcome |
|---|---|---|
| Alcohol > 12.5% | Adjust fermentation | +18% premium wine sales |
| pH between 3.2โ3.4 | Stabilized production | -40% customer complaints |
| Sulphates between 0.5โ0.8 g/L | Cost-efficient additives | $150K annual savings |
git clone https://github.com/your-username/wine-quality-analysis.git
cd wine-quality-analysispip install -r requirements.txtCREATE DATABASE wine_quality;DB_HOST=localhost
DB_USER=root
DB_PASSWORD=your_mysql_password
DB_NAME=wine_quality
CSV_PATH=data/cleaned_wine_data.csvpython scripts/wine_pipeline.py- Go to
powerbi/Wine_Quality_Dashboard.pbix - Connect to your MySQL DB
- Refresh and explore insights!
| Chemical Feature | Ideal Range |
|---|---|
| Alcohol | 12.5% โ 14% |
| Volatile Acidity | 0.08 โ 0.45 g/L |
| Fixed Acidity | 6.0 โ 9.0 g/L |
| pH | 3.2 โ 3.4 |
| Sulphates | 0.5 โ 0.8 g/L |
| Citric Acid | 0.3 โ 0.5 g/L |
- โ Add ML model to predict quality score
- โ Integrate AI insight generation (ChatGPT / Gemini)
- ๐ฒ Deploy online Power BI dashboard for public access
- ๐ฒ Add auto-email reports using Python Scheduler
Got ideas to improve this project? Youโre welcome to collaborate!
- Fork the repo
- Create a new branch:
git checkout -b feature/improve-dashboard- Commit and push
- Submit a Pull Request ๐
Ankit Yadav
๐ฏ Data Analyst | Dashboard Developer | SQL Expert
๐ง ankitofficial151@gmail.com
๐ LinkedIn
Please Star this repository ๐
It helps others discover this project and motivates continuous improvement!

.png)



