Skip to content

DeveshShukla23/Quantium-Data-Analytics-Simulation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Header

Typing SVG


Forage Certificate Status Python Pandas SciPy Jupyter


LinkedIn GitHub


πŸ›’ About This Project

project = {
    "name"        : "Quantium Data Analytics Job Simulation",
    "platform"    : "Forage",
    "company"     : "Quantium β€” Australia's Leading Analytics Firm",
    "completed"   : "March 22nd, 2026",
    "tasks"       : 3,
    "dataset"     : "Retail Chip Category | FY2018-19",
    "tools"       : ["Python", "Pandas", "NumPy", "SciPy", "Matplotlib", "Seaborn"],
    "skills"      : ["Customer Analytics", "Uplift Testing", "Pearson Correlation",
                     "t-test", "95% CI", "Pyramid Principle", "Commercial Reporting"],
    "outcome"     : "πŸ† Certificate Issued β€” Data Analytics Job Simulation"
}

πŸ’‘ "Replicating the real work of a Quantium Data Analyst β€” turning raw retail data into commercial decisions that drive business growth."


πŸ“Š Key Metrics

πŸ’° Total Revenue πŸ›οΈ Transactions πŸ‘€ Customers 🧾 Avg Spend/Trip
$1.8M 245,255 71,287 $7.36

πŸ“ Repository Structure

πŸ“¦ Quantium-Data-Analytics-Simulation/
β”‚
β”œβ”€β”€ πŸ““ Quantium_Task1.ipynb           β†’ Customer Analytics
β”œβ”€β”€ πŸ““ Quantium_Task2.ipynb           β†’ Uplift Testing & Experimentation
β”œβ”€β”€ πŸ“Š fig_monthly_transactions.png
β”œβ”€β”€ πŸ“Š fig_customer_segments.png
β”œβ”€β”€ πŸ“Š fig_sales_by_lifestage.png
β”œβ”€β”€ πŸ“Š fig_sales_by_premium.png
β”œβ”€β”€ πŸ“Š fig_sales_heatmap.png
β”œβ”€β”€ πŸ“Š fig_customer_count_heatmap.png
β”œβ”€β”€ πŸ“Š fig_avg_spend_heatmap.png
β”œβ”€β”€ πŸ“Š fig_top_brands.png
β”œβ”€β”€ πŸ“Š fig_pack_size_sales.png
β”œβ”€β”€ πŸ“Š fig_avg_txns_heatmap.png
β”œβ”€β”€ πŸ“Š fig_brand_preference.png
β”œβ”€β”€ πŸ“Š fig_pack_preference.png
β”œβ”€β”€ πŸ“Š fig_store77_*.png              β†’ Store 77 Trial Charts
β”œβ”€β”€ πŸ“Š fig_store86_*.png              β†’ Store 86 Trial Charts
β”œβ”€β”€ πŸ“Š fig_store88_*.png              β†’ Store 88 Trial Charts
└── πŸ“„ README.md

⚠️ Note: Raw data files excluded per Quantium's confidentiality policy.


🎯 Task 1 β€” Customer Analytics

Methodology

Step Action
🧹 Data Cleaning Parsed dates, extracted pack sizes via regex, removed salsa & outliers
βš™οΈ Feature Engineering Extracted brand names, merged transaction + customer datasets
πŸ‘₯ Segmentation Analysed 7 lifestage segments Γ— 3 premium tiers
πŸ“ˆ Sales Analysis Monthly trends, brand performance, pack size preferences
πŸ“ Statistics Avg spend and frequency comparison across all segments

πŸ” Key Findings

πŸ“Œ Mainstream customers      β†’ drive 38.8% of revenue ($700K)
πŸ“Œ Older Families            β†’ highest avg spend $34 AND frequency 4.7x/year
πŸ“Œ Young Families            β†’ $32 avg spend, over-index on Smiths & Red Rock Deli
πŸ“Œ Kettle brand              β†’ #1 at 21.6% of chip sales ($390K) β€” 2x nearest competitor
πŸ“Œ 175g pack                 β†’ drives 26.9% of all chip revenue β€” universally preferred
πŸ“Œ December festive peak     β†’ clearly visible | February dip reflects shorter month

πŸ“Έ Task 1 Visualizations

Monthly Transactions Customer Segments Sales by Lifestage Sales by Premium Sales Heatmap Customer Count Heatmap Avg Spend Heatmap Avg Transactions Heatmap Top Brands Pack Size Sales Brand Preference Pack Preference


πŸ§ͺ Task 2 β€” Uplift Testing & Experimentation

Methodology

Step 1 β†’ Calculate monthly metrics per store (Sales, Customers, Txns, AvgPrice)
Step 2 β†’ Filter to pre-trial period (Jul 2018 – Jan 2019)
Step 3 β†’ Match control stores using Pearson Correlation
Step 4 β†’ Scale control store metrics to trial store magnitude
Step 5 β†’ Apply t-tests with 95% Confidence Intervals (Feb–Apr 2019)
Step 6 β†’ Compare trial store performance vs CI bands

πŸ“Š Results Summary

Trial Store Control Store Sales Significant Customers Significant
Store 77 Store 233 βœ… Mar & Apr (t=7.3, 12.5) βœ… Mar & Apr (t=13.5, 30.8)
Store 86 Store 155 βœ… Mar & Apr βœ… All 3 months (t=11.8, 20.9, 5.7)
Store 88 Store 237 βœ… Mar & Apr (t=6.6, 5.8) βœ… Mar & Apr (t=17.9, 9.8)

πŸ”‘ Key Insight: Primary driver of uplift = more customers visiting β€” not higher spend per visit.

πŸ“Έ Task 2 Visualizations

Store 77 Sales Check Store 77 Customers Check Store 77 Trial Sales Store 77 Trial Customers

Store 86 Sales Check Store 86 Customers Check Store 86 Trial Sales Store 86 Trial Customers

Store 88 Sales Check Store 88 Customers Check Store 88 Trial Sales Store 88 Trial Customers


πŸ“‹ Task 3 β€” Analytics & Commercial Application

Pyramid Principle Structure

πŸ”Ί CONCLUSION FIRST
└── Trial layout drives significant uplift β†’ Recommend full rollout
    β”‚
    β”œβ”€β”€ πŸ“Š EVIDENCE 1 β€” Task 1 Insights
    β”‚     Customer segments, brand dominance, pack size preferences
    β”‚
    β”œβ”€β”€ πŸ§ͺ EVIDENCE 2 β€” Task 2 Results
    β”‚     Statistically significant uplift in all 3 trial stores
    β”‚
    └── πŸ’‘ RECOMMENDATIONS β€” 4 Next Steps for H2

4 Strategic Recommendations

# Recommendation Data Behind It
1️⃣ Target Older & Young Families Highest spend ($33-34) + frequency (4.5-4.7x/year)
2️⃣ Protect Kettle's Shelf Position 21.6% revenue β€” eye-level + December push
3️⃣ Anchor on 175g Pack 26.9% of all sales β€” use as promo anchor SKU
4️⃣ Roll Out Trial Format Significant uplift confirmed β€” target 10-15 stores

πŸ› οΈ Tech Stack

Python Pandas NumPy SciPy Matplotlib Seaborn Jupyter


πŸ’Ό Skills Demonstrated

Analytics Statistics Business
βœ… End-to-end EDA βœ… Hypothesis Testing (t-test) βœ… Pyramid Principle Reporting
βœ… Customer Segmentation βœ… 95% Confidence Intervals βœ… Commercial Recommendations
βœ… Feature Engineering βœ… Pearson Correlation Matching βœ… Stakeholder Communication
βœ… Data Visualisation βœ… Uplift Testing βœ… KPI Definition

πŸ† Certificate of Completion

Field Details
πŸ… Certificate Data Analytics Job Simulation
🏒 Issued By Quantium via Forage
πŸ“… Completed March 22nd, 2026
πŸ”‘ Credential ID 9SZbzY89Z5gHe2s4e
πŸ‘€ User Verification hbKXAXgYSyRv56eS5
πŸ”— View Certificate Click Here

πŸ‘¨β€πŸ’» Author

Devesh Shukla Data Analyst | Data Scientist | Builder

LinkedIn GitHub Email


⭐ If you find this useful, please give it a star! ⭐

Footer

About

πŸ“Š Quantium x Forage | Customer Segmentation β€’ Pearson Correlation Matching β€’ Uplift Testing on 3 Trial Stores | 245K transactions β€’ 71K customers β€’ $1.8M revenue | Python | SciPy | Pandas βœ…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors