Skip to content

C-dan93/BI-Practice-Datasets-Collection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

BI Practice Datasets Collection

📖 Full Repository Structure

🎯 Purpose

A curated collection of practice datasets designed to build real-world BI skills. Each dataset represents different industries, data challenges, and transformation scenarios you'll encounter in professional environments.

📁 Dataset Categories

📊 Current Datasets

/ecommerce-sales/ - Data Cleaning Fundamentals

  • Focus: Basic transformations, data type fixes, header cleaning
  • Skills: Remove rows/columns, fix data types, handle missing values
  • Industry: E-commerce/Retail
  • Difficulty: ⭐⭐☆☆☆ Beginner

/financial-quarterly/ - Pivot & Unpivot Mastery

  • Focus: Reshaping data from pivot table format
  • Skills: Unpivot columns, date handling, aggregations
  • Industry: Finance/Accounting
  • Difficulty: ⭐⭐⭐☆☆ Intermediate

/hr-employee-data/ - Relationship Building

  • Focus: Multiple tables, foreign keys, data modeling
  • Skills: Table relationships, calculated columns, DAX basics
  • Industry: Human Resources
  • Difficulty: ⭐⭐⭐⭐☆ Advanced

🔮 Planned Datasets

  • Manufacturing Quality Control (Statistical analysis, control charts)
  • Marketing Campaign Analytics (Attribution modeling, funnel analysis)
  • Healthcare Patient Records (Time-series, compliance considerations)
  • Supply Chain Logistics (Geospatial data, optimization scenarios)
  • Social Media Analytics (Unstructured data, sentiment analysis)

🛠️ Each Dataset Includes

/dataset-name/
├── 📄 README.md              # Dataset overview & learning objectives
├── 🐍 generate_data.py       # Python script to create the dataset
├── 📊 sample_data.xlsx       # Pre-generated Excel file
├── 🎯 practice_guide.md      # Step-by-step transformation exercises
├── ✅ solution_pbix.pbix     # Completed Power BI file (reference)
└── 📝 learning_notes.md      # Key insights and common mistakes

🎓 Progressive Learning Path

Level 1: Data Cleaning Basics

  1. E-commerce Sales → Master fundamental transformations
  2. Customer Database → Handle missing data and duplicates

Level 2: Data Reshaping
3. Financial Quarterly → Unpivot and date handling 4. Survey Responses → Wide to long format conversion

Level 3: Multi-Table Modeling 5. HR Employee System → Relationships and lookups 6. Inventory Management → Complex joins and calculations

Level 4: Advanced Analytics 7. Manufacturing QC → Statistical functions and alerts 8. Marketing Attribution → Advanced DAX and measures

🚀 Quick Start Guide

# 1. Clone the repository
git clone https://github.com/yourusername/bi-practice-datasets

# 2. Choose a dataset based on your skill level
cd ecommerce-sales/  # Start here for beginners

# 3. Generate fresh practice data
python generate_data.py

# 4. Follow the practice guide
# Open practice_guide.md for step-by-step instructions

# 5. Import into Power BI and practice!

💡 How to Use This Collection

For Structured Learning:

  • Follow the progressive path from Level 1 → Level 4
  • Complete practice guides before checking solutions
  • Document your own insights in learning notes

For Targeted Practice:

  • Need to practice unpivoting? → Jump to Financial Quarterly
  • Want to work on relationships? → Try HR Employee Data
  • Preparing for an interview? → Pick datasets matching the industry

For Content Creation:

  • Each dataset comes with reusable scenarios
  • Perfect for tutorials, workshops, or training materials
  • All datasets designed to highlight specific BI concepts

🎯 Perfect For

  • Self-taught analysts building practical BI skills
  • Students practicing Power BI transformations
  • Instructors teaching data modeling concepts
  • Professionals preparing for BI roles or interviews
  • Tutorial creators needing realistic practice scenarios

🤝 Contributing

Have an industry scenario that should be included? Submit:

  • Dataset description and learning objectives
  • Sample messy data that needs transformation
  • Key skills this dataset would teach

📈 Dataset Request Process

Missing a specific industry or skill area? Open an issue with:

  • Industry/Domain: What field should this represent?
  • Skills Focus: What BI concepts should it teach?
  • Difficulty Level: Beginner to Advanced?
  • Special Requirements: Any unique data challenges?

🔗 Connect & Learn Together

This collection grows based on real learning needs. Follow my BI learning journey:


"The best way to master BI tools is through deliberate practice with realistic, challenging data. This collection ensures you always have the right dataset for your current learning goal."

📊 Repository Stats

Dataset Count Skill Levels Industries

About

Collection of realistic, challenging datasets for practicing Power BI and business intelligence skills. Each dataset targets specific transformation techniques and real-world data problems.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors