Welcome to the Statistical Inference & Modeling repository! 🚀
This project is a comprehensive collection of Jupyter Notebooks dedicated to the world of data-driven decision-making,
covering everything from fundamental goodness-of-fit tests to advanced predictive modeling and diagnostic techniques.
This repository is structured as a week-by-week journey through statistical modeling:
- Weeks 1 & 2: Foundations 🧱
- Goodness of Fit and initial data exploration.
- Week 3: Decisions & Predictions 🔮
- Week 4: The Curves of Reality 📈
- Week 5 & 6: Complex Relationships 🔗
- Deep dives into Multiple Linear Regression.
- Week 7: Advanced Diagnostics 🔬
- Tackling Multicollinearity, VIF, and ANOVA.
If you're looking for the most sophisticated techniques in this project, check out these notebooks:
Explore how to detect and handle redundant predictors using:
- VIF (Variance Inflation Factor)
- Klein’s Rule of Thumb
- ANOVA Tables for model comparison.
Master the art of fitting data when straight lines aren't enough:
- Advanced diagnostic plots (Residuals vs. Fitted, QQ Plots).
- Outlier detection and handling.
- Logarithmic and polynomial transformations.
Learn to model multiple predictors simultaneously and evaluate their combined significance using F-statistics and analysis of variance.
- Python 3.12+
uvinstalled (recommended for dependency management)
To set up the environment and install all dependencies:
uv syncOnce installed, you can launch the notebooks using your favorite editor (VS Code, JupyterLab, etc.) and start exploring!