This project analyzes the Seoul Bike Rental dataset to explore factors influencing bike rentals. The goal is to uncover meaningful patterns and relationships between bike rental counts and variables such as weather, time, and holidays.
- Source: Seoul public bike sharing system UC Irvine
- Duration: Hourly data from December 2017
- Key Features:
Rented Bike CountTemperature,Humidity,Wind Speed,Rainfall,Snowfall,Solar RadiationHour,Date,Holiday,Season,Functioning Day
- Explore when and why people rent public bikes.
- Understand how weather and time affect bike demand.
- Generate insights using descriptive statistics and visualizations.
- Apply feature engineering to enrich the dataset.
- Univariate Analysis: Distribution of
Rented Bike Count,Temperature, etc. - Bivariate Analysis: Correlations between
Rented Bike Countand weather variables. - Multivariate Analysis: Heatmap of relationships across all numerical variables.
- Time-of-Day Feature Engineering: New feature categorizing hour into logical periods (e.g., Morning Rush, Evening Rush).
- Scatter plots and box plots to observe rental trends.
- Correlation heatmap to reveal strongest predictors.
- Hourly and daily rental patterns.
- Rentals peak during Evening Rush and Night, likely due to leisure and commuting flexibility.
- Temperature and solar radiation are positively correlated with rentals.
- Humidity, rainfall, and wind speed reduce bike usage.
- Python
- Pandas
- Seaborn & Matplotlib
- Jupyter Notebook
- Clone the repo or open the notebook.
- Install required libraries:
pip install pandas seaborn matplotlib