multi class classification
Body performance analysis dataset - Kaggle. This is data that confirmed the grade of performance with age and some exercise performance data.
-
- look at data
- set proper data types / get dummies
- fill missing data
- create train / test split
-
Feature selection & regularization
- remove unnecessary / collinear features
- scale data
- search outliers
- create baseline models
-
- create pipelines
- run optuna
- choose best model
-
- test model in order to baseline model
- calculate model scores
data shape : (13393, 12)
- age : 20 ~64
- gender : F,M
- height_cm : (If you want to convert to feet, divide by 30.48)
- weight_kg
- body fat_%
- diastolic : diastolic blood pressure (min)
- systolic : systolic blood pressure (min)
- gripForce
- sit and bend forward_cm
- sit-ups counts
- broad jump_cm
- class : A,B,C,D (A: best) / stratified
Korea Sports Promotion Foundation Some post-processing and filtering has done from the raw data.