ML-Train is a comprehensive web-based platform that automates the entire machine learning pipeline, from data preprocessing to model deployment. It's designed to make machine learning accessible to users of all skill levels.
- Automated Data Cleaning
- Missing value handling
- Outlier detection and treatment
- Duplicate removal
- Data type conversion
- Feature Engineering
- Categorical encoding (One-Hot, Label)
- Feature scaling (Standard, Min-Max)
- Polynomial feature generation
- Data Validation
- Format verification
- Data type checking
- Value range validation
- Automated Model Selection
- Task type detection (Classification/Regression)
- Best model recommendation
- Hyperparameter optimization
- Multiple Model Support
-
Regression Models:
- Linear Regression
- Polynomial Regression
- Ridge Regression
- Lasso Regression
- Elastic Net
- Support Vector Regression (SVR)
- Decision Tree Regressor
- Random Forest Regressor
- XGBoost Regressor
- Gradient Boosting Regressor
- AdaBoost Regressor
- CatBoost Regressor
- LightGBM Regressor
-
Classification Models:
- Logistic Regression
- Ridge Classifier
- Support Vector Classifier (SVC)
- Decision Tree Classifier
- Random Forest Classifier
- Gradient Boosting Classifier
- AdaBoost Classifier
- XGBoost Classifier
- LightGBM Classifier
- CatBoost Classifier
- Naive Bayes variants
- K-Nearest Neighbors
-
- Multiple Format Support
- Pickle (.pkl)
- Joblib (.joblib)
- ONNX (.onnx)
- sav (.sav)
- dill (.dill)
- Interactive Prediction Interface
- Dynamic form generation based on model features
- Real-time prediction results
- Support for all model formats
- Prediction Features
- Single instance predictions
- Batch predictions
- Probability scores for classification
- Confidence intervals
- User-friendly Input
- Input validation
- Error handling
- Clear result presentation
- Python 3.8+
- pip (Python package manager)
-
Clone the repository:
git clone https://github.com/yourusername/ml-train.git cd ml-train -
Install dependencies:
pip install -r requirements.txt
-
Run the application:
python app.py
-
Open your browser and navigate to:
http://localhost:5000
- Click "Upload Dataset" on the homepage
- Select your CSV or Excel file
- Choose the target column
- Click "Upload"
- Review the automated preprocessing steps
- Adjust parameters if needed
- Click "Process Data"
- Select model type (Auto/Manual)
- Choose specific model (if manual)
- Set hyperparameters
- Click "Train Model"
- Select desired export format
- Click "Export Model"
- Download the model file
- Navigate to the predictions page
- Select your trained model
- Enter input values for each feature
- Click "Predict" to get results
- View prediction results and probabilities (if available)
- Flask web framework
- RESTful API design
- Modular code structure
- Error handling and logging
- Data validation
- Type conversion
- Missing value handling
- Feature engineering
- Model training
- Evaluation
- Export
- Model serialization
- Format conversion
- Optimization
- Validation
- Export
- Model loading and validation
- Input preprocessing
- Prediction generation
- Result formatting
- Error handling
This project is licensed under the MIT License - see the LICENSE file for details.
- Fork the repository
- Create your feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
For questions and support, please open an issue in the GitHub repository.
βββ app.py # Main application file
βββ requirements.txt # Project dependencies
βββ uploads/ # Directory for uploaded datasets
βββ models/ # Directory for saved models
βββ static/ # Static files
β βββ css/ # CSS stylesheets
β β βββ style.css # Main stylesheet
β βββ js/ # JavaScript files
β βββ main.js # Main JavaScript file
β βββ regression.js
β βββ classification.js
β βββ predictions.js
βββ templates/ # HTML templates
βββ main.html # Homepage
βββ regression.html # Regression model interface
βββ classification.html # Classification model interface
βββ predictions.html # Real-time predictions interface