This repository contains the code and analyses for an advanced data mining project that explores and predicts banking subscription outcomes. The project uses several machine learning algorithms and supporting techniques to preprocess the data, train models, and evaluate their performance.
The dataset contains customer information from a bank's marketing campaign. It has 21 features in total, including age, job, marital status, and education.
- Data Preprocessing: Handling missing values, one-hot encoding, feature scaling.
- Machine Learning Algorithms: Artificial Neural Networks (ANN), Gradient Boosting Machine (GBM), Random Forest.
- Clustering: K-means for exploratory data analysis.
- Dimensionality Reduction: Principal Component Analysis (PCA).
- Visualization: Scatter plots, PairGrids.
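
To illustrate the preprocessing steps listed above (missing-value handling, one-hot encoding, feature scaling), here is a minimal, library-free sketch. It is not the project's actual code, which may well rely on pandas or scikit-learn. The sample rows, their values, and the helper names (`impute_mean`, `standardize`, `one_hot`, `preprocess`) are hypothetical and exist only for this example.

```python
from statistics import mean, pstdev

# Hypothetical sample rows: the column names echo attributes mentioned in
# this README, but the values are made up for illustration.
rows = [
    {"age": 30, "job": "admin", "marital": "married"},
    {"age": 45, "job": "technician", "marital": "single"},
    {"age": None, "job": "admin", "marital": "single"},
    {"age": 60, "job": "retired", "marital": "married"},
]


def impute_mean(rows, column):
    """Replace missing (None) values in a numeric column with the column mean."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = mean(observed)
    return [r[column] if r[column] is not None else fill for r in rows]


def standardize(values):
    """Scale values to zero mean and unit variance (population std)."""
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def one_hot(rows, column):
    """Return (categories, encoded rows) with one 0/1 indicator per category."""
    categories = sorted({r[column] for r in rows})
    encoded = [[1 if r[column] == c else 0 for c in categories] for r in rows]
    return categories, encoded


def preprocess(rows, numeric, categorical):
    """Build a feature-name list and a row-major numeric feature matrix."""
    names, columns = [], []
    for col in numeric:
        names.append(col)
        columns.append(standardize(impute_mean(rows, col)))
    for col in categorical:
        categories, encoded = one_hot(rows, col)
        names.extend(f"{col}={c}" for c in categories)
        # zip(*encoded) turns the per-row indicators into per-category columns.
        columns.extend(list(indicator) for indicator in zip(*encoded))
    # Transpose from column-major back to one feature vector per row.
    matrix = [list(row) for row in zip(*columns)]
    return names, matrix


feature_names, X = preprocess(rows, ["age"], ["job", "marital"])
print(feature_names)
```

Each row of `X` is a numeric feature vector that a model such as a Random Forest or an ANN could consume. In practice, the imputation mean and scaling statistics should be computed on the training split only and then reused for the test split.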
To set up this project, clone this repository and install the required Python packages:
git clone https://github.com/yourusername/AdvancedDataMiningProject.git
cd AdvancedDataMiningProject