🩺 Diabetes Medical Classification (PyTorch ANN Implementation)

📌 Project Overview

This project develops an Artificial Neural Network (ANN) to classify patients into three medical categories: Normal, Prediabetes, and Diabetes.

The goal is to demonstrate how deep learning can be applied to clinical tabular data to assist in early medical diagnosis. The model processes various physiological metrics (cholesterol, glucose, BMI, etc.) to predict diabetic status with high precision.

📂 Dataset

The dataset used in this project is sourced from Kaggle.

Source: Diabetes Dataset (Kaggle)
Description: Patient clinical records including glycosylated hemoglobin levels, BMI, and blood pressure.
Target Variable: glyhb_cat (0: Normal, 1: Prediabetes, 2: Diabetes).

🧪 Deep Learning Workflow

The notebook follows a rigorous data science pipeline to ensure model reliability:

Exploratory Data Analysis (EDA): Detailed visualization of clinical variables using histograms to detect distribution patterns.
Missing Value Management:
- Mean Imputation: Applied to normally distributed features like blood pressure and cholesterol.
- Median Imputation: Applied to skewed features to maintain robust statistical integrity.
Advanced Preprocessing:
- Anti-Leakage Design: The glyhb column is dropped from the features ($X$) because the target ($y$) is derived from it.
- Robust Scaling: Utilization of RobustScaler to handle outliers effectively.
- Label Encoding: Converting text labels into integers (0, 1, 2) for PyTorch compatibility.
ANN Architecture:
- Input Layer: 14 nodes.
- Hidden Layers: Two dense layers (12 & 24 neurons) using ReLU activation.
- Output Layer: 3 nodes representing class probabilities.

📊 Evaluation Results

The model is trained for 500 epochs using the Adam Optimizer and Cross-Entropy Loss. The evaluation phase includes:

Loss Curve Plotting: Tracking the convergence of the model.
Accuracy Calculation: Performance metrics calculated over a 20% dedicated test set.

🚀 Getting Started

1. Installation

Clone this repository and install the required Python libraries:

# Clone the repository
git clone [https://github.com/nicolausprima/NN-Classifcation.git](https://github.com/nicolausprima/NN-Classifcation.git)

# Navigate to the project folder
cd diabetes-ann-pytorch

# Install dependencies
pip install torch pandas matplotlib scikit-learn notebook

2. Launch the Project

To view the analysis and run the model training, launch the Jupyter Notebook environment:

# Launch Jupyter Notebook
jupyter notebook NNDiabetes.ipynb

🏁 Conclusion

Through this project, several key insights were gathered regarding medical data classification:

Preprocessing Matters: Medical data often contains outliers; the use of RobustScaler was pivotal in stabilizing the model's learning process.
Integrity in Features: By removing the glyhb column from features, the model demonstrates true predictive power rather than relying on direct indicators, proving it can generalize based on other physiological metrics.
Efficiency of ANN: A relatively simple 3-layer architecture is sufficient to capture the non-linear relationships in tabular clinical data, achieving a high degree of confidence in predicting patient health status.

Created by [Nicolaus Prima Dharma]

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
NNDiabetes.ipynb		NNDiabetes.ipynb
README.md		README.md
diabetes.csv		diabetes.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🩺 Diabetes Medical Classification (PyTorch ANN Implementation)

📌 Project Overview

📂 Dataset

🧪 Deep Learning Workflow

📊 Evaluation Results

🚀 Getting Started

1. Installation

2. Launch the Project

🏁 Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🩺 Diabetes Medical Classification (PyTorch ANN Implementation)

📌 Project Overview

📂 Dataset

🧪 Deep Learning Workflow

📊 Evaluation Results

🚀 Getting Started

1. Installation

2. Launch the Project

🏁 Conclusion

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages