Skip to content

srsimsima81-cloud/Employee-Performance-Predictor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

👨‍💼 Employee Performance Predictor using Data Analytics

📌 Overview

A machine learning-based HR analytics system that predicts employee performance using historical and simulated workforce data.

The project helps organizations:

  • Identify high-performing employees
  • Detect underperformers early
  • Support HR decision-making
  • Improve workforce planning

🎯 Objective

To build an end-to-end machine learning pipeline that:

  • Analyzes employee data
  • Trains a predictive model
  • Classifies employee performance
  • Visualizes key insights through an interactive dashboard

🧠 Problem Statement

Organizations often struggle to evaluate employee performance objectively.

This system solves this by using data-driven predictions instead of manual evaluation methods.


🛠️ Tech Stack

  • Programming Language: Python
  • Frontend (UI): Streamlit
  • Machine Learning: Scikit-learn
  • Data Handling: Pandas, NumPy
  • Visualization: Matplotlib
  • Model: Random Forest Classifier
  • Serialization: Joblib

📁 Project Structure

Employee-Performance-Predictor/
│
├── data/            # Dataset (CSV files)
├── models/         # Trained ML model (.pkl)
├── images/         # Generated visualizations
├── notebooks/      # Jupyter notebooks (EDA & analysis)
│
├── app.py          # Streamlit web application
├── main.py         # Model training script
├── requirements.txt
└── README.md

⚙️ Installation Guide

1️⃣ Clone the repository

git clone https://github.com/your-username/employee-performance-predictor.git
cd employee-performance-predictor

2️⃣ Create virtual environment

python -m venv venv

3️⃣ Activate environment

Windows

venv\Scripts\activate

Mac/Linux

source venv/bin/activate

4️⃣ Install dependencies

pip install -r requirements.txt

🚀 How to Run the Project

▶️ Run Streamlit Web App

streamlit run app.py

📊 Run Model Training (optional)

python main.py

📊 Features

  • Upload employee dataset (CSV)
  • Automatic data preprocessing
  • Label encoding for categorical variables
  • Machine learning model training (Random Forest)
  • Performance prediction (High / Medium / Low)
  • Model evaluation (Accuracy, Confusion Matrix)
  • Feature importance visualization
  • Interactive Streamlit dashboard

📈 Model Workflow

Data Collection
      ↓
Data Preprocessing
      ↓
Feature Engineering
      ↓
Model Training (Random Forest)
      ↓
Evaluation
      ↓
Prediction + Visualization

📷 Outputs

  • Confusion Matrix Visualization
  • Feature Importance Graph
  • Prediction Interface (Streamlit UI)

🧪 Sample Input Features

  • Age
  • Experience
  • Salary
  • Training Hours
  • Department

📌 Results

  • Achieved high classification accuracy on synthetic dataset
  • Identified key factors influencing employee performance
  • Built an interactive HR analytics dashboard

🔮 Future Improvements

  • Integration with real HR datasets
  • Employee attrition prediction
  • Cloud deployment (AWS / Render / Streamlit Cloud)
  • Authentication system for HR users
  • Improved UI dashboards

👨‍💻 Author

Developed as a Data Science + Machine Learning portfolio project Focused on HR analytics and business decision support systems

About

An end-to-end machine learning project that predicts employee performance using HR data with a Random Forest model and full data preprocessing pipeline. It includes an interactive Streamlit dashboard for real-time predictions and visualization of model results.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors