Skip to content

ASWD13/ML-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Employee Attrition Prediction and Estimated Loss

This project focuses on predicting employee attrition using machine learning and estimating the potential financial loss associated with it.

Project Overview

The project is divided into three main parts:

  1. Attrition Prediction (Classification): This involves building a classification model to predict whether an employee is likely to leave the company.
  2. Future Salary Prediction (Regression): This involves building a regression model to predict the future salaries of employees likely to stay in the company.
  3. Estimated Loss Calculation: This part estimates the potential financial loss associated with employee attrition.

Data

The project uses the "IBM HR dataset" containing information about employee demographics, job satisfaction, performance, and attrition status.

Methodology

Attrition Prediction (Classification):

  1. A Voting Classifier ensemble is trained on preprocessed data to predict attrition.

Future Salary Prediction (Regression):

  1. A Voting Regressor ensemble is trained on preprocessed data to predict future salaries of employees likely to stay.

Estimated Loss Calculation:

  1. The expected loss for each employee is calculated and aggregated to estimate the total financial impact of attrition.

Libraries Used

  • pandas
  • numpy
  • scikit-learn (including SMOTE, GridSearchCV, and various models)
  • xgboost
  • imblearn
  • seaborn
  • matplotlib

Authors

  • Amancharla Anirudh
  • Varshneya Kolla

Conclusion

This project demonstrates the application of machine learning for predicting employee attrition and estimating the associated financial loss. It can be valuable for businesses looking to identify employees at risk and plan accordingly.

How to Run

  1. Upload the Dataset
  2. Import Libraries
  3. Load the Dataset
  4. Run the Code Execute the rest of the code in the notebook, including the data preprocessing, model training, evaluation, and estimated loss calculation steps. ( Use google colab or jupyter notebook)

About

This was done as part of my university's machine learning course

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors