Employee Turnover Analysis Using Machine Learning Algorithms

Mahyar Karimi,Kamyar Seyedkazem Viliyani
2024-02-06
Abstract:Employee's knowledge is an organization asset. Turnover may impose apparent and hidden costs and irreparable damages. To overcome and mitigate this risk, employee's condition should be monitored. Due to high complexity of analyzing well-being features, employee's turnover predicting can be delegated to machine learning techniques. In this paper, we discuss employee's attrition rate. Three different supervised learning algorithms comprising AdaBoost, SVM and RandomForest are used to benchmark employee attrition accuracy. Attained models can help out at establishing predictive analytics.
Machine Learning
What problem does this paper attempt to address?
The paper primarily focuses on the issue of employee turnover prediction. The authors utilize machine learning algorithms to build predictive models to help companies forecast the likelihood of employee departures and take measures to reduce turnover, thereby minimizing the associated cost losses. Specifically, the paper discusses the following points: 1. **Problem Background**: Employees' knowledge is a valuable asset to companies, and employee turnover can lead to both direct and indirect cost losses, including the costs of recruiting new employees, training costs, and decreased productivity. Therefore, monitoring employee status and predicting turnover is very important. 2. **Research Methods**: The paper benchmarks three supervised learning algorithms (AdaBoost, Support Vector Machine SVM with linear kernel and radial basis function kernel, and Random Forest) on an employee turnover dataset. The dataset contains 30 features; the original dataset had 35 features, but 5 irrelevant or non-generalizable features were removed to reduce computational complexity and avoid overfitting. 3. **Feature Analysis**: The paper also conducts correlation analysis between features, such as the relationship between age and job level, job satisfaction and turnover rate, etc., to better understand which factors might influence employees' decisions to leave. 4. **Experimental Results**: In the experimental section, the AdaBoost algorithm achieved the highest accuracy in most cases, followed by SVM with a linear kernel, and Random Forest also performed quite well. AdaBoost had the largest area under the Receiver Operating Characteristic (ROC) curve, indicating the highest probability of successfully distinguishing and labeling instances. In summary, this paper provides valuable tools for companies by applying machine learning techniques to predict employee turnover, helping them better manage human resources and reduce unnecessary costs.