Optimizing Bank Customer Churn Prediction with LightGBM and Data-Driven Strategies

Weixuan Wu
DOI: https://doi.org/10.54097/wm3yk853
2024-07-17
Abstract:This study utilizes the LightGBM model to enhance the prediction of bank customer churn. By utilizing Kaggle's comprehensive datasets and feature engineering, to solve the missing value problem, and eliminate the useless data, so that the data becomes unified, centralized, and easy to identify the advantages. Used data visualization for in-depth analysis, an intervention to further narrow down the content and characteristics of the data by manually identifying correlations and characteristics of various aspects of the data and then conducting more precise checks. The use of LightGBM takes full advantage of handling massive datasets, outperforming traditional algorithms such as Random Forest and XGBoost in terms of efficiency and speed. The combination of new features such as age category and account balance further improves the prediction accuracy of the model, and more deeply complete, In conclusion, this study takes an important step in applying machine learning to improve bank customer churn prediction by proposing a model that balances complexity and practicality.
What problem does this paper attempt to address?