Application of XGBoost in P2P Default Prediction

Zhiqiang Li,Shouyan Li,Zhilong Li,Yixiang Hu,Hanlin Gao
DOI: https://doi.org/10.1088/1742-6596/1871/1/012115
2021-04-01
Journal of Physics: Conference Series
Abstract:Abstract P2P network lending is a “ peer-to-peer ” loan through a third-party Internet platform built by P2P companies. The application of machine learning algorithms to the field of P2P loan default prediction will improve the operating capabilities of the platform, and also effectively regulate the lending market. In this paper, we use Lending Club’s loan data and feature engineering technology to apply the XGBoost algorithm to construct a P2P loan default prediction model, and we choose five performances: accuracy, AUC value, error rate, model robustness, and model run time to compare it with Logistic Regression and Decision Trees. The result shows that the prediction accuracy rate of the XGBoost algorithm is 97.705%, which fits the actual results better, and can effectively control the loss cost caused by model errors. In addition, we also select 10 features that have the greatest impact on loan default rates based on the XGBoost algorithm, and provide a reference for P2P lending platforms.
What problem does this paper attempt to address?