Cost-sensitive boosted tree for loan evaluation in peer-to-peer lending
Yufei Xia,Chuanzhe Liu,Nana Liu
DOI: https://doi.org/10.1016/j.elerap.2017.06.004
IF: 6
2017-07-01
Electronic Commerce Research and Applications
Abstract:Loan evaluation is an effective tool for credit risk assessment in the peer-to-peer (P2P) lending market. However, most of the traditional loan evaluation models assume a balanced misclassification cost, which is far from business reality and leaves opportunities for investigating the cost-sensitive modelling approaches. Moreover, few existing studies consider the special characteristics of P2P lending. In this paper, we propose a cost-sensitive boosted tree loan evaluation model by incorporating cost-sensitive learning and extreme gradient boosting (XGBoost) to enhance the capability of discriminating potential default borrowers. Therefore, a portfolio allocation model that converts the portfolio optimization problem into an integer linear programming is proposed as a decision support system for unprofessional lenders. Unlike existing studies, which mainly evaluate the model through accuracy-based metrics, this study measures the expected profitability based on the annualized rate of return (ARR) and proposes a new metric (ARR curve) reflecting the ARR over different rejection rates. Two real-world P2P lending datasets are examined, and explanatory data analysis shows that explanatory variables differ in predicting the probability of default and profitability. Experimental results reveal that the area under the ARR curve is not a considerable metric in evaluating models from a profit perspective. Conventional cost-insensitive models may even lead to negative returns, whereas the proposed loan evaluation and portfolio allocation model are the best performing methods over the two datasets in terms of profitability.
business,computer science, information systems, interdisciplinary applications