Incorporating Multilevel Macroeconomic Variables into Credit Scoring for Online Consumer Lending

Yufei Xia,Yinguo Li,Lingyun He,Yixin Xu,Yiqun Meng
DOI: https://doi.org/10.1016/j.elerap.2021.101095
IF: 6
2021-01-01
Electronic Commerce Research and Applications
Abstract:Booming online consumer lending encounters high credit risk and thus needs well-designed credit scoring models. As a supplementary data source, multilevel macroeconomic variables (MVs), which means mixed regions, data frequency, and lags of MVs, are combined with application data for credit scoring. Moreover, we propose a Bayesian MV selection and lag optimization method to handle the highly correlated MVs and capture flexible lag effects. A structural model for consumer default behavior is proposed to analyze the potential role of multilevel MVs in theoretically predicting default loans. Validated on a real-world dataset from an anonymous Chinese online consumer lending platform, the empirical results confirm that multilevel MVs are significant determinants of consumer credit risk. The comparison of classification performance further implies the superiority of incorporating multilevel MVs and the proposed MV selection and lag optimization method relative to benchmarks. Gradient boosting decision tree-based approaches provide significantly better performance than industry benchmarks while maintaining interpretability via Shapley additive explanations values.
What problem does this paper attempt to address?