A Heterogeneous Ensemble Credit Scoring Model Based on Adaptive Classifier Selection: an Application on Imbalanced Data

Tong Zhang,Guotai Chi
DOI: https://doi.org/10.1002/ijfe.2019
2020-01-01
International Journal of Finance & Economics
Abstract:In the domain of credit scoring, the number of bad clients is far less than that of good ones. So imbalanced data classification is a realisitc and critical issue in the credit scoring process. In this study, a novel heterogeneous ensemble credit scoring model is proposed for the problem of imbalanced data classification. This proposed model is on basis of five standard classifiers, namely LSVM, KNN, MDA, DT, LR, and adaptively selects the base classifiers with highest AUC according to the data distribution, then integrates all base classifiers to obtain a prediction. Finally, by using five comprehensive performance measures and four classical credit datasets, we find that the proposed model is better than other baseline models. This novel model can be applied to actual credit scoring and assist financial institutions in credit risk management.
What problem does this paper attempt to address?