A novel dynamic ensemble selection classifier for an imbalanced data set: An application for credit risk assessment

Wen-hui Hou, Xiao-kang Wang, Hong-yu Zhang, Jian-qiang Wang, Lin Li
DOI: https://doi.org/10.1016/j.knosys.2020.106462
2020-11-01
Abstract: Credit risk assessment is usually treated as an imbalanced classification task and solved with static ensemble classifiers. The dynamic ensemble selection (DES) strategy, which can select a different ensemble of classifiers for each query sample, is rarely used. The major challenge is the deficiency of existing DES algorithms in dealing with imbalanced data. In this paper, a novel combined DES model is developed for imbalanced learning problems. To handle imbalanced data sets, the synthetic minority over-sampling technique (SMOTE) is first used to balance the training set before the candidate classifier pool is generated; the weighting mechanism of DES-MI (multi-class imbalance) is then used to highlight the importance of minority instances when classifier competence is evaluated. To further ensure a comprehensive evaluation and correct selection of the ensemble, the meta-learning framework of META-DES is used to account for multiple criteria, and the two-step selection strategy of DES-KNN (k-nearest neighbours) is employed to trade off the competence and diversity of the classifiers. Experiments on 15 imbalanced data sets from the KEEL repository show that the proposed model improves on the performance of seven well-known DES algorithms in terms of the area under the curve (AUC). Moreover, the type I error rate of the proposed method is lower than that of XGBoost and LightGBM on a real P2P loan data set, indicating the efficiency of the proposed method for credit risk assessment.
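Two of the mechanisms named in the abstract can be illustrated with a minimal, standard-library-only sketch: SMOTE-style interpolation of minority samples, and a minority-aware competence score computed over a query's nearest neighbours. This is not the authors' implementation. The function names (`smote`, `region_of_competence`, `weighted_competence`, `des_predict`) are hypothetical, the inverse-class-frequency weight is an assumed stand-in for the DES-MI weighting rather than its exact formula, and the META-DES meta-learning step and the DES-KNN diversity step are omitted.

```python
import math
import random

def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points, each interpolated between a minority
    point and one of its k nearest minority neighbours (needs >= 2 points)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(base, p))[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic

def region_of_competence(query, X, y, k=5):
    """Return the k labelled samples closest to the query."""
    idx = sorted(range(len(X)), key=lambda i: math.dist(query, X[i]))[:k]
    return [(X[i], y[i]) for i in idx]

def weighted_competence(clf, region):
    """Weighted local accuracy. ASSUMPTION: each neighbour is weighted by
    1 / (count of its class in the region), so rare classes count more."""
    counts = {}
    for _, label in region:
        counts[label] = counts.get(label, 0) + 1
    weights = [1.0 / counts[label] for _, label in region]
    correct = sum(w for w, (x, label) in zip(weights, region) if clf(x) == label)
    return correct / sum(weights)

def des_predict(query, pool, X, y, k=5, n_select=3):
    """Rank the pool by weighted competence near the query, keep the top
    n_select classifiers, and return their majority vote."""
    region = region_of_competence(query, X, y, k)
    ranked = sorted(pool, key=lambda clf: weighted_competence(clf, region),
                    reverse=True)
    votes = [clf(query) for clf in ranked[:n_select]]
    return max(set(votes), key=votes.count)

# Toy data: four majority samples (label 0) and one minority sample (label 1).
X = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
y = [0, 0, 0, 0, 1]
always_zero = lambda x: 0                       # ignores the minority class
threshold = lambda x: 1 if x[0] > 3 else 0      # separates the toy classes
region = region_of_competence((4.0, 4.0), X, y, k=5)
print(weighted_competence(always_zero, region))
print(des_predict((4.0, 4.0), [always_zero, threshold], X, y, n_select=1))
```

On the toy data, a classifier that always predicts the majority class has a plain local accuracy of 0.8 but a weighted competence of only 0.5, which is the effect the minority-aware weighting is meant to produce. In practice, the candidate classifiers would be trained on the SMOTE-balanced training set rather than written by hand as in this sketch.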
computer science, artificial intelligence