Features selection, data mining and financial risk classification: a comparative study

Salim Lahmiri
DOI: https://doi.org/10.1002/isaf.1395
2016-09-08
Intelligent Systems in Accounting, Finance and Management
Abstract: The aim of this paper is to compare several predictive models that combine features selection techniques with data mining classifiers for credit risk assessment, in terms of accuracy, sensitivity and specificity. The t-statistic, the Bhattacharyya statistic, the area between the receiver operating characteristic, the Wilcoxon statistic, relative entropy, and genetic algorithms were used for the features selection task. The selected features were used to train a support vector machine (SVM) classifier, a backpropagation neural network, a radial basis function neural network, linear discriminant analysis and a naive Bayes classifier. Results from three datasets using 10-fold cross-validation showed that the SVM provides the best accuracy under all features selection techniques adopted in the study, for all three datasets. Therefore, the SVM is an attractive classifier for real applications in bankruptcy prediction in corporate finance and in financial risk management in financial institutions. In addition, we found that our best results are superior to those of earlier studies on the same datasets.
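To make two ingredients of the abstract concrete, here is a minimal sketch of t-statistic feature ranking and of the three evaluation statistics. It is an illustration under stated assumptions, not the paper's implementation: the Welch (unequal-variance) form of the t-statistic, the convention that label 1 is the positive (high-risk) class, the function names, and the toy data are all hypothetical choices made for this example.

```python
from math import sqrt
from statistics import mean, variance


def t_statistic(pos, neg):
    """Absolute Welch two-sample t-statistic for one feature (assumed variant)."""
    se = sqrt(variance(pos) / len(pos) + variance(neg) / len(neg))
    return abs(mean(pos) - mean(neg)) / se


def rank_features(X, y):
    """Return feature indices sorted from most to least discriminative."""
    scores = []
    for j in range(len(X[0])):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(t_statistic(pos, neg))
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity (true positive rate), specificity (true negative rate)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy data, made up for illustration: feature 0 separates the classes, feature 1 does not.
X = [[1.0, 5.0], [1.2, 3.0], [0.9, 4.0], [3.0, 4.5], [3.2, 3.5], [2.9, 4.0]]
y = [0, 0, 0, 1, 1, 1]
ranking = rank_features(X, y)

# Hypothetical classifier output scored against the true labels.
metrics = classification_metrics([1, 1, 1, 0, 0, 0], [1, 1, 0, 0, 0, 1])
```

In a comparison like the one described, the top-ranked features would be passed to each classifier, and the three statistics would be averaged over the 10 cross-validation folds.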