An incremental sparse LS-SVM classification method for imbalanced data sets

Leichen Chen,Zhihua Cai,Shuang Ao
2010-01-01
Abstract:In this paper, a new classification method (ISLS-SVM) for imbalanced data sets is proposed. In this method, the original training data set is constituted by all the minority samples and the same amount of randomly selected majority samples, and the incremental data set is consisted of the rest majority samples. The classifier tests the incremental data and the misclassification samples replace some small value of support vectors with the same class label, then both the training data set and the incremental data set are reconstructed. It stops until the whole incremental data are correct classification or the misclassification data are never changed. A series of experiments on both UCI standard data sets and an engineering data set of coal and gas outburst have shown that the new classification method (ISLS-SVM) performs better under the criterion of F-measure and ROC Area (AUC) than the existing methods of Adaboost, LS-SVM, Tomek Links and SMOTE. © 2010 IADIS.
What problem does this paper attempt to address?