Hybrid SVM algorithm oriented to classifying imbalanced datasets

Dongqi Liu, Zhijian Chen, Yin Xu, Feiteng Li
DOI: https://doi.org/10.3969/j.issn.1001-3695.2018.04.014
2018-01-01
Abstract: To improve the classification accuracy of the traditional support vector machine (SVM) on imbalanced datasets, and to address the classifier's poor performance on the minority class, this paper proposes a hybrid SVM algorithm. It combines the adaptive synthetic sampling (ADASYN) algorithm with the different error costs (DEC) algorithm to correct the hyperplane bias caused by class imbalance, and then introduces a new correction algorithm into the prediction model to improve the model's adaptability to different data characteristics. The proposed algorithm is tested on 7 real-world imbalanced datasets from the UCI database. The experimental results show that the hybrid SVM algorithm matches or surpasses state-of-the-art algorithms on every dataset and improves classification performance by an average of 2.0% to 20.9%, indicating that the proposed algorithm is effective and robust.
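The two building blocks named in the abstract can be illustrated with a minimal, standard-library-only Python sketch. It is not the authors' implementation: the function names (`k_nearest`, `adasyn`, `dec_costs`) are hypothetical, interpolation partners are drawn from the k nearest minority samples (a common simplification of ADASYN), and the cost ratio n_majority / n_minority is one widely used DEC heuristic assumed here, since the abstract does not state the ratio. The paper's correction algorithm is not described in the abstract and is therefore not shown.

```python
import math
import random


def k_nearest(idx, X, k):
    """Indices of the k points closest to X[idx] (Euclidean), excluding idx."""
    dists = sorted((math.dist(X[idx], X[j]), j) for j in range(len(X)) if j != idx)
    return [j for _, j in dists[:k]]


def adasyn(X, y, minority=1, k=3, beta=1.0, seed=0):
    """Return synthetic minority samples, denser where the minority is harder to learn."""
    rng = random.Random(seed)
    min_idx = [i for i, label in enumerate(y) if label == minority]
    n_min = len(min_idx)
    n_maj = len(y) - n_min
    total = int(round((n_maj - n_min) * beta))  # how many samples to synthesise
    # r_i: fraction of majority-class points among the k nearest neighbours.
    ratios = [sum(y[j] != minority for j in k_nearest(i, X, k)) / k for i in min_idx]
    norm = sum(ratios)
    if total <= 0 or norm == 0:
        return []
    synthetic = []
    for i, r in zip(min_idx, ratios):
        g = int(round(r / norm * total))  # this sample's share of the total
        # Assumption: interpolate towards one of the k nearest minority samples.
        partners = sorted((j for j in min_idx if j != i),
                          key=lambda j: math.dist(X[i], X[j]))[:k]
        if not partners:
            continue
        for _ in range(g):
            j = rng.choice(partners)
            lam = rng.random()  # position along the segment between X[i] and X[j]
            synthetic.append([a + lam * (b - a) for a, b in zip(X[i], X[j])])
    return synthetic


def dec_costs(y, minority=1, base_c=1.0):
    """Return (C_minority, C_majority) with the assumed ratio n_majority / n_minority."""
    n_min = sum(label == minority for label in y)
    n_maj = len(y) - n_min
    return base_c * n_maj / n_min, base_c


# Toy data: 8 majority points (label 0) and 3 minority points (label 1).
X = [[0, 0], [0, 1], [1, 0], [1, 1], [2, 0], [2, 1], [0, 2], [1, 2],
     [3, 3], [3, 4], [4, 3]]
y = [0] * 8 + [1] * 3
synthetic = adasyn(X, y)
c_min, c_maj = dec_costs(y)
```

In a hybrid setup of this kind, the synthetic samples would be appended to the training set and the two penalty values passed to an SVM solver that supports per-class costs. Because per-sample counts are rounded, the number of generated samples can differ slightly from the target.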