Expediting the Accuracy-Improving Process of SVMs for Class Imbalance Learning

Bin Cao, Yuqi Liu, Chenyu Hou, Jing Fan, Baihua Zheng, Jianwei Yin
DOI: https://doi.org/10.1109/tkde.2020.2974949
IF: 9.235
2020-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: To improve the classification performance of support vector machines (SVMs) on imbalanced datasets, cost-sensitive learning methods have been proposed, e.g., Different Error Costs (DEC) and Fuzzy SVM for Class Imbalance Learning (FSVM-CIL). They relocate the hyperplane by adjusting the costs associated with misclassifying samples. However, the error costs are determined either empirically or by performing an exhaustive search in the parameter space. Neither strategy can guarantee effectiveness and efficiency simultaneously. In this paper, we propose ATEC, a solution that can efficiently find a preferable hyperplane by automatically tuning the error cost for between-class samples. ATEC distinguishes itself from all existing parameter tuning strategies by two main features: (1) it can evaluate how effective an error cost is in terms of classification accuracy; and (2) it changes the error cost in the right direction if it is not effective. Extensive experiments show that, compared with state-of-the-art methods, SVMs equipped with ATEC not only obtain comparable improvements in the F1 score of the minority class, the area under the precision-recall curve (AUC-PR), and the area under the ROC curve (AUC-ROC), but also outperform the grid-search parameter tuning strategy by two orders of magnitude in training time when a high F1 score is required.
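For readers unfamiliar with cost-sensitive SVMs, the DEC idea named in the abstract is commonly written as the soft-margin objective below, with separate error costs for the two classes. The abstract itself gives no formula, so the notation here ($C^{+}$, $C^{-}$, $\xi_i$, $w$, $b$) follows the usual textbook convention and is an assumption, not the paper's own notation.

```latex
\min_{w,\,b,\,\xi}\quad
  \frac{1}{2}\lVert w \rVert^{2}
  + C^{+} \sum_{i:\,y_i = +1} \xi_i
  + C^{-} \sum_{i:\,y_i = -1} \xi_i
\qquad \text{s.t.}\quad
  y_i\,(w^{\top} x_i + b) \ge 1 - \xi_i,\quad \xi_i \ge 0 .
```

With the minority class taken as the positive class, raising $C^{+}$ relative to $C^{-}$ penalizes minority misclassifications more heavily, which pushes the hyperplane toward the majority class. The ratio between the two costs is the quantity that is otherwise set empirically or by exhaustive search.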