Imbalanced Data Classification Based on Scaling Kernel-Based Support Vector Machine

Yong Zhang,Panpan Fu,Wenzhe Liu,Guolong Chen
DOI: https://doi.org/10.1007/s00521-014-1584-2
2014-01-01
Neural Computing and Applications
Abstract:In many classification problems, the class distribution is imbalanced. Learning from the imbalance data is a remarkable challenge in the knowledge discovery and data mining field. In this paper, we propose a scaling kernel-based support vector machine (SVM) approach to deal with the multi-class imbalanced data classification problem. We first use standard SVM algorithm to gain an approximate hyperplane. Then, we present a scaling kernel function and calculate its parameters using the chi-square test and weighting factors. Experimental results on KEEL data sets show the proposed algorithm can resolve the classifier performance degradation problem due to data skewed distribution and has a good generalization.
What problem does this paper attempt to address?