Grouping-based Oversampling in Kernel Space for Imbalanced Data Classification

Jinjun Ren,Yuping Wang,Yiu-ming Cheung,Xiao-Zhi Gao,Xiaofang Guo
DOI: https://doi.org/10.1016/j.patcog.2022.108992
IF: 8
2022-01-01
Pattern Recognition
Abstract:•We design a new grouping scheme. It can provide not only a theoretical basis for selecting the minority class samples in an oversampling method but also a new explanation for the poor performance of SVM on imbalanced data sets.•We design a new oversampling algorithm for generating the minority class samples, which can effectively reduce the bias of the decision hyperplane obtained on the imbalanced data sets toward the minority class. At the same time, it makes full use of the repeated sample pairs and reduces the risk of overfitting of the classifier trained on the balanced data set.•Extensive experimental results show that the proposed oversampling method outperforms the compared benchmark algorithms.
What problem does this paper attempt to address?