An Imbalanced Learning Method by Combining SMOTE with Center Offset Factor

Dongxia Meng, Yujian Li
DOI: https://doi.org/10.1016/j.asoc.2022.108618
IF: 8.7
2022-01-01
Applied Soft Computing
Abstract: SMOTE is a well-known oversampling method for learning on imbalanced datasets. However, it risks introducing noisy instances and causing overfitting. To improve its performance, this paper proposes SMOTE-COF, an oversampling method that extends SMOTE with a center offset factor. SMOTE-COF first removes noisy samples, then computes the center offset factor to select sparsely distributed minority class samples. SMOTE then generates new minority class samples by combining each selected sample with other minority class instances in the same sub-cluster. Comparative experiments on one simulated dataset and fourteen UCI datasets provide evidence that SMOTE-COF can effectively reduce noisy samples, generate better minority class samples, and improve classification performance on imbalanced datasets.
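For orientation, the interpolation step that SMOTE-COF inherits from plain SMOTE can be sketched as follows. This is a minimal illustration of standard SMOTE only: the abstract does not define the center offset factor, the noise-removal rule, or the sub-clustering, so none of them is reproduced here. The function name `smote_sample` and the parameters `k`, `n_new`, and `seed` are illustrative assumptions, not names from the paper.

```python
import math
import random


def smote_sample(minority, k=3, n_new=5, seed=0):
    """Generate n_new synthetic points by plain SMOTE interpolation.

    Hypothetical helper: covers only the generic SMOTE step, not the
    COF-based selection or noise filtering described in the paper.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the remaining minority samples by Euclidean distance to it.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        # Choose one of the k nearest minority neighbours at random.
        neighbour = rng.choice(others[:k])
        # Place the new point at a random position on the segment
        # joining the base point and the chosen neighbour.
        lam = rng.random()
        synthetic.append(
            tuple(b + lam * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy minority class: the four corners of the unit square.
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
new_points = smote_sample(minority, k=2, n_new=10)
```

Because every synthetic point lies on a segment between two existing minority samples, a noisy base point or neighbour pulls new samples into the wrong region, which is the risk the abstract says SMOTE-COF aims to reduce by filtering noise and restricting neighbours to the same sub-cluster.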