S-SMOTE Method in Class Imbalance Data Sets

DONG Xuan, CAI Li-jun
DOI: https://doi.org/10.3969/j.issn.1006-9348.2012.12.042
2012-01-01
Abstract: This paper addresses the problem that classification results on class-imbalanced data sets tend to be biased toward the majority class. An improved variant of SMOTE, called Space-Synthetic Minority Over-sampling Technique (S-SMOTE), is proposed. A super geometry is constructed from each minority-class sample and its k nearest neighbors, and new synthetic samples are generated inside this super geometry. If some of the k nearest neighbors belong to the majority class, the generation space is reduced to avoid introducing noise. The training of minority-class samples that are hard to classify is strengthened, and the validity of the synthetic samples is then confirmed. Experiments on real data sets show that the method performs better than SMOTE in classification performance on both the minority class and the whole data set.
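The abstract does not give the exact construction, so the following is only a minimal sketch of the general idea, under stated assumptions: the "super geometry" is taken to be the convex hull of a minority sample and its k nearest neighbors, majority-class neighbors are simply dropped to shrink the generation space, and synthetic points are drawn as random convex combinations of the remaining vertices. The function names `nearest_neighbors` and `synthesize` are hypothetical, and the strengthening of hard-to-classify samples and the validity check mentioned in the abstract are not modeled here.

```python
import math
import random


def nearest_neighbors(index, points, k):
    """Indices of the k points closest (Euclidean) to points[index], excluding itself."""
    others = [i for i in range(len(points)) if i != index]
    others.sort(key=lambda i: math.dist(points[index], points[i]))
    return others[:k]


def synthesize(index, points, labels, k, minority_label, rng):
    """Draw one synthetic point near the minority sample points[index].

    Assumption (not from the paper): the generation region is the convex
    hull of the sample and those of its k nearest neighbors that belong
    to the minority class; majority-class neighbors are discarded.
    """
    neighbors = nearest_neighbors(index, points, k)
    vertices = [points[index]]
    vertices += [points[i] for i in neighbors if labels[i] == minority_label]

    # Random convex weights: non-negative and summing to 1, so the
    # resulting point lies inside the convex hull of the vertices.
    raw = [rng.expovariate(1.0) for _ in vertices]
    total = sum(raw)
    weights = [r / total for r in raw]

    dim = len(points[index])
    return [sum(w * v[d] for w, v in zip(weights, vertices)) for d in range(dim)]


if __name__ == "__main__":
    # Toy data: three minority points (label 1) and two majority points (label 0).
    points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [6.0, 5.0]]
    labels = [1, 1, 1, 0, 0]
    rng = random.Random(0)
    print(synthesize(0, points, labels, k=3, minority_label=1, rng=rng))
```

In this toy example the third nearest neighbor of the first point is a majority sample, so it is excluded and every synthetic point falls inside the triangle spanned by the three minority points. Plain SMOTE, by contrast, interpolates only along the line segment between a sample and one chosen neighbor.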