Stochastic Sensitivity Measure-Based Noise Filtering and Oversampling Method for Imbalanced Classification Problems

Jianjun Zhang,Wing Ng
DOI: https://doi.org/10.1109/SMC.2018.00078
2018-01-01
Abstract:Class imbalance problems occur in many real-world applications. Oversampling methods are effective to handle class imbalance issues by replicating or generating new minority samples to rebalance the class distribution. However, current methods directly using all minority samples will also use noisy samples to generate new samples which may lead to more severe class overlapping and introduce more noisy samples. In this work, we propose a stochastic sensitivity measure-based noise filtering and oversampling method, i.e. the SSMNFOS, to improve the robustness of oversampling method with respect to noisy samples. Samples yielding high stochastic sensitivities are identified as noises by a neural network ensemble and will not participate in the oversampling method for rebalancing the class distribution. Comprehensive experimental studies are carried out on ten datasets with five different noise levels to analyze the effectiveness of the proposed method. Experimental results show that the SSMNFOS outperforms state-of-the-art methods with 95% statistical significance.
What problem does this paper attempt to address?