Pruning and Undersampling Combination of Imbalanced Data Classification Method

ZHANG Jian,FANG Hong-bin
DOI: https://doi.org/10.3969/j.issn.1001-3695.2012.03.012
2012-01-01
Abstract:This paper proposed pruning and under-sampling combined approaches for selected the representative data as training data to improve the classification accuracy for minority class and investigated the effect of under-sampling methods in the imbalanced class distribution environment.The experimental results show that the accuracy of algorithm of this paper compare with direct undersampling algorithm have increased,the most important is to significantly improve the g-means value.Especially,the effect will be better on the imbalance rate of larger data sets.
What problem does this paper attempt to address?