BOS:a borderline over-sampling method for imbalanced data learning

ZHU Tuan-Fei,SUN Jing,LI Yi-Zhou,LI Meng-Long
DOI: https://doi.org/10.3969/j.issn.0490-6756.2012.03.014
2012-01-01
Abstract:The imbalance data are pervasive in a large number of real-world domains of great importance. Traditional classification learning algorithms behave undesirable in imbalanced problem.To address this problem,the authors proposed a new synthetic minority borderline synthetic over-sampling method named as BOS.In BOS,a novel K generalized Tomek links concept was used to locate minority class borderline instances,and adaptively generating minority instances were implemented base on the number of their K links.Experimental results show that BOS performed better than some existing typical methods, with more excellent F-Measure,G-mean and the area under the ROC(AUC) values.
What problem does this paper attempt to address?