An Ensemble Classifier for Mining Imbalanced Data Streams with Noise

Zhen-zheng OUYANG,Zi-jin TAO,Jian-yu CAI,Quan-yuan WU
DOI: https://doi.org/10.3969/j.issn.1007-130X.2011.12.018
2011-01-01
Abstract:Many real world data streams mining applications involve learning from imbalanced data streams,where such applications expect to have a higher predictive accuracy over the minority class,however most classification models assume relatively balanced data streams,and they cannot handle imbalanced distribution.In this paper,we propose a novel ensemble classifier framework(IMDAP) for mining concept-drifting and noisy data streams with imbalanced distribution by using an averaged probability ensemble framework and sampling technique.Our empirical study shows that the IMDAP is superior and have improves both the capability of the classifier and the accuracy in performing classification over the minority class.
What problem does this paper attempt to address?