Improving the performance of data stream classifiers by mining recurring contexts

Yong Wang, Zhanhuai Li, Yang Zhang, Longbo Zhang, Yun Jiang
DOI: https://doi.org/10.1007/11811305_119
2006-01-01
Abstract: Traditional research on data stream mining emphasizes only building classifiers with high accuracy, which results in classifiers whose accuracy drops dramatically when the concept drifts. In this paper, we present our RTRC system, which achieves good classification accuracy under concept drift once enough samples of the data stream have been scanned. By using a Markov chain and the least-squares method, the system is able to predict not only which concept will come next but also when the concept will drift. Experimental results confirm the advantages of our system over Weighted Bagging and CVFDT, two representative systems in streaming data mining.
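
The abstract does not describe how RTRC combines the two techniques, so the following is only a minimal sketch of one plausible reading, not the paper's algorithm. It assumes that past concepts have already been identified and labeled, that a first-order Markov chain over those labels predicts the next concept, and that a least-squares line fitted to past concept durations predicts how long the current concept will last. The names `next_concept`, `fit_line`, and `next_duration`, as well as the sample data, are hypothetical.

```python
from collections import Counter, defaultdict


def next_concept(history):
    """Predict the next concept label with a first-order Markov chain.

    Assumption: `history` is the sequence of concept labels seen so far,
    in the order in which they occurred.
    """
    transitions = defaultdict(Counter)
    for prev, nxt in zip(history, history[1:]):
        transitions[prev][nxt] += 1
    successors = transitions[history[-1]]
    if not successors:
        return None  # the latest concept has never been followed by another
    # The most frequent successor is the maximum-likelihood prediction.
    return successors.most_common(1)[0][0]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a * x + b; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b


def next_duration(durations):
    """Extrapolate the length of the next concept from past lengths.

    Assumption: `durations` holds the number of samples each past
    concept lasted, and the trend is roughly linear.
    """
    xs = list(range(len(durations)))
    a, b = fit_line(xs, durations)
    return a * len(durations) + b


# Hypothetical data: three concepts recurring in a fixed cycle.
history = ["A", "B", "C", "A", "B", "C", "A"]
durations = [100, 110, 120, 130]
predicted_label = next_concept(history)
predicted_length = next_duration(durations)
```

With this toy history, every occurrence of concept A was followed by B, so the chain predicts B; the durations grow by 10 samples per concept, so the fitted line extrapolates to 140 samples.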