Active Learning With Drifted Data Streams Using Time-related Weight Classifier Ensemble Framework

Jicheng Shan,Chenxi Chu,Weike Liu,Chaofan Dai,Qingbao Liu
2017-01-01
Abstract:In learning to classify data streams, labeling all data is considered expensive and impractical. Active learning focuses on labeling a small portion of data for learning a model to predict future instances as accurately as possible. Active learning over streaming data poses additional challenges for its increasing volumes and concept drifts. We propose a new weighted classifier ensemble framework for active learning with drifted data streams using combination labeling strategies. The ensemble classifier consists of a long stable classifier built since beginning and several recent base classifiers built from recent chunks. According to a combination strategy of uncertainty strategy and random strategy, instances in the chunk are selected to be labeled for the updating of the stable classifier and the newest base classifier. When dealing the new chunk, base classifiers update their weights based on the time stamp and the prediction accuracy for new labeled instances. Experimental results on synthetic and real-world data demonstrate the performance of the proposed work in comparison with other approaches.
What problem does this paper attempt to address?