Mining Multi-Label Data Streams Using Ensemble-Based Active Learning.

Peng Wang, Peng Zhang, Li Guo
DOI: https://doi.org/10.1137/1.9781611972825.97
2012-01-01
Published in: Proceedings of the 2012 SIAM International Conference on Data Mining (SDM), pp. 1131-1140
ISBN: 978-1-61197-232-0, eISBN: 978-1-61197-282-5
Abstract: Data stream classification has drawn increasing attention from the data mining community in recent years, and a large number of stream classification models have been proposed. However, most existing models focus only on mining single-label data streams, and mining multi-label data streams has not yet been fully addressed. Although some recent work has touched on the multi-label stream mining problem, it does not consider the expensive cost of labeling, which limits its use in real-world applications. To this end, we study the challenging problem of mining multi-label data streams with limited labeling resources. Specifically, we propose an ensemble-based active learning framework to handle the large volume of stream data, the expensive labeling cost, and concept drift in multi-label data streams. Experiments on both synthetic and real-world data sets demonstrate the performance of the proposed method.