Feature Selection For Classifying Data Stream Based On Maximum Entropy

Yao-Zong Liu,Yong-Li Wang,Wei Wei,Hong Zhang
DOI: https://doi.org/10.1109/CCPR.2009.5344111
2009-01-01
Abstract:Feature select ion is an important problem in the fields of machine learning and pat tern recognition. Data stream data classification with high dimensional and sparse, and the dimension of the need for compression, feature selection methods suitable for data stream classification study of very value of this area is currently a lack of in-depth study. This paper summarizes the current data flow classification feature selection research, analysis of the characteristics of different methods. Based on the principle of maximum entropy, naive Bayes with the technology on the data stream tuple feature selection attributes, divided into two different subsets of the merits, so as to enhance the work of C4.5 classifier results, the experiment proved not only StrearnMEFS classification of time-saving, but also to improve the quality of the classification.
What problem does this paper attempt to address?