Feature selection for text categorization using filtering and wrapping

Wei Wang, Qinghua Zheng
2006-01-01
Journal of Computational Information Systems
Abstract: After analyzing the advantages and disadvantages of the filtering and wrapping approaches to feature selection, a hybrid approach named filtering-wrapping feature selection (FWFS) is proposed. This approach treats feature selection as a sequential forward selection procedure. It uses information gain to evaluate each feature's relevance to the target class and mutual information to evaluate redundancy among features. During the search, a candidate set of individually discriminating and weakly dependent features is added to the Selecting Set. The actual classifier is then used as a 'black box' to evaluate the fitness of the Selecting Set, with the micro-averaged F1 measure used to evaluate classification performance. The search stops when the micro-averaged F1 measure no longer improves. Experiments on two different corpora show that the proposed approach performs better than traditional feature selection approaches.
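The procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `fwfs` and `make_lookup_evaluator`, the `batch_size` and `max_redundancy` parameters, the toy pattern-lookup classifier, and the toy data are all hypothetical. The abstract does not say how large the candidate set is or how "weakly dependent" is thresholded. Information gain is computed as the mutual information between a feature and the class label, which is the same quantity.

```python
from collections import Counter
from math import log2

def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum(c / n * log2(c / n) for c in Counter(values).values())

def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y); equals information gain when ys is the class."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))

def micro_f1(y_true, y_pred):
    """Micro-averaged F1 for single-label data: TP, FP, FN pooled over all classes."""
    tp = sum(t == p for t, p in zip(y_true, y_pred))
    fp = fn = len(y_true) - tp  # each wrong prediction is one FP and one FN
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def fwfs(X, y, evaluate, batch_size=1, max_redundancy=0.5):
    """Hybrid filter/wrapper forward selection (illustrative sketch).

    batch_size and max_redundancy are assumed knobs, not values from the paper.
    evaluate(feature_indices) is the black-box classifier score.
    """
    n_features = len(X[0])
    columns = [[row[j] for row in X] for j in range(n_features)]
    # Filter step: rank features by information gain with respect to the class.
    relevance = [mutual_information(col, y) for col in columns]
    remaining = sorted(range(n_features), key=lambda j: relevance[j], reverse=True)
    selected, best_score = [], 0.0
    while remaining:
        # Pick the most relevant features that are weakly dependent on those kept.
        candidates = []
        for j in remaining:
            if len(candidates) == batch_size:
                break
            kept = selected + candidates
            if all(mutual_information(columns[j], columns[k]) <= max_redundancy
                   for k in kept):
                candidates.append(j)
        if not candidates:
            break
        # Wrapper step: score the enlarged set with the black-box classifier.
        score = evaluate(selected + candidates)
        if score <= best_score:
            break  # no improvement in micro-averaged F1: stop the search
        selected, best_score = selected + candidates, score
        remaining = [j for j in remaining if j not in candidates]
    return selected, best_score

def make_lookup_evaluator(X_train, y_train, X_val, y_val):
    """Toy stand-in classifier: majority class per observed feature pattern."""
    default = Counter(y_train).most_common(1)[0][0]
    def evaluate(features):
        votes = {}
        for row, label in zip(X_train, y_train):
            key = tuple(row[j] for j in features)
            votes.setdefault(key, Counter())[label] += 1
        preds = []
        for row in X_val:
            key = tuple(row[j] for j in features)
            preds.append(votes[key].most_common(1)[0][0] if key in votes else default)
        return micro_f1(y_val, preds)
    return evaluate

# Toy term-presence data: feature 1 duplicates feature 0, feature 3 is noise.
X_train = [[1, 1, 0, 0], [1, 1, 0, 1], [1, 1, 1, 0], [1, 1, 1, 1],
           [0, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0], [0, 0, 1, 1]]
y_train = ["sports", "sports", "politics", "politics",
           "tech", "tech", "tech", "tech"]
X_val = [[1, 1, 0, 1], [1, 1, 1, 0], [0, 0, 1, 1], [0, 0, 0, 0]]
y_val = ["sports", "politics", "tech", "tech"]

evaluate = make_lookup_evaluator(X_train, y_train, X_val, y_val)
selected, score = fwfs(X_train, y_train, evaluate)
print(selected, score)
```

On this toy data the sketch keeps features 0 and 2. The duplicate feature 1 is rejected by the redundancy check, and adding the noise feature 3 does not raise the micro-averaged F1, so the search stops. In practice the `evaluate` callable would wrap the real text classifier and a held-out split.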