An Efficient Feature Selection Method Using Named Entity Recognition for Chinese Text Categorization

Bin Liu,Chunping Li
DOI: https://doi.org/10.1109/icmlc.2009.5212749
2009-01-01
Abstract:Feature selection is an important task for text categorization. Traditional feature selection methods are based on terms but they may lose some useful information in texts. In this paper, we present a feature selection method that considers not only general terms but also named entities. Corresponding to our feature selection method, we propose a term weighting scheme for named entities. The experiments show that our method is effective comparing with traditional methods.
What problem does this paper attempt to address?