New Feature Selection Approach (CDF) for Text Categorization

熊忠阳,蒋健,张玉芳
DOI: https://doi.org/10.3724/sp.j.1087.2009.01755
2009-01-01
Journal of Computer Applications
Abstract: Reducing the high dimensionality of feature vectors is an essential part of text categorization. After studying current dimension reduction techniques and analyzing several common feature selection methods, a new feature selection approach, named CDF, is proposed. It jointly takes into account three factors: concentration among classes, distribution within a class, and average frequency within a class. The experiments use K-Nearest Neighbor (KNN) as the evaluation classifier. The experimental results show that the CDF approach is simple and effective, and that it outperforms conventional feature selection methods in dimension reduction.
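The abstract names the three factors that CDF combines but does not give their formulas. The sketch below is therefore only an illustration of how such a score might be assembled, not the paper's definition. The function names (`cdf_like_scores`, `select_top_k`), the per-factor formulas, the choice to multiply the factors and take the maximum over classes, and the sample data are all assumptions made for this example.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus: (tokens, class label) pairs.
SAMPLE_DOCS = [
    (["goal", "goal", "team", "the"], "sports"),
    (["goal", "team", "the"], "sports"),
    (["stock", "market", "the"], "finance"),
    (["stock", "stock", "the"], "finance"),
]

def cdf_like_scores(docs):
    """Score each term by an assumed product of three factors.

    These formulas are illustrative assumptions, not taken from the paper:
      concentration = df(t, c) / df(t)      share of t's documents lying in class c
      distribution  = df(t, c) / N_c        share of class c's documents containing t
      avg_frequency = tf(t, c) / df(t, c)   mean occurrences of t per containing document
    The term score is the largest product over all classes.
    """
    docs_per_class = Counter(label for _, label in docs)
    df = defaultdict(Counter)  # df[term][label]: documents of that class containing term
    tf = defaultdict(Counter)  # tf[term][label]: total occurrences of term in that class
    for tokens, label in docs:
        for term, count in Counter(tokens).items():
            df[term][label] += 1
            tf[term][label] += count

    scores = {}
    for term, per_class in df.items():
        total_df = sum(per_class.values())
        best = 0.0
        for label, class_df in per_class.items():
            concentration = class_df / total_df
            distribution = class_df / docs_per_class[label]
            avg_frequency = tf[term][label] / class_df
            best = max(best, concentration * distribution * avg_frequency)
        scores[term] = best
    return scores

def select_top_k(docs, k):
    """Return the k highest-scoring terms, breaking ties alphabetically."""
    scores = cdf_like_scores(docs)
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]
```

On this toy corpus, class-specific terms such as "goal" and "stock" score higher than "the", which appears in every document of both classes. The reduced vocabulary returned by `select_top_k` could then be used to build the feature vectors passed to a KNN classifier, which is the evaluation setup the abstract describes.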