New Approach to Feature Selection for Text Categorization Using Class Correlation

LIN Shao-bo,YANG Dan,XU Ling
DOI: https://doi.org/10.3969/j.issn.1001-3695.2012.05.021
2012-01-01
Abstract:This paper proposed a new approach of feature selection for text categorization,which was based on the strong class correlation and positive class correlation,named SP.SP could eliminate the effect of negative and poor correlation feature effectly.SP discriminated between the positive feature and the negative feature by positive correlation factor,and eliminated the effect of negative feature.SP discriminated between the strong class correlation of features and the poor class correlation of features by positive class correlation factor,and eliminated the effect of poor correlation feature.SP could select high quality features effectively by combining these two factors.The result of Experiment indicates that the proposed approach has a good performance at categorization and reducing high dimensional feature space.
What problem does this paper attempt to address?