A Text Feature Selection Method Based on TongYiCi CiLin

ZHENG Yan-hong,ZHANG Dong-zhan
2012-01-01
Abstract:Feature selection is one of important problems in text categorization,machine learning and pattern recognition.In particular,with the rapid development of network and cloud computing,the massive data analysis methods are vitally important.Feature selection can reduce high dimension data′s feature dimension under the condition of ensuring data integrity and classification accuracy.Previously proposed feature selection method based on TongYiCi CiLin can effectively avoid the eigenvalue repetitive in concept,but they did′t consider about that subset composed by the optimal weight of feature vectors may not the best one.To solve this problem,this article combine the TongYiCi and Genetic Algorithm,proposed a text feature selection method based on TongYiCi CiLin.The experiment results show that the method can reduce feature vector′s dimension and improve the efficiency of feature selection.
What problem does this paper attempt to address?