Joint Feature Selection Method Based on Relevance and Redundancy

ZHOU Cheng, GE Bin, TANG Jiu-yang, XIAO Wei-dong
DOI: https://doi.org/10.3969/j.issn.1002-137x.2012.04.041
2012-01-01
Computer Science
Abstract: Based on a comparative study of four feature selection methods, namely document frequency (DF), which does not use class information, and information gain (IG), mutual information (MI), and the chi-square statistic (CHI), which do use class information, we analyzed the disadvantages of combining these two kinds of methods directly and proposed a joint feature selection method based on relevance and redundancy that combines DF with one of IG, MI, or CHI. The approach aims to eliminate redundant features, identify features useful for classification, and consequently improve the accuracy of text sentiment classification. The experimental results show that the proposed method not only improves classification performance but also reduces the feature dimension.
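As a reading aid, the four baseline scores named in the abstract can be sketched from a 2x2 term/class contingency table. This is a minimal illustration using the standard textbook definitions for a single term and a binary class; it is not the joint method proposed in the paper, which the abstract does not specify. The function names `entropy` and `term_scores` and the count names `a`, `b`, `c`, `d` are assumptions made for this example.

```python
import math


def entropy(counts):
    """Shannon entropy (base 2) of a list of non-negative counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def term_scores(a, b, c, d):
    """Score one term against one class (illustrative, hypothetical helper).

    a: documents in the class that contain the term
    b: documents outside the class that contain the term
    c: documents in the class that lack the term
    d: documents outside the class that lack the term
    """
    n = a + b + c + d

    # DF ignores class labels: it only counts documents containing the term.
    df = a + b

    # Chi-square statistic of the 2x2 table (0 when a margin is empty).
    chi_den = (a + c) * (b + d) * (a + b) * (c + d)
    chi = n * (a * d - c * b) ** 2 / chi_den if chi_den else 0.0

    # Pointwise mutual information between term presence and the class.
    mi = math.log2(a * n / ((a + c) * (a + b))) if a > 0 else float("-inf")

    # Information gain: class entropy minus entropy conditioned on the term.
    ig = (
        entropy([a + c, b + d])
        - (df / n) * entropy([a, b])
        - ((c + d) / n) * entropy([c, d])
    )
    return {"DF": df, "IG": ig, "MI": mi, "CHI": chi}


if __name__ == "__main__":
    # Toy counts: the term appears in 40 of 50 positive and 10 of 50 negative documents.
    print(term_scores(40, 10, 10, 40))
```

DF depends only on `a + b`, so two terms with the same document frequency can differ widely in IG, MI, and CHI; this is the distinction the abstract draws between class-independent and class-dependent criteria.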