A feature selection algorithm based on classification complementarity

Li Li-Bin, Li Ning, Yang Yu-Bin
DOI: https://doi.org/10.3321/j.issn:0469-5097.2008.05.004
2008-01-01
Abstract: This paper proposes a novel feature selection algorithm based on classification complementarity, which compensates for the shortcomings of using a filter or a wrapper approach alone. Filter methods select features quickly but with low accuracy, while wrapper methods select better feature subsets at a high time cost. The proposed algorithm therefore combines the two approaches and proceeds in two steps. In the first step, it removes irrelevant features using ReliefF estimation and symmetric uncertainty estimation, two correlation measures used to estimate feature quality in classical feature selection methods. Features with low correlation with the class variable are excluded as irrelevant. Symmetric uncertainty estimation is then used to remove redundant features: high symmetric uncertainty between features indicates redundancy among them, and the worst of them is excluded. In the second step, the algorithm selects the target feature subset using a wrapper feature selection algorithm based on classification complementarity estimation. We propose the concept of classification complementarity, a new measure for evaluating combinations of feature sets that indicates whether combining feature sets can improve classification performance. Guided by this measure, feature sets are combined iteratively until no better feature set can be found. Experimental results indicate that the proposed algorithm achieves high accuracy at low time cost and is effective in practical applications.
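The filter step can be illustrated with a minimal Python sketch. It covers only the symmetric uncertainty part, using the standard definition SU(X, Y) = 2 I(X; Y) / (H(X) + H(Y)) for discrete variables. The ReliefF estimate and the classification complementarity wrapper are omitted because the abstract does not define them in detail. The function names, the thresholds `relevance_min` and `redundancy_max`, and the rule of keeping the feature more relevant to the class are illustrative assumptions, not the authors' implementation.

```python
from collections import Counter
from math import log2


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * log2(c / n) for c in Counter(values).values())


def symmetric_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        return 0.0  # both variables are constant
    joint = entropy(list(zip(x, y)))
    mutual_info = hx + hy - joint
    return 2.0 * mutual_info / (hx + hy)


def filter_features(features, labels, relevance_min=0.1, redundancy_max=0.9):
    """Drop irrelevant features, then redundant ones.

    features: dict mapping feature name -> list of discrete values.
    Both thresholds are hypothetical; the abstract gives no values.
    """
    relevance = {
        name: symmetric_uncertainty(column, labels)
        for name, column in features.items()
    }
    # Relevance filter: keep features sufficiently correlated with the class.
    ranked = sorted(
        (name for name in features if relevance[name] >= relevance_min),
        key=lambda name: relevance[name],
        reverse=True,
    )
    # Redundancy filter: visit features from most to least relevant and
    # skip any that is highly correlated with an already kept feature.
    kept = []
    for name in ranked:
        if all(
            symmetric_uncertainty(features[name], features[k]) < redundancy_max
            for k in kept
        ):
            kept.append(name)
    return kept
```

For example, with labels `[0, 0, 1, 1]`, a feature identical to the labels, an exact copy of that feature, and a feature independent of the labels, the sketch keeps only the first: the copy is removed as redundant and the independent feature as irrelevant. The surviving features would then be passed to the wrapper step.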