A Supervised Solution for Redundant Feature Detection Depending on Instances

Xue-Qiang Zeng, Guo-Zheng Li
DOI: https://doi.org/10.1109/bibmw.2012.6470320
2012-01-01
Abstract: Analysis of microarray data sets is a challenging high-dimensional problem in which many weakly relevant or redundant features hurt the generalization performance of classifiers. Previous work used redundant feature detection methods to select a compact, discriminative gene set, but these methods considered only the relationships among features, not the redundancy of classification ability among features. Here, we propose a novel algorithm named RESI (Redundant fEature Selection depending on Instance), which incorporates label information into the measure of feature subset redundancy. Experimental results on benchmark data sets show that RESI performs better than previous state-of-the-art redundant feature selection methods such as mRMR.
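The abstract contrasts redundancy measured among features alone with redundancy of classification ability. The sketch below illustrates that contrast on toy data. It is not the RESI definition, which the abstract does not give. The one-feature threshold rule, the Jaccard overlap of correctly classified instances, and all function names are assumptions made for illustration only.

```python
import numpy as np

def stump_correct(x, y):
    """Mask of instances that a one-feature threshold rule classifies correctly.

    Hypothetical helper: the threshold is the midpoint of the two class means.
    """
    m0, m1 = x[y == 0].mean(), x[y == 1].mean()
    thr = (m0 + m1) / 2.0
    # Predict class 1 on the side of the threshold where the class-1 mean lies.
    pred = (x > thr).astype(int) if m1 >= m0 else (x <= thr).astype(int)
    return pred == y

def feature_redundancy(xa, xb):
    """Label-free redundancy: absolute Pearson correlation between two features."""
    return abs(np.corrcoef(xa, xb)[0, 1])

def instance_redundancy(xa, xb, y):
    """Label-aware redundancy (assumed form): Jaccard overlap of the sets of
    instances that each feature classifies correctly on its own."""
    ca, cb = stump_correct(xa, y), stump_correct(xb, y)
    union = np.logical_or(ca, cb).sum()
    return np.logical_and(ca, cb).sum() / union if union else 0.0

# Toy data: 8 instances, binary labels, three made-up features.
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])
fa = np.array([1, 2, 3, 9, 8, 9, 10, 2])
fb = 2 * fa + 1                            # linear copy of fa
fc = np.array([1, 9, 2, 1, 9, 2, 9, 9])    # errs on different instances than fa

print(feature_redundancy(fa, fb), instance_redundancy(fa, fb, y))
print(feature_redundancy(fa, fc), instance_redundancy(fa, fc, y))
```

In this toy setting, `fb` gets the same instances right as `fa`, so it adds no classification ability, whereas `fc` gets some instances right that `fa` misses. A label-aware measure can separate these two cases directly.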