An Efficient Gene Selection Technique for Cancer Recognition Based on Neighborhood Mutual Information.

Qinghua Hu, Wei Pan, Shuang An, Peijun Ma, Jinmao Wei
DOI: https://doi.org/10.1007/s13042-010-0008-6
2010-01-01
International Journal of Machine Learning and Cybernetics
Abstract: Gene selection is a key problem in gene-expression-based cancer recognition and related tasks. In this work, a measure called neighborhood mutual information (NMI) is introduced to evaluate the relevance between genes and the associated decision. The measure is then combined with the minimal-redundancy-maximal-relevance (mRMR) search strategy to construct an NMI-based mRMR gene selection algorithm (NMI_mRMR). It is also found that the first k best genes with respect to NMI are usually enough for cancer classification, so mRMR can be run on these k genes alone while the rest are removed in a preprocessing step, which reduces computation time. Based on this observation, an efficient gene selection algorithm, denoted NMI_EmRMR, is proposed. Several cancer recognition tasks are used to test the proposed technique. The experimental results show that NMI_EmRMR is both effective and efficient.
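The two-stage procedure described in the abstract (rank genes by NMI, keep the first k, then run mRMR on those k only) can be illustrated with a minimal Python sketch. The abstract does not give the NMI formula, so everything below is an assumption rather than the authors' implementation. The sketch uses a commonly cited form, NMI(R;S) = -(1/n) Σ_i log( |δ_R(x_i)|·|δ_S(x_i)| / (n·|δ_{R∪S}(x_i)|) ), where δ(x_i) is the set of samples within distance delta of x_i and class labels count as neighbors only when equal. It also uses the difference form of the mRMR criterion (relevance minus mean redundancy). The function names and the parameters `k`, `m`, and `delta` are hypothetical, and expression values are assumed to be scaled to [0, 1].

```python
import numpy as np

def neighborhood_mask(x, delta):
    """Boolean (n, n) matrix: entry [i, j] is True if |x_i - x_j| <= delta."""
    x = np.asarray(x, dtype=float)
    return np.abs(x[:, None] - x[None, :]) <= delta

def label_mask(y):
    """Neighborhoods for a discrete variable: samples sharing the same label."""
    y = np.asarray(y)
    return y[:, None] == y[None, :]

def nmi(mask_a, mask_b):
    """Neighborhood mutual information (in bits) from two neighborhood masks.
    Assumed form; the joint neighborhood is the intersection of both masks."""
    n = mask_a.shape[0]
    size_a = mask_a.sum(axis=1)
    size_b = mask_b.sum(axis=1)
    size_ab = (mask_a & mask_b).sum(axis=1)  # >= 1: each sample neighbors itself
    return float(-np.mean(np.log2(size_a * size_b / (n * size_ab))))

def nmi_emrmr(X, y, k=50, m=10, delta=0.15):
    """Sketch of the efficient variant: prefilter to the top-k genes by NMI
    with the class label, then greedily pick m genes by an mRMR criterion."""
    X = np.asarray(X, dtype=float)
    y_mask = label_mask(y)

    # Stage 1: relevance of every gene, then keep only the k most relevant.
    relevance = np.array(
        [nmi(neighborhood_mask(X[:, j], delta), y_mask) for j in range(X.shape[1])]
    )
    candidates = [int(j) for j in np.argsort(-relevance)[:k]]
    masks = {j: neighborhood_mask(X[:, j], delta) for j in candidates}

    # Stage 2: mRMR restricted to the k candidates.
    selected = [candidates.pop(0)]  # start from the most relevant gene
    redundancy = {j: 0.0 for j in candidates}
    while candidates and len(selected) < m:
        last = selected[-1]
        for j in candidates:
            # Accumulate redundancy with the most recently selected gene.
            redundancy[j] += nmi(masks[j], masks[last])
        best = max(
            candidates,
            key=lambda j: relevance[j] - redundancy[j] / len(selected),
        )
        candidates.remove(best)
        selected.append(best)
    return selected
```

Calling `nmi_emrmr(X, y, k=50, m=10)` on an expression matrix `X` of shape (samples, genes) returns the column indices of the selected genes. In this sketch the pairwise redundancy terms are computed only among the k retained genes instead of all genes, which is where the time saving described in the abstract would come from.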