An Efficient Gene Selection Algorithm Based on Tolerance Rough Set Theory

Na Jiao,Duoqian Miao
DOI: https://doi.org/10.1007/978-3-642-10646-0_21
2009-01-01
Abstract:Gene selection, a key procedure of the discriminant analysis of microarray data, is to select the most informative genes from the whole gene set. Rough set theory is a mathematical tool for further reducing redundancy. One limitation of rough set theory is the lack of effective methods for processing real-valued data. However, most of gene expression data sets are continuous. Discretization methods can result in information loss. This paper investigates an approach combining feature ranking together with feature selection based on tolerance rough set theory. Compared with gene selection algorithm based on rough set theory, the proposed method is more effective for selecting high discriminative genes in cancer classification task.
What problem does this paper attempt to address?