Gene selection algorithm based on K-S test and mRMR

Juanying Xie,Qiufeng Hu,Yafei Dong
DOI: https://doi.org/10.3969/j.issn.1001-3695.2016.04.011
2016-01-01
Abstract:To deal with the challenging problem of selecting the distinguished genes in the gene expression datasets,this pa-per presented a gene subset selection algorithm based on K-S test and mRMR principles.The algorithm selected the distin-guished genes in K-S test firstly,then it used the minimum redundancy-maximum relevance principle to select the genes from those selected by K-S test.It adopted SVMas the classification tool,and used the criteria of F1_measure,accuracy and AUC to evaluate the performance of the classifiers on the selected gene subsets.It compared the proposed gene subset selection algo-rithm with K-S,mRMR,RELIEF and FAST algorithms.The average experimental results of the aforementioned gene selection algorithms on 5 popular gene expression datasets demonstrate that the new K-S and mRMR based algorithm is significantly fas-ter than mRMR,and the performance of it under the criteria of F1_measure,accuracy and AUC is better than those of K-S, mRMR,RELIEF and FAST.So,the proposed gene subset selection algorithm can find the excellent gene subset.
What problem does this paper attempt to address?