A Phase Diagram for Gene Selection and Disease Classification

Hong-Dong Li, Qing-Song Xu, Yi-Zeng Liang
DOI: https://doi.org/10.1016/j.chemolab.2017.06.008
IF: 4.175
2017-01-01
Chemometrics and Intelligent Laboratory Systems
Abstract: Identifying a small subset of genes that can distinguish disease samples from healthy controls plays an important role in evaluating disease risk and facilitating diagnosis. Existing methods often provide only a single metric to assess the predictive performance of genes. Moreover, model-based gene importance is conditioned on the subset of genes used to build the multivariate model, and is thus model- and context-specific; existing methods often do not take such context-specific effects into account. Here we present a novel gene selection approach that takes gene interactions into account, evaluates the predictive performance of genes using two criteria, and projects the genes onto four different regions of a 2-dimensional plot, like a phase diagram (PHADIA) in chemistry. Using two publicly available microarray datasets, we showed that PHADIA achieves classification accuracies comparable to or better than results reported in the literature. The source code is freely available at: www.libpls.net.