iPcc: A Novel Feature Extraction Method for Accurate Disease Class Discovery and Prediction

Xianwen Ren, Yong Wang, Xiang-Sun Zhang, Qi Jin
DOI: https://doi.org/10.1093/nar/gkt343
IF: 14.9
2013-01-01
Nucleic Acids Research
Abstract: Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study, we present iPcc, a novel method from the feature extraction perspective to further propel gene expression profiling technologies from bench to bedside. We define ‘correlation feature space’ for samples based on the gene expression profiles by iterative employment of Pearson’s correlation coefficient. Numerical experiments on both simulated and real gene expression data sets demonstrate that iPcc can greatly highlight the latent patterns underlying noisy gene expression data and thus greatly improve the robustness and accuracy of the algorithms currently available for disease diagnosis and classification based on gene expression profiles.
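The idea of a correlation feature space built by repeatedly applying Pearson's correlation coefficient can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state how many iterations are used or how the procedure stops, so the function name `iterated_pearson`, the fixed `n_iter` parameter, and the simulated data are all assumptions made for illustration only.

```python
import numpy as np


def iterated_pearson(expr, n_iter=3):
    """Map a samples-by-genes matrix to a samples-by-samples feature matrix.

    Hypothetical sketch: each pass replaces every sample's feature vector
    with its Pearson correlations to all samples. The fixed iteration count
    is an assumption, not a rule taken from the paper.
    """
    features = np.asarray(expr, dtype=float)
    for _ in range(n_iter):
        # np.corrcoef treats each row as one variable, so the result is the
        # sample-by-sample Pearson correlation matrix of the current features.
        features = np.corrcoef(features)
    return features


# Simulated data (assumed): two classes of 5 samples each over 200 genes,
# sharing one latent pattern with opposite sign, plus Gaussian noise.
rng = np.random.default_rng(0)
pattern = rng.normal(size=200)
signs = np.repeat([1.0, -1.0], 5)[:, None]
expr = signs * pattern + rng.normal(size=(10, 200))

corr_features = iterated_pearson(expr, n_iter=3)
print(corr_features.shape)  # one row of correlation features per sample
```

The resulting matrix could then be passed to an ordinary clustering or classification routine in place of the raw expression values, which is the role the abstract describes for the extracted features.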