Topologically inferring risk-active pathways toward precise cancer classification by directed random walk.

Wei Liu, Chunquan Li, Yanjun Xu, Haixiu Yang, Qianlan Yao, Junwei Han, Desi Shang, Chunlong Zhang, Fei Su, Xiaoxi Li, Yun Xiao, Fan Zhang, Meng Dai, Xia Li
DOI: https://doi.org/10.1093/bioinformatics/btt373
IF: 5.8
2013-01-01
Bioinformatics
Abstract: Motivation: The accurate prediction of disease status is a central challenge in clinical cancer research. Microarray-based gene biomarkers have been identified that predict outcome and outperform traditional clinical parameters. However, the robustness of individual gene biomarkers is questioned because of their poor reproducibility between different cohorts of patients. Substantial progress in treatment requires advances in methods to identify robust biomarkers. Several methods incorporating pathway information have been proposed to identify robust pathway markers and build classifiers at the level of functional categories rather than of individual genes. However, current methods treat pathways as simple gene sets and ignore pathway topological information, which is essential for inferring a more robust pathway activity. Results: Here, we propose a directed random walk (DRW)-based method to infer pathway activity. DRW evaluates the topological importance of each gene by capturing the structural information embedded in the directed pathway network. The strategy of weighting genes by their topological importance greatly improved the reproducibility of pathway activities. Experiments on 18 cancer datasets showed that the proposed method yielded more accurate and robust overall performance than several existing gene-based and pathway-based classification methods. The resulting risk-active pathways are more reliable in guiding therapeutic selection and the development of pathway-specific therapeutic strategies.
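To make the idea of weighting genes by topological importance concrete, the following is a minimal sketch of a generic random walk with restart on a small directed graph. It is not the authors' implementation: the function name `directed_random_walk`, the toy edge list, the restart probability of 0.7, the uniform initial weights, and the handling of genes without outgoing edges are all assumptions made for illustration.

```python
def directed_random_walk(edges, initial, restart=0.7, tol=1e-10, max_iter=1000):
    """Random walk with restart on a directed graph (illustrative sketch).

    edges   -- list of (source, target) gene pairs; a hypothetical toy input
    initial -- dict mapping gene -> non-negative starting weight (assumed)
    Returns a dict mapping gene -> converged weight; the weights sum to 1.
    """
    nodes = sorted({n for edge in edges for n in edge} | set(initial))
    successors = {n: [] for n in nodes}
    for source, target in edges:
        successors[source].append(target)

    total = sum(initial.get(n, 0.0) for n in nodes)
    if total <= 0:
        raise ValueError("initial weights must have a positive sum")
    w0 = {n: initial.get(n, 0.0) / total for n in nodes}  # restart distribution

    w = dict(w0)
    for _ in range(max_iter):
        # With probability `restart`, the walker jumps back to the start distribution.
        nxt = {n: restart * w0[n] for n in nodes}
        for source in nodes:
            mass = (1.0 - restart) * w[source]
            targets = successors[source]
            if targets:
                # Otherwise it follows one outgoing edge chosen uniformly.
                for target in targets:
                    nxt[target] += mass / len(targets)
            else:
                # Assumption: a gene with no outgoing edge returns its mass
                # to the restart distribution so the total stays at 1.
                for n in nodes:
                    nxt[n] += mass * w0[n]
        delta = sum(abs(nxt[n] - w[n]) for n in nodes)
        w = nxt
        if delta < tol:
            break
    return w


# Toy directed pathway: A regulates B and C, B regulates C.
toy_edges = [("A", "B"), ("A", "C"), ("B", "C")]
toy_weights = directed_random_walk(toy_edges, {"A": 1.0, "B": 1.0, "C": 1.0})
```

In this toy graph the weight accumulates on genes that receive more incoming edges, so `C` ends up with the largest weight and `A` with the smallest. Which edge direction is the biologically meaningful one, and how the initial weights are derived from expression data, are design choices this sketch does not settle.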