Peer Review #1 of "DGPathinter: a Novel Model for Identifying Driver Genes Via Knowledge-Driven Matrix Factorization with Prior Knowledge from Interactome and Pathways (V0.1)"
Jianing Xi,Minghui Wang,Ao Li,John N. Weinstein,Eric A. Collisson,Gordon Mills,Kingston H. G. Mills,Brad Ozenberger,Chris Stuart,Thomas J. Hudson,Anna K. Barker,Anna Bell,Cindy Bernabé,Rosa Bhan,Michael D. McLellan,Fabio Vandin,Fan Liu,Charles Xie,Joshua F. McMichael,Matthew A. Wyczalkowski,Lawrence,Michael Stojanov,Paz Polak,Gregory V. Kryukov,Kristian Cibulskis,Andrey Carter,Craig H. Mermel,Craig Roberts,Tamborero,David González-Pérez,Michael Getz,Gary D. Bader,Ding Li,Nathan Zhang,William Koboldt,Daniel Mooney,Thomas Callaway,Hua,Sjöblom,Laura Parsons,David A. Williams,Jimmy Lin,Thomas Mandelker,Del Leary,Rebecca Ptak,Janine Silliman,Nathan D. Dees,Kandoth Zhang,Tobias Sjöblom
DOI: https://doi.org/10.7287/peerj-cs.133v0.1/reviews/1
2017-01-01
Abstract: Distinguishing mutated driver genes, which confer a selective growth advantage on tumor cells, from sporadic passenger mutations is a critical problem in cancer genomics research. Previous studies have reported that some driver genes are mutated at low frequency and do not reach statistical significance, which complicates the identification of driver genes. To address this issue, some existing approaches incorporate prior knowledge from an interactome to detect driver genes that may be dysregulated through their interaction network context. However, although altered operation of many pathways has frequently been observed in cancer progression, prior knowledge from pathways has not been exploited in the driver gene identification task. In this paper, we introduce a driver gene prioritization method called DGPathinter, which is based on a knowledge-based matrix factorization model that incorporates prior knowledge from both the interactome and pathways. When DGPathinter is applied to somatic mutation datasets of three cancer types and evaluated against known driver genes, its prioritization performance is better than that of existing interactome-driven methods. The top-ranked genes detected by DGPathinter are also significantly enriched for known driver genes. Moreover, most of the top-scoring pathways reported by DGPathinter are associated with cancer progression. These results suggest that DGPathinter is a useful tool for identifying potential driver genes.
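The abstract states that the top-ranked genes are significantly enriched for known driver genes but does not name the test behind that claim. As an illustration only, one common way to quantify such enrichment is a one-sided hypergeometric test. The sketch below assumes that approach; the function name `hypergeom_enrichment_p` and all of its parameters are hypothetical and are not taken from the DGPathinter paper or code.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, known_drivers, top_k, hits):
    """One-sided hypergeometric p-value, P(X >= hits).

    Assumed setting (illustrative, not the authors' stated procedure):
    top_k genes are drawn without replacement from total_genes, of which
    known_drivers appear in a reference list of known driver genes, and
    hits of the top_k genes appear in that list.
    """
    max_hits = min(top_k, known_drivers)
    # Number of ways to choose any top_k genes from the full gene set.
    denom = comb(total_genes, top_k)
    # Sum the probability mass for every overlap at least as large as hits.
    tail = sum(
        comb(known_drivers, x) * comb(total_genes - known_drivers, top_k - x)
        for x in range(hits, max_hits + 1)
    )
    return tail / denom


# Hypothetical counts chosen only to show the call; they are not results
# from the paper.
p_value = hypergeom_enrichment_p(total_genes=10, known_drivers=3, top_k=2, hits=2)
```

A small p-value under this assumed test would indicate that the overlap between the top-ranked genes and the reference list is larger than expected by chance. Exact integer arithmetic via `math.comb` keeps the sketch dependency-free, at the cost of speed for very large gene sets.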