CanDriS: posterior profiling of cancer-driving sites based on two-component evolutionary model

Wenyi Zhao, Jingwen Yang, Jingcheng Wu, Guoxing Cai, Yao Zhang, Jeffrey Haltom, Weijia Su, Michael J. Dong, Shuqing Chen, Jian Wu, Zhan Zhou, Xun Gu
DOI: https://doi.org/10.1093/bib/bbab131
IF: 9.5
2021-01-01
Briefings in Bioinformatics
Abstract: Current cancer genomics databases have accumulated millions of somatic mutations that remain to be further explored. Because most of these mutations are unrelated to cancer, the main challenge is to identify the somatic mutations that drive cancer. Under the notion that carcinogenesis is a form of somatic-cell evolution, we developed a two-component mixture model in which the ground component corresponds to passenger mutations and the rapidly evolving component corresponds to driver mutations. We then implemented an empirical Bayesian procedure to calculate the posterior probability that a site is cancer-driving. On this basis, we developed the software CanDriS (Cancer Driver Sites) to profile potential cancer-driving sites in thousands of tumor samples from the Cancer Genome Atlas and the International Cancer Genome Consortium, both within individual tumor types and at the pan-cancer level. We found that approximately 1% of sites have posterior probabilities larger than 0.90, and we list potential cancer-wide and cancer-specific driver mutations. By comprehensively profiling all potential cancer-driving sites, CanDriS enhances our ability to refine our knowledge of the genetic basis of cancer and might guide clinical medication in the upcoming era of precision medicine. The results are available in the database CandrisDB (http://biopharm.zju.edu.cn/candrisdb/).
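To make the idea of a two-component mixture with an empirical Bayesian posterior concrete, the following is a minimal sketch and not the CanDriS implementation. The abstract does not state which distributions the components use or how the parameters are estimated, so the Poisson components, the EM fitting loop, the synthetic counts, and all function names here (`fit_two_component`, `posterior_driver`) are assumptions made purely for illustration.

```python
import math


def poisson_pmf(k, lam):
    # Poisson probability mass, evaluated in log space for numerical stability.
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))


def posterior_driver(k, lam_bg, lam_dr, pi_dr):
    # Bayes' rule: probability that a site with k mutations belongs to the
    # fast (driver) component rather than the ground (passenger) component.
    bg = (1.0 - pi_dr) * poisson_pmf(k, lam_bg)
    dr = pi_dr * poisson_pmf(k, lam_dr)
    return dr / (bg + dr)


def fit_two_component(counts, n_iter=200):
    # Hypothetical EM fit of a two-component Poisson mixture to per-site
    # mutation counts (an assumed model, not the one from the paper).
    n = len(counts)
    mean = sum(counts) / n
    lam_bg = max(0.5 * mean, 1e-6)   # initial ground (passenger) rate
    lam_dr = max(5.0 * mean, 1e-3)   # initial fast (driver) rate
    pi_dr = 0.05                     # initial share of driver sites
    for _ in range(n_iter):
        # E-step: posterior driver probability for every site.
        post = [posterior_driver(k, lam_bg, lam_dr, pi_dr) for k in counts]
        # M-step: re-estimate both rates and the mixing proportion.
        w_dr = sum(post)
        w_bg = n - w_dr
        lam_dr = max(sum(p * k for p, k in zip(post, counts)) / w_dr, 1e-9)
        lam_bg = max(sum((1.0 - p) * k for p, k in zip(post, counts)) / w_bg, 1e-9)
        pi_dr = w_dr / n
    if lam_dr < lam_bg:
        # Keep the convention that the driver component is the faster one.
        lam_bg, lam_dr, pi_dr = lam_dr, lam_bg, 1.0 - pi_dr
    return lam_bg, lam_dr, pi_dr


# Synthetic per-site counts (made up): mostly quiet sites plus a few hotspots.
counts = [0] * 900 + [1] * 80 + [2] * 10 + [18, 19, 20, 21, 22, 24, 25, 26, 27, 30]
lam_bg, lam_dr, pi_dr = fit_two_component(counts)
# Flag sites whose posterior exceeds 0.90, the threshold quoted in the abstract.
flagged = [k for k in counts if posterior_driver(k, lam_bg, lam_dr, pi_dr) > 0.90]
print(len(flagged), "of", len(counts), "sites flagged")
```

The procedure is "empirical" in the sense that the mixing proportion and component rates are estimated from the same data and then plugged into Bayes' rule to score each site.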