LNDriver: Identifying Driver Genes by Integrating Mutation and Expression Data Based on Gene-Gene Interaction Network

Pi-Jing Wei, Di Zhang, Junfeng Xia, Chun-Hou Zheng
DOI: https://doi.org/10.1186/s12859-016-1332-y
IF: 3.307
2016-01-01
BMC Bioinformatics
Abstract: Background: Cancer is a complex disease characterized by the accumulation of genetic alterations during the patient’s lifetime. With the development of next-generation sequencing technology, multiple types of omics data, such as cancer genomic, epigenomic and transcriptomic data, can be measured from each individual. Correspondingly, one of the key challenges is to pinpoint functional driver mutations or pathways, which contribute to tumorigenesis, among millions of functionally neutral passenger mutations. Results: In this paper, to identify driver genes effectively, we applied a generalized additive model to mutation profiles to filter genes with long length and constructed a new gene-gene interaction network. We then integrated the mutation data and expression data into the gene-gene interaction network. Lastly, a greedy algorithm was used to prioritize candidate driver genes from the integrated data. We named the proposed method Length-Net-Driver (LNDriver). Conclusions: Experiments on three TCGA datasets, i.e., head and neck squamous cell carcinoma, kidney renal clear cell carcinoma and thyroid carcinoma, demonstrated that the proposed method was effective. It can identify not only frequently mutated drivers but also rare candidate driver genes.
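The greedy prioritization step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify the exact procedure, so this example assumes a greedy set-cover formulation: each mutated gene is mapped to the set of expression events it could explain through the network, and genes are ranked by how many still-unexplained events they cover. The function name `greedy_prioritize`, the gene names and the event labels are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Set, Tuple


def greedy_prioritize(coverage: Dict[str, Set[str]]) -> List[Tuple[str, int]]:
    """Rank genes with a greedy set-cover heuristic (an assumed formulation).

    coverage maps each mutated gene to the set of expression events it
    could explain. At every step the gene covering the most still-unexplained
    events is selected, and those events are removed from all other genes.
    """
    # Copy the input so the caller's sets are not modified.
    remaining = {gene: set(events) for gene, events in coverage.items()}
    ranked: List[Tuple[str, int]] = []
    while remaining:
        # Iterate in sorted order so ties are broken alphabetically.
        best = max(sorted(remaining), key=lambda g: len(remaining[g]))
        gained = remaining.pop(best)
        if not gained:
            break  # no remaining gene explains any new event
        ranked.append((best, len(gained)))
        for gene in remaining:
            remaining[gene] -= gained
    return ranked


# Hypothetical toy input: event labels are "patient:gene" pairs.
toy = {
    "GENE_A": {"p1:g1", "p1:g2", "p2:g1"},
    "GENE_B": {"p1:g2", "p3:g4"},
    "GENE_C": {"p2:g1"},
}
print(greedy_prioritize(toy))  # [('GENE_A', 3), ('GENE_B', 1)]
```

In this toy input, GENE_C drops out of the ranking because the only event it covers is already explained by GENE_A. A rarely mutated gene can still rank highly under this scheme if it covers events that no other gene explains.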