CFC: a Cascade Forest Approach to Discover Cancer Driver Genes Using Multi-Omics Data.

Lei Zhang,Yijing Yang,Zhe Wang,Dongdong Li,Jingping Liu,Hai Yang
DOI: https://doi.org/10.1109/bibm55620.2022.9995547
2022-01-01
Abstract:With the development of next-generation sequencing technology, massive genomic data has been generated, primarily encouraging research on cancer driver genes. Many bioinformatics methods were proposed to identify driver genes. However, the results of driver gene identification a mong these methods show considerable differences. It is still challenging to obtain a comprehensive catalog of cancer drivers. Although current methods have greatly promoted the development of driver genes, few methods can integrate the identification results of existing methods. To solve such problems in cancer driver genes research, we proposed a cascade forest model to discover cancer driver genes(CFC) that can integrate multi-omics data and annotation scores from different cancer driver gene identification algorithms. The proposed method got precise results for 33 cancer types and Pan-cancer. The CFC framework identified 275 driver genes in Pan-cancer, of which 179 were included in the Gold standard. The identified genes were enriched i n t he principal cancer signaling pathways.
What problem does this paper attempt to address?