Integrating multi-omics data by learning modality invariant representations for improved prediction of overall survival of cancer

Li Tong,Hang Wu,May D Wang,May D. Wang
DOI: https://doi.org/10.1016/j.ymeth.2020.07.008
IF: 4.647
2021-05-01
Methods
Abstract:<p>Breast cancer is the second leading cause of cancer death, and ovarian cancer is the fifth leading cause of cancer death among women. Prediction of overall survival for breast cancer and ovarian cancer patients can facilitate the decision making of treatments and serves as potential metrics for the evaluation of drug responses. For personalized survival prediction, multi-omics data analysis is one of the most promising approaches by utilizing the multi-scale molecular-level information of the patient. However, the effective integration of multi-omics data remains a challenging task. In this paper, we aim to improve the prediction of overall survival for breast cancer and ovarian cancer patients by integrating the multi-omics data, including gene expression, DNA methylation, miRNA expression, and copy number variations. With close interactions among the multi-omics data, we can assume features from each data modality are connected by either association or causal relationships, which jointly impact the survival of cancer patients. Instead of learning the explicit relationships of these features among various multi-omics modalities, we propose to learn modality-invariant representations from deep neural networks by divergence-based consensus regularization. The consensus regularization requires the features encoded from various modalities of the same subject to be accord with each other in a common feature space. With the proposed deep consensus neural networks, we have integrated the multi-omics data and improved the prediction of overall survival for breast cancer and ovarian cancer patients.</p>
What problem does this paper attempt to address?