Abstract:Evaluation of survival models to predict cancer patient prognosis is one of the most important areas of emphasis in cancer research. A binary classification approach has difficulty directly predicting survival due to the characteristics of censored observations and the fact that the predictive power depends on the threshold used to set two classes. In contrast, the traditional Cox regression approach has some drawbacks in the sense that it does not allow for the identification of interactions between genomic features, which could have key roles associated with cancer prognosis. In addition, data integration is regarded as one of the important issues in improving the predictive power of survival models since cancer could be caused by multiple alterations through meta-dimensional genomic data including genome, epigenome, transcriptome, and proteome. Here we have proposed a new integrative framework designed to perform these three functions simultaneously: (1) predicting censored survival data; (2) integrating meta-dimensional omics data; (3) identifying interactions within/between meta-dimensional genomic features associated with survival. In order to predict censored survival time, martingale residuals were calculated as a new continuous outcome and a new fitness function used by the grammatical evolution neural network (GENN) based on mean absolute difference of martingale residuals was implemented. To test the utility of the proposed framework, a simulation study was conducted, followed by an analysis of meta-dimensional omics data including copy number, gene expression, DNA methylation, and protein expression data in breast cancer retrieved from The Cancer Genome Atlas (TCGA). On the basis of the results from breast cancer dataset, we were able to identify interactions not only within a single dimension of genomic data but also between meta-dimensional omics data that are associated with survival. Notably, the predictive power of our best meta-dimensional model was 73% which outperformed all of the other models conducted based on a single dimension of genomic data. Breast cancer is an extremely heterogeneous disease and the high levels of genomic diversity within/between breast tumors could affect the risk of therapeutic responses and disease progression. Thus, identifying interactions within/between meta-dimensional omics data associated with survival in breast cancer is expected to deliver direction for improved meta-dimensional prognostic biomarkers and therapeutic targets.

Prognostically Relevant Subtypes and Survival Prediction for Breast Cancer Based on Multimodal Genomics Data

Prognostically Relevant Subtypes and Survival Prediction for Breast Cancer Based on Multimodal Genomics Data

Multi-omics-based Machine Learning for the Subtype Classification of Breast Cancer

Improving the robustness and stability of a machine learning model for breast cancer prognosis through the use of multi-modal classifiers

Evaluation of Machine Learning Algorithms for the Prognosis of Breast Cancer from the Surveillance, Epidemiology, and End Results Database

Advancing Breast Cancer Subtype Prediction and Mutation Analysis: Integrating Deep Learning and Machine Learning Techniques in Genomic Research

Improve Glioblastoma Multiforme Prognosis Prediction by Using Feature Selection and Multiple Kernel Learning

Multi-modal advanced deep learning architectures for breast cancer survival prediction

Machine learning integrated ensemble of feature selection methods followed by survival analysis for predicting breast cancer subtype specific miRNA biomarkers

A Robust Personalized Classification Method for Breast Cancer Metastasis Prediction

Diagnosis of breast cancer molecular subtypes using machine learning models on unimodal and multimodal datasets

Enhancing Breast Cancer Survival Prognosis Through Omic and Non-Omic Data Integration

Integrating Somatic Mutations for Breast Cancer Survival Prediction Using Machine Learning Methods

BC-Predict: Mining of signal biomarkers and multilevel validation of cascade classifier for early-stage breast cancer subtyping and prognosis

Identification and exploration of the pyroptosis-related molecular subtypes of breast cancer by bioinformatics and machine learning.

Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis

Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer

Multi-modal AI for comprehensive breast cancer prognostication

Prediction of Breast Cancer Recurrence Risk Using a Multi-Model Approach Integrating Whole Slide Imaging and Clinicopathologic Features

Deep learning assisted multi-omics integration for survival and drug-response prediction in breast cancer

A Multi-Omics Integration Framework Using Graph Attention Networks for Cancer Subtype Prediction