Constructing Tumor Immune Microenvironment and Identifying the Immune-Related Prognostic Signatures in Colorectal Cancer Using Multi-omics Data

Shuai Zhang, Jiali Lv, Bingbing Fan, Zhe Fan, Chunxia Li, Bingbing Gu, Wenhao Yu, Tao Zhang
DOI: https://doi.org/10.1101/2021.08.30.21262762
2021-08-30
Abstract:
Background: The tumor immune microenvironment (TIME) plays a key role in the occurrence, progression and prognosis of colorectal cancer (CRC). However, the genetic and epigenetic alterations and potential mechanisms in the TIME of CRC are still unclear.
Methods: We investigated the immune-related differences in three types of genetic or epigenetic alterations (gene expression, somatic mutation, and DNA methylation) and considered the potential roles of these alterations in the immune response and immune-related biological processes by analyzing multi-omics data from The Cancer Genome Atlas (TCGA) portal. Additionally, a four-step method based on LASSO regression and Cox regression was used to construct the prognostic prediction model. Cross validation was performed to validate the model.
Results: A total of 1,745 differentially expressed genes, 178 differentially mutated genes and 1,961 differentially methylated probes were identified between the high-immunity group and the low-immunity group. We retained 15 genetic and epigenetic variables after applying LASSO regression and Cox regression. For the prognostic predictions on the TCGA profiles, the model achieved 1-year, 3-year, and 5-year areas under the curve (AUCs) of 0.861, 0.797, and 0.875, respectively. Finally, the overall risk score model was constructed from genetic, epigenetic, demographic and clinical characteristics; it comprised 18 variables and achieved a high degree of accuracy on the 1-year (AUC = 0.865), 3-year (AUC = 0.839), and 5-year (AUC = 0.914) survival predictions. Kaplan-Meier survival analysis demonstrated that overall survival in the high-risk group was significantly poorer than in the low-risk group. The prognostic nomogram, calibration plot and cross validation showed excellent predictive performance.
Conclusions: Our study offers new clues for exploring the TIME of CRC patients from genetic and epigenetic perspectives. The prognostic model also has clinical value and may provide new indicators for the treatment of CRC patients.
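To make the risk-score idea in the abstract concrete, the following is a minimal sketch of how a Cox-style risk score might be computed and used to form high-risk and low-risk groups. It is not the authors' implementation: the variable names, coefficient values, and patient data are hypothetical, and the median cutoff is an assumption, since the abstract does not state how the groups were defined.

```python
from statistics import median

def risk_scores(patients, coefficients):
    """Cox-style linear predictor: sum of coefficient * variable value per patient."""
    return [
        sum(coefficients[name] * patient[name] for name in coefficients)
        for patient in patients
    ]

def split_by_median(scores):
    """Label each score 'high' if above the median score, else 'low' (assumed cutoff)."""
    cutoff = median(scores)
    return ["high" if score > cutoff else "low" for score in scores]

# Hypothetical coefficients, standing in for variables retained by LASSO and Cox regression.
coefficients = {"gene_a_expr": 0.8, "probe_b_meth": -0.5, "gene_c_mut": 1.2}

# Hypothetical patients: expression level, methylation beta value, mutation status (0/1).
patients = [
    {"gene_a_expr": 1.0, "probe_b_meth": 0.2, "gene_c_mut": 0},
    {"gene_a_expr": 0.1, "probe_b_meth": 0.9, "gene_c_mut": 0},
    {"gene_a_expr": 2.0, "probe_b_meth": 0.1, "gene_c_mut": 1},
    {"gene_a_expr": 0.5, "probe_b_meth": 0.5, "gene_c_mut": 0},
]

scores = risk_scores(patients, coefficients)
groups = split_by_median(scores)
```

In a real analysis the coefficients would come from a fitted Cox model, and the resulting groups would then be compared with a Kaplan-Meier analysis, as the abstract describes.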