Constructing a personalized prognostic risk model for colorectal cancer using machine learning and multi‐omics approach based on epithelial–mesenchymal transition‐related genes

Shuze Zhang,Wanli Fan,Dong He
DOI: https://doi.org/10.1002/jgm.3660
2024-01-18
The Journal of Gene Medicine
Abstract:A computer analyzing gene expression data for colorectal cancer (CRC), identifying epithelial–mesenchymal transition (EMT) associated genes, is illustrated. Two distinct patient groups are depicted: one with high‐risk CRC featuring an immunosuppressive tumor environment and another with low‐risk CRC having better survival prospects. The image also includes a representation of the long non‐coding RNA (lncRNA) PVT1 interacting with TIMP1 and MMP1 genes, highlighting the study's novel insights into CRC molecular dynamics. The progression and the metastatic potential of colorectal cancer (CRC) are intricately linked to the epithelial–mesenchymal transition (EMT) process. The present study harnesses the power of machine learning combined with multi‐omics data to develop a risk stratification model anchored on EMT‐associated genes. The aim is to facilitate personalized prognostic assessments in CRC. We utilized publicly accessible gene expression datasets to pinpoint EMT‐associated genes, employing a CoxBoost algorithm to sift through these genes for prognostic significance. The resultant model, predicated on gene expression levels, underwent rigorous independent validation across various datasets. Our model demonstrated a robust capacity to segregate CRC patients into distinct high‐ and low‐risk categories, each correlating with markedly different survival probabilities. Notably, the risk score emerged as an independent prognostic indicator for CRC. High‐risk patients were characterized by an immunosuppressive tumor milieu and a heightened responsiveness to certain chemotherapeutic agents, underlining the model's potential in steering tailored oncological therapies. Moreover, our research unearthed a putative repressive interaction between the long non‐coding RNA PVT1 and the EMT‐associated genes TIMP1 and MMP1, offering new insights into the molecular intricacies of CRC. In essence, our research introduces a sophisticated risk model, leveraging machine learning and multi‐omics insights, which accurately prognosticates outcomes for CRC patients, paving the way for more individualized and effective oncological treatment paradigms.
genetics & heredity,medicine, research & experimental,biotechnology & applied microbiology
What problem does this paper attempt to address?