Development a Prognostic Model Integrating lncRNA/ mRNA novel Biomarkers Identified by Bioinformatics Analysis and Experiments in Breast Cancer

Jinrong Wei,Qianshu Dou,Futing Ba,Guo-Qin Jiang
DOI: https://doi.org/10.21203/rs.3.rs-251621/v1
2021-01-01
Abstract:Abstract Purpose: The purpose of this study is to established a prognosis model based on the expression profiles of lncRNAs and mRNAs for breast cancers.Methods: Single Variable Cox Proportional Risk Regression analysis and difference analysis were applied to screen survival-related and differently expressed lncRNAs and mRNAs between tumor and normal tissues from TCGA data. GO and KEGG analysis were applied for top 30 survival-related genes. LncRNA/mRNA co-expressed network was constructed based on correlation analysis. LASSO analysis and Multivariate Stepwise Cox Regression analysis were applied to establish the prognosis model. RT-PCR experiments were applied to verify the correctness of the analysis results. Relative components of the TME in breast cancers with high and low risk groups were analysed by xCell and Cox proportional risk regression analysis. The ceRNA network was constructed by calculating the Pearson correlation coefficient (PCC) for miRNA-mRNA and miRNA-lncRNA using paired miRNA, mRNA, and lncRNA expression profile data.Results:Venn diagrams showed that there were 60 genes and 54 lncRNAs that were differently expressed and related with survival. Through lncRNA/mRNA co-expressed network construction, 19 lncRNA and 16 mRNA hub genes were gained. The genes were then included in LASSO and multivariate Cox proportional hazard regression analysis, and finally, 3 lncRNAs (LINC01497, LINC02766, LINC02528) and 2 mRNAs (C20orf85, CST1) were selected as prognosis predictive genes. According to the median risk score of the 5 candidates, patients were divided into high-risk group and low-risk group. The results of RT-PCR were consistent with the analysis results. The proportions of Adipocytes, Endothelial cells, HSCs, Fibroblasts were significantly lower in low risk score tissues compared with the high risk score tissues, while the proportions of M1 macrophages, MSCs, Th2 cells were significantly higher. A lncRNA-miRNA-mRNA ceRNA network containing 3 lncRNAs, 2 mRNAs, and 158 miRNAs was finally constructed, preliminarily revealed a proper mechanism of the 5 molecules playing important roles in breast cancer progression and prognosis prediction.Conclusion: We found that LINC01497, LINC02766, LINC02528 and C20orf85, CST1 may serve as a powerful prognostic tool to optimize the prognosis evaluation system of breast cancer.
What problem does this paper attempt to address?