Identification and Validation of an Immunological Expression-Based Prognostic Signature in Breast Cancer

Jianying Pei,Yan Li,Tianxiong Su,Qiaomei Zhang,Xin He,Dan Tao,Yanyun Wang,Manqiu Yuan,Yanping Li
DOI: https://doi.org/10.3389/fgene.2020.00912
IF: 3.7
2020-09-16
Frontiers in Genetics
Abstract:<span><b>Background:</b> Emerging evidence suggests that the immune system plays a crucial role in the regulation of the response to therapy and long-term outcomes of patients with breast cancer (BRCA). In this study, we aimed to identify a significant signature based on immune-related genes to predict the prognosis of BRCA patients.<b>Methods:</b> The expression data were downloaded from The Cancer Genome Atlas (TCGA). The immune-related gene list, the transcription factor (TF) gene list, and the immune infiltrate scores of samples in the TCGA database were acquired from the ImmPort database, the Cistrome Cancer database, and the TIMER database, respectively. Univariate Cox regression analysis was utilized to identify prognostic immune-related differentially expressed genes (DEGs) (PIRDEGs) in BRCA. A prognostic immune signature containing 15 PIRDEGs in BRCA was established using the least absolute shrinkage and selection operator (LASSO) model with 1,000 iterations followed by a stepwise Cox proportional hazards model with a training set of 508 samples in TCGA. An independent assessment of the prognostic prediction ability of the signature was conducted using Kaplan–Meier survival analysis with a testing set of 505 samples in TCGA.<b>Results:</b> We identified 466 PIRDEGs and 80 TFs among the DEGs. A gene signature containing 15 PIRDEGs was constructed. Risk scores of BRCA patients were calculated using this model, which showed a high accuracy of prognosis prediction in both the training set and testing set and could be an independent prognostic factor of BRCA patients.<b>Conclusions:</b> Our study revealed that a PIRDEG signature could be a candidate prognostic biomarker for predicting the overall survival (OS) of patients with BRCA.</span>
genetics & heredity
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to predict the survival of breast cancer (Breast Cancer, BRCA) patients by identifying prognostic features based on immune expression. Specifically, the researchers aim to screen out prognostic immune - related differentially expressed genes (PIRDEGs) that are closely related to the prognosis of breast cancer patients from a large amount of gene expression data, and evaluate the value of these genes in predicting the overall survival rate (Overall Survival, OS) of patients by constructing a risk - scoring model. To achieve this goal, the research team adopted the following methods: 1. **Data collection**: Downloaded the gene expression data of breast cancer from The Cancer Genome Atlas (TCGA) database, and obtained the immune - related gene list, transcription factor (Transcription Factors, TFs) list and the immune infiltration score of samples from ImmPort, Cistrome Cancer and TIMER databases. 2. **Differential expression analysis**: Identify differentially expressed genes (Differentially Expressed Genes, DEGs) by comparing the gene expression differences between breast cancer tissues and normal tissues. 3. **Univariate Cox regression analysis**: Use univariate Cox regression analysis to screen out prognostic immune - related differentially expressed genes (PIRDEGs) that are significantly related to patient survival. 4. **Multivariate Cox regression analysis and LASSO model**: Further screen out the PIRDEGs with the most prognostic value through multivariate Cox regression analysis and the least absolute shrinkage and selection operator (Least Absolute Shrinkage and Selection Operator, LASSO) model, and finally construct a risk - scoring model containing 15 PIRDEGs. 5. **Model validation**: Use Kaplan - Meier survival analysis and receiver operating characteristic curve (Receiver Operating Characteristic Curve, ROC) analysis to validate the prognostic prediction ability of this model in the training set and the test set. Through the above steps, the research team has successfully identified an immune - related gene feature that can effectively predict the survival of breast cancer patients, which provides new biomarkers for the personalized treatment of breast cancer.