Identification of Crucial Genes for Predicting the Risk of Atherosclerosis with System Lupus Erythematosus Based on Comprehensive Bioinformatics Analysis and Machine Learning

Chunjiang Liu, Yufei Zhou, Yue Zhou, Xiaoqi Tang, Liming Tang, Jiajia Wang
DOI: https://doi.org/10.1016/j.compbiomed.2022.106388
IF: 7.7
2022-01-01
Computers in Biology and Medicine
Abstract:
BACKGROUND: Systemic lupus erythematosus (SLE) has become a major public health problem, and atherosclerosis (AS) is one of its main complications, associated with serious cardiovascular consequences in this patient population. The present study aimed to identify potential biomarkers for SLE patients with AS.
METHODS: Five microarray datasets (GSE50772, GSE81622, GSE100927, GSE28829, GSE37356) were downloaded from the NCBI Gene Expression Omnibus database. The Limma package was used to identify differentially expressed genes (DEGs) in AS. Weighted gene coexpression network analysis (WGCNA) was used to identify significant module genes associated with SLE. Functional enrichment analysis, protein-protein interaction (PPI) network construction, and machine learning algorithms (least absolute shrinkage and selection operator (Lasso), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), and random forest) were applied to identify hub genes. Subsequently, we generated a nomogram and a receiver operating characteristic (ROC) curve for predicting the risk of AS in SLE patients. Finally, immune cell infiltration was analyzed, and Consensus Cluster Analysis was conducted based on Single Sample Gene Set Enrichment Analysis (ssGSEA) scores.
RESULTS: Five hub genes (SPI1, MMP9, C1QA, CX3CR1, and MNDA) were identified and used to establish a nomogram that yielded high predictive performance (area under the curve 0.900-0.981). Dysregulated immune cell infiltration was found in AS and correlated positively with the five hub genes. Consensus clustering showed that the optimal number of subtypes was 3. Compared to subtypes A and B, subtype C presented higher expression of the five hub genes, higher immune cell infiltration levels, and higher immune checkpoint expression.
CONCLUSION: Our study systematically identified five candidate hub genes (SPI1, MMP9, C1QA, CX3CR1, MNDA) and established a nomogram that could predict the risk of AS in patients with SLE using various bioinformatic analyses and machine learning algorithms. Our findings provide a foothold for future studies on potential crucial genes for AS in SLE patients. Additionally, the dysregulated immune cell proportions and immune checkpoint expression in AS with SLE were identified.
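The abstract reports the nomogram's performance as an area under the ROC curve (AUC). As a minimal sketch of what that number measures, the following computes AUC as the probability that a randomly chosen positive case receives a higher risk score than a randomly chosen negative case. The function name `roc_auc` and the labels and scores are hypothetical and purely illustrative; they are not data or code from the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical risk scores (illustrative only, not taken from the paper).
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
auc = roc_auc(labels, scores)
print(auc)  # 15 of 16 pairs are ranked correctly -> 0.9375
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation. The pairwise loop is quadratic in sample size, which is acceptable for a sketch but not for large cohorts.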