Abstract:Background Systemic lupus erythematosus (SLE) is an autoimmune illness caused by a malfunctioning immunomodulatory system. China has the second highest prevalence of SLE in the world, from 0.03% to 0.07%. SLE is diagnosed using a combination of immunological markers, clinical symptoms, and even invasive biopsy. As a result, genetic diagnostic biomarkers for SLE diagnosis are desperately needed. Method From the Gene Expression Omnibus (GEO) database, we downloaded three array data sets of SLE patients’ and healthy people’s peripheral blood mononuclear cells (PBMC) (GSE65391, GSE121239 and GSE61635) as the discovery metadata (n SLE = 1315, n normal = 122), and pooled four data sets (GSE4588, GSE50772, GSE99967, and GSE24706) as the validate data set (n SLE = 146, n normal = 76). We screened the differentially expressed genes (DEGs) between the SLE and control samples, and employed the least absolute shrinkage and selection operator (LASSO) regression, and support vector machine recursive feature elimination (SVM-RFE) analyze to discover possible diagnostic biomarkers. The candidate markers’ diagnostic efficacy was assessed using the receiver operating characteristic (ROC) curve. The reverse transcription quantitative polymerase chain reaction (RT-qPCR) was utilized to confirm the expression of the putative biomarkers using our own Chinese cohort (n SLE = 13, n normal = 10). Finally, the proportion of 22 immune cells in SLE patients was determined using the CIBERSORT algorithm, and the correlations between the biomarkers’ expression and immune cell ratios were also investigated. Results We obtained a total of 284 DEGs and uncovered that they were largely involved in several immune relevant pathways, such as type І interferon signaling pathway, defense response to virus, and inflammatory response. Following that, six candidate diagnostic biomarkers for SLE were selected, namely ABCB1, EIF2AK2, HERC6, ID3, IFI27, and PLSCR1, whose expression levels were validated by the discovery and validation cohort data sets. As a signature, the area under curve (AUC) values of these six genes reached to 0.96 and 0.913, respectively, in the discovery and validation data sets. After that, we checked to see if the expression of ABCB1, IFI27, and PLSCR1 in our own Chinese cohort matched that of the discovery and validation sets. Subsequently, we revealed the potentially disturbed immune cell types in SLE patients using the CIBERSORT analysis, and uncovered the most relevant immune cells with the expression of ABCB1, IFI27, and PLSCR1. Conclusion Our study identified ABCB1, IFI27, and PLSCR1 as potential diagnostic genes for Chinese SLE patients, and uncovered their most relevant immune cells. The findings in this paper provide possible biomarkers for diagnosing Chinese SLE patients.

Development of Clinical Decision Models for the Prediction of Systemic Lupus Erythematosus and Sjogren’s Syndrome Overlap

Development and validation of a predictive model for end-stage renal disease in systemic lupus erythematosus patients

Screening Biomarkers for Systemic Lupus Erythematosus Based on Machine Learning and Exploring Their Expression Correlations With the Ratios of Various Immune Cells

Improving the Diagnosis of Systemic Lupus Erythematosus with Machine Learning Algorithms Based on Real-World Data

Predicting the risk of cardiovascular and cerebrovascular event in systemic lupus erythematosus: a Chinese SLE treatment and research group study XXVI

LSO-080 Machine-learning Approach on Lupus Low Disease Activity Prediction

Exploration of Biomarkers for Systemic Lupus Erythematosus by Machine-Learning Analysis

Prognosis for Hospitalized Patients with Systemic Lupus Erythematosus in China: 5-Year Update of the Jiangsu Cohort

Machine learning models predicts risk of proliferative lupus nephritis

Development and Verify of Survival Analysis Models for Chinese Patients With Systemic Lupus Erythematosus

Identification of a gene-expression predictor for diagnosis and personalized stratification of lupus patients

Development of a nomogram for membranous nephropathy prediction in patients with primary Sjögren's syndrome: a 6-year retrospective study

Novel multiclass classification machine learning approach for the early-stage classification of systemic autoimmune rheumatic diseases

Using combination of albumin to fibrinogen ratio and prognostic nutritional index model for predicting disease activity in patients with systemic lupus erythematosus

Identification of Crucial Genes for Predicting the Risk of Atherosclerosis with System Lupus Erythematosus Based on Comprehensive Bioinformatics Analysis and Machine Learning

Development and external validation of a prediction model for interstitial lung disease in systemic lupus erythematosus patients: A cross-sectional study

Raman spectroscopy combined with machine learning algorithms for rapid detection Primary Sjögren's syndrome associated with interstitial lung disease

Identification of Diagnostic Biomarkers in Systemic Lupus Erythematosus Based on Bioinformatics Analysis and Machine Learning

Establishment and evaluation of a risk prediction model for coronary heart disease in primary Sjögren's syndrome based on peripheral blood IL-6 and Treg percentages

Predicting the Risk of Fundus Lesions in Systemic Lupus Erythematosus: A Nomogram Model

Identification of novel biomarkers for childhood-onset systemic lupus erythematosus using machine learning algorithms and immune infiltration analysis