Comprehensive molecular analyses of an autoimmune-related gene predictive model and immune infiltrations using machine learning methods in moyamoya disease
Shifu Li,Ying Han,Qian Zhang,Dong Tang,Jian Li,Ling Weng
DOI: https://doi.org/10.3389/fmolb.2022.991425
IF: 6.113
2022-12-21
Frontiers in Molecular Biosciences
Abstract:Background: Growing evidence suggests the links between moyamoya disease (MMD) and autoimmune diseases. However, the molecular mechanism from genetic perspective remains unclear. This study aims to clarify the potential roles of autoimmune-related genes (ARGs) in the pathogenesis of MMD. Methods: Two transcription profiles (GSE157628 and GSE141025) of MMD were downloaded from GEO databases. ARGs were obtained from the Gene and Autoimmune Disease Association Database (GAAD) and DisGeNET databases. Differentially expressed ARGs (DEARGs) were identified using "limma" R packages. GO, KEGG, GSVA, and GSEA analyses were conducted to elucidate the underlying molecular function. There machine learning methods (LASSO logistic regression, random forest (RF), support vector machine-recursive feature elimination (SVM-RFE)) were used to screen out important genes. An artificial neural network was applied to construct an autoimmune-related signature predictive model of MMD. The immune characteristics, including immune cell infiltration, immune responses, and HLA gene expression in MMD, were explored using ssGSEA. The miRNA-gene regulatory network and the potential therapeutic drugs for hub genes were predicted. Results: A total of 260 DEARGs were identified in GSE157628 dataset. These genes were involved in immune-related pathways, infectious diseases, and autoimmune diseases. We identified six diagnostic genes by overlapping the three machine learning algorithms: CD38, PTPN11, NOTCH1, TLR7, KAT2B, and ISG15. A predictive neural network model was constructed based on the six genes and presented with great diagnostic ability with area under the curve (AUC) = 1 in the GSE157628 dataset and further validated by GSE141025 dataset. Immune infiltration analysis showed that the abundance of eosinophils, natural killer T (NKT) cells, Th2 cells were significant different between MMD and controls. The expression levels of HLA-A, HLA-B, HLA-C, HLA-DMA, HLA-DRB6, HLA-F, and HLA-G were significantly upregulated in MMD. Four miRNAs (mir-26a-5p, mir-1343-3p, mir-129-2-3p, and mir-124-3p) were identified because of their interaction at least with four hub DEARGs. Conclusion: Machine learning was used to develop a reliable predictive model for the diagnosis of MMD based on ARGs. The uncovered immune infiltration and gene-miRNA and gene-drugs regulatory network may provide new insight into the pathogenesis and treatment of MMD.
biochemistry & molecular biology