Coexpression Network Analysis Coupled with Connectivity Map Database Mining Reveals Novel Genetic Biomarkers and Potential Therapeutic Drugs for Polymyositis
Wang Kai,Zhu Rujie,Li Ju,Zhang Zhongyuan,Wen Xin,Chen Hongwei,Sun Lingyun
DOI: https://doi.org/10.1007/s10067-021-06035-5
2022-01-01
Clinical Rheumatology
Abstract:Polymyositis (PM) is a chronic autoimmune connective tissue disease whose pathogenic mechanisms are unclear. This study aimed to identify the main genes and functionally enriched pathways involved in PM using weighted gene coexpression network analysis (WGCNA). To identify the candidate genes of PM, microarray datasets GSE128470, GSE3112, GSE39454 and GSE125977 were obtained from the Gene Expression Omnibus database. The gene network of GSE128470 was constructed, and WGCNA was used to divide genes into different modules. Subsequently, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analyses were applied to the most PM-related modules. The datasets were used to verify the expression profile and diagnostic capabilities of the hub genes. Additionally, gene set enrichment analysis (GSEA) was carried out. Moreover, gene signatures were then used as a search query to explore the connectivity map (CMap). A weighted gene coexpression network was constructed, and the genes were divided into 66 modules. The enriched functions and candidate pathway modules included interferon-γ, type I interferon, cellular response to interferon-γ, neutrophil activation, neutrophil degranulation, neutrophil-mediated immunity and neutrophil activation involved in the immune response. A total of 22 hub genes were identified. The Mann–Whitney U test was performed on these 22 genes using the three datasets of muscle samples and one dataset of whole blood samples, and two genes significantly differentially expressed in all datasets were obtained: VCAM1 and LY96. Receiver operating characteristic curve analysis determined that VCAM1 and LY96 gene expression can distinguish PM from healthy controls (the area under the curve [AUC] was greater than 0.75). Logistic regression analysis was performed on the combination of LY96 and VCAM1. The accuracy, sensitivity, specificity, and AUC of the combination reached 1.0. GSEA of VCAM1 and LY96 revealed their relation to ‘inflammatory response’, ‘TNF-α signalling via NF-κB’, ‘complement’ and ‘myogenesis’. CMap research revealed a few compounds with the potential to counteract the effects of the dysregulated molecular signature in PM. We used WGCNA to observe all aspects of PM, which helped to elucidate the molecular mechanisms of PM onset and progression and provide candidate targets for the diagnosis and treatment of PM.
What problem does this paper attempt to address?