Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach

JingXin Ren, Qian Gao, XianChao Zhou, Lei Chen, Wei Guo, KaiYan Feng, Jerry Hu, Tao Huang, Yu-Dong Cai
DOI: https://doi.org/10.1016/j.vaccine.2024.126253
IF: 4.169
2024-08-24
Vaccine
Abstract: Viral infections significantly impact the immune system, and the impact persists until recovery. However, the influence of severe acute respiratory syndrome coronavirus 2 infection on the homeostatic immune status and secondary immune response in recovered patients remains unclear. To investigate these persistent alterations, we employed five feature-ranking algorithms (LASSO, MCFS, RF, CATBoost, and XGBoost), incremental feature selection, the synthetic minority oversampling technique, and two classification algorithms (decision tree and k-nearest neighbors) to analyze multi-omics data (surface proteins and transcriptome) from patients recovered from coronavirus disease 2019 (COVID-19) and healthy controls after influenza vaccination. The single-cell multi-omics dataset was divided into five subsets corresponding to five immune cell subtypes: B cells, CD4+ T cells, CD8+ T cells, monocytes, and natural killer cells. Each cell was represented by 28,402 scRNA-seq (RNA) features, 3 Hash Tag Oligo (HTO) features, 138 Cellular indexing of transcriptomes and epitopes by sequencing (CITE) features, and 23,569 Single Cell Transform (SCT) features. Several multi-omics markers were identified, and effective classifiers were constructed. Our findings indicate a distinct immune status in COVID-19 recovered patients, characterized by low expression of a ribosomal protein (RPS26) and high expression of immune cell surface proteins (CD33, CD48). Notably, TMEM176B, a membrane protein, was highly expressed in monocytes of COVID-19 convalescent patients. These observations aid in discerning molecular differences among immune cell subtypes and contribute to understanding the prolonged effects of COVID-19 on the immune system, which is valuable for treating infectious diseases such as COVID-19.
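The abstract names a rank-then-select workflow: rank the features, then use incremental feature selection to grow the feature set in ranked order and keep the subset on which a classifier performs best. A minimal sketch of that idea follows, using only the Python standard library. It is not the authors' implementation. The ranking function is a simple stand-in (absolute difference of class means), not LASSO, MCFS, RF, CATBoost, or XGBoost; the oversampling step is omitted; the evaluation scheme (leave-one-out accuracy with a 3-nearest-neighbor classifier) and the toy data are assumptions made for illustration, and all function names are hypothetical.

```python
import random


def rank_features(X, y):
    """Stand-in ranking (assumption): absolute difference of class means.

    This is NOT one of the five ranking algorithms named in the abstract.
    Returns feature indices, most discriminative first.
    """
    n_feat = len(X[0])
    scores = []
    for j in range(n_feat):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    return sorted(range(n_feat), key=lambda j: scores[j], reverse=True)


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((u - v) ** 2 for u, v in zip(a, b))


def knn_predict(train_X, train_y, x, k=3):
    """Majority vote among the k nearest training points (binary labels 0/1)."""
    order = sorted(range(len(train_X)), key=lambda i: sq_dist(train_X[i], x))
    votes = sum(train_y[i] for i in order[:k])
    return 1 if 2 * votes > k else 0


def loo_accuracy(X, y, k=3):
    """Leave-one-out accuracy of the k-NN classifier (assumed evaluation scheme)."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        correct += knn_predict(train_X, train_y, X[i], k) == y[i]
    return correct / len(X)


def incremental_feature_selection(X, y, ranking, step=1):
    """Evaluate the top-n ranked features for growing n; keep the best subset."""
    best_n, best_acc = 0, -1.0
    for n in range(step, len(ranking) + 1, step):
        cols = ranking[:n]
        subset = [[row[j] for j in cols] for row in X]
        acc = loo_accuracy(subset, y)
        if acc > best_acc:  # strict '>' prefers the smaller subset on ties
            best_n, best_acc = n, acc
    return ranking[:best_n], best_acc


# Toy data (fabricated for illustration only): feature 0 separates the two
# classes, features 1-3 are pure noise.
rng = random.Random(0)
y = [0] * 20 + [1] * 20
X = [[rng.gauss(6.0 * label, 1.0)] + [rng.gauss(0.0, 1.0) for _ in range(3)]
     for label in y]

ranking = rank_features(X, y)
selected, acc = incremental_feature_selection(X, y, ranking)
print("ranking:", ranking, "selected:", selected, "accuracy:", acc)
```

On real single-cell data the same loop would run once per cell-type subset and per ranking, typically with a larger `step` because each cell carries tens of thousands of features.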