Identification of COVID-19 severity biomarkers based on feature selection on single-cell RNA-Seq data of CD8+ T cells.
Jian Lu,Mei Meng,XianChao Zhou,Shijian Ding,KaiYan Feng,Zhenbing Zeng,Tao Huang,Yu-Dong Cai
DOI: https://doi.org/10.3389/fgene.2022.1053772
IF: 3.7
2022-01-01
Frontiers in Genetics
Abstract:The global outbreak of the COVID-19 epidemic has become a major public health problem. COVID-19 virus infection triggers a complex immune response. CD8+ T cells, in particular, play an essential role in controlling the severity of the disease. However, the mechanism of the regulatory role of CD8+ T cells on COVID-19 remains poorly investigated. In this study, single-cell gene expression profiles from three CD8+ T cell subtypes (effector, memory, and naive T cells) were downloaded. Each cell subtype included three disease states, namely, acute COVID-19, convalescent COVID-19, and unexposed individuals. The profiles on each cell subtype were individually analyzed in the same way. Irrelevant features in the profiles were first excluded by the Boruta method. The remaining features for each CD8+ T cells subtype were further analyzed by Max-Relevance and Min-Redundancy, Monte Carlo feature selection, and light gradient boosting machine methods to obtain three feature lists. These lists were then brought into the incremental feature selection method to determine the optimal features for each cell subtype. Their corresponding genes may be latent biomarkers to determine COVID-19 severity. Genes, such as ZFP36, DUSP1, TCR, and IL7R, can be confirmed to play an immune regulatory role in COVID-19 infection and recovery. The results of functional enrichment analysis revealed that these important genes may be associated with immune functions, such as response to cAMP, response to virus, T cell receptor complex, T cell activation, and T cell differentiation. This study further set up different gene expression pattens, represented by classification rules, on three states of COVID-19 and constructed several efficient classifiers to distinguish COVID-19 severity. The findings of this study provided new insights into the biological processes of CD8+ T cells in regulating the immune response.
What problem does this paper attempt to address?