CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

Jing Xu,Aidi Zhang,Fang Liu,Liang Chen,Xiujun Zhang
DOI: https://doi.org/10.1093/bib/bbad195
IF: 9.5
2023-05-20
Briefings in Bioinformatics
Abstract:Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at https://github.com/zhanglab-wbgcas/CIForm.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?