Cell type annotation of single-cell chromatin accessibility data via supervised Bayesian embedding

Xiaoyang Chen,Shengquan Chen,Shuang Song,Zijing Gao,Lin Hou,Xuegong Zhang,Hairong Lv,Rui Jiang
DOI: https://doi.org/10.1038/s42256-021-00432-w
IF: 23.8
2022-02-01
Nature Machine Intelligence
Abstract:Recent advances in single-cell technologies have enabled the characterization of epigenomic heterogeneity at the cellular level. Computational methods for automatic cell type annotation are urgently needed given the exponential growth in the number of cells. In particular, annotation of single-cell chromatin accessibility sequencing (scCAS) data, which can capture the chromatin regulatory landscape that governs transcription in each cell type, has not been fully investigated. Here we propose EpiAnno, a probabilistic generative model integrated with a Bayesian neural network, to annotate scCAS data automatically in a supervised manner. We systematically validate the superior performance of EpiAnno for both intra- and inter-dataset annotation on various datasets. We further demonstrate the advantages of EpiAnno for interpretable embedding and biological implications via expression enrichment analysis, partitioned heritability analysis, enhancer identification, cis-coaccessibility analysis and pathway enrichment analysis. In addition, we show that EpiAnno has the potential to reveal cell type-specific motifs and facilitate scCAS data simulation.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?