Automated Cell Type Annotation with Reference Cluster Mapping

Valerio Galanti,Lingting Shi,Elham Azizi,Yining Liu,Andrew J Blumberg
DOI: https://doi.org/10.1101/2024.11.30.626130
2024-12-05
Abstract:RNA sequencing (scRNA-seq) technologies have revolutionized our understanding of cellular heterogeneity. However, the characterization of scRNA-seq datasets remains challenging. We introduce a novel computational method that significantly enhances the annotation of scRNA clusters of a query dataset using established datasets as references. RefCM leverages optimal transport to measure the similarity in gene expression distributions between clusters and solves an integer program to optimally link the query and reference datasets based on this metric. Our algorithm produces more accurate cross-technology, cross-tissue, and cross-species mappings than any other currently available method. We demonstrate the efficacy of our method on a variety of benchmark datasets, showcasing its robustness and applicability across diverse biological contexts.
Biology
What problem does this paper attempt to address?