Scanorama: integrating large and diverse single-cell transcriptomic datasets

Brian L. Hie, Soochi Kim, Thomas A. Rando, Bryan Bryson, Bonnie Berger
DOI: https://doi.org/10.1038/s41596-024-00991-3
IF: 14.8
2024-06-07
Nature Protocols
Abstract: Merging diverse single-cell RNA sequencing (scRNA-seq) data from numerous experiments, laboratories and technologies can uncover important biological insights. Integrating scRNA-seq data is especially challenging, however, when the datasets differ in their cell type compositions. Scanorama offers a robust solution for improving the quality and interpretation of heterogeneous scRNA-seq data by effectively merging information from diverse sources. It is designed to address the technical variation introduced by differences in sample preparation, sequencing depth and experimental batches, which can confound the analysis of multiple scRNA-seq datasets. Here we provide a detailed protocol for using Scanorama within a Scanpy-based single-cell analysis workflow coupled with Google Colaboratory, a free, cloud-based Jupyter notebook service. The protocol involves Scanorama integration, which typically takes 0.5–3 h. Scanorama integration requires a basic understanding of cellular biology, transcriptomic technologies and bioinformatics. Our protocol and new Scanorama–Colaboratory resource should make scRNA-seq integration more widely accessible to researchers.
biochemical research methods
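
The abstract describes running Scanorama inside a Scanpy workflow. The following is a minimal sketch of what that step might look like, not the protocol's own notebook code. It assumes the `scanorama`, `scanpy` and `anndata` Python packages are installed and that each input `AnnData` object has already been filtered and normalized. The helper names `run_scanorama` and `embed_integrated`, the `dimred=50` setting and the batch labels are illustrative choices that do not come from the source.

```python
def run_scanorama(adatas, dimred=50):
    """Integrate a list of AnnData objects in place and return their embeddings.

    `adatas` is a list with one AnnData per batch (assumed preprocessed).
    `dimred` is the embedding dimensionality; 50 is an illustrative value.
    """
    import scanorama  # imported lazily so this file loads without it installed

    # integrate_scanpy stores a low-dimensional integrated embedding on each
    # object under .obsm["X_scanorama"].
    scanorama.integrate_scanpy(adatas, dimred=dimred)
    return [adata.obsm["X_scanorama"] for adata in adatas]


def embed_integrated(adatas, batch_names):
    """Concatenate integrated batches and compute a UMAP on the Scanorama space.

    `batch_names` holds one (hypothetical) label per batch, e.g. ["lab_a", "lab_b"].
    """
    import anndata as ad
    import scanpy as sc

    # Stack cells from all batches; the per-cell embeddings in .obsm are
    # concatenated as well, and the batch label is written to .obs["batch"].
    merged = ad.concat(adatas, label="batch", keys=batch_names, index_unique="-")

    # Build the neighbor graph on the integrated embedding rather than on PCA.
    sc.pp.neighbors(merged, use_rep="X_scanorama")
    sc.tl.umap(merged)
    return merged
```

A typical call sequence would be `run_scanorama(adatas)` followed by `embed_integrated(adatas, ["lab_a", "lab_b"])`, after which the merged object can be plotted with the cells colored by `batch` to check how well the datasets mix.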