Integration of Single-Cell RNA-Seq Datasets: A Review of Computational Methods

Yeonjae Ryu,Geun Hee Han,Eunsoo Jung,Daehee Hwang
DOI: https://doi.org/10.14348/molcells.2023.0009
2023-02-24
Molecules and Cells
Abstract:With the increased number of single-cell RNA sequencing (scRNA-seq) datasets in public repositories, integrative analysis of multiple scRNA-seq datasets has become commonplace. Batch effects among different datasets are inevitable because of differences in cell isolation and handling protocols, library preparation technology, and sequencing platforms. To remove these batch effects for effective integration of multiple scRNA-seq datasets, a number of methodologies have been developed based on diverse concepts and approaches. These methods have proven useful for examining whether cellular features, such as cell subpopulations and marker genes, identified from a certain dataset, are consistently present, or whether their condition-dependent variations, such as increases in cell subpopulations in particular disease-related conditions, are consistently observed in different datasets generated under similar or distinct conditions. In this review, we summarize the concepts and approaches of the integration methods and their pros and cons as has been reported in previous literature.
cell biology,biochemistry & molecular biology
What problem does this paper attempt to address?