scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets

Yingxin Lin, Shila Ghazanfar, Kevin Y. X. Wang, Johann A. Gagnon-Bartsch, Kitty K. Lo, Xianbin Su, Ze-Guang Han, John T. Ormerod, Terence P. Speed, Pengyi Yang, Jean Yee Hwa Yang
DOI: https://doi.org/10.1073/pnas.1820006116
IF: 11.1
2019-04-26
Proceedings of the National Academy of Sciences
Abstract: Significance. Single-cell RNA-sequencing (scRNA-seq) profiling has exploded in recent years and enabled new biological knowledge to be discovered at the single-cell level. Successful and flexible integration of scRNA-seq datasets from multiple sources promises to be an effective avenue to obtain further biological insights. This study presents a comprehensive approach to integration for scRNA-seq data analysis. It addresses the challenges involved in successful integration of scRNA-seq datasets by using the knowledge of genes that appear not to change across all samples and a robust algorithm to infer pseudoreplicates between datasets. This information is then consolidated into a single-factor model that enables tailored incorporation of prior knowledge. The effectiveness of scMerge is demonstrated by extensive comparison with other approaches.
Subject: multidisciplinary sciences
What problem does this paper attempt to address?