f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq

Florian Buettner,Naruemon Pratanwanich,Davis J. McCarthy,John C. Marioni,Oliver Stegle
DOI: https://doi.org/10.1186/s13059-017-1334-8
IF: 17.906
2017-11-07
Genome Biology
Abstract:Single-cell RNA-sequencing (scRNA-seq) allows studying heterogeneity in gene expression in large cell populations. Such heterogeneity can arise due to technical or biological factors, making decomposing sources of variation difficult. We here describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes scRNA-seq datasets into interpretable components, thereby facilitating the identification of novel subpopulations.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?