ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data

Yang Li,Mingcong Wu,Shuangge Ma,Mengyun Wu
DOI: https://doi.org/10.1186/s13059-023-03046-0
IF: 17.906
2023-09-13
Genome Biology
Abstract:Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can promote biological understanding of cell heterogeneity. In this study, we propose a zero-inflated negative binomial mixture model (ZINBMM) that simultaneously achieves effective scRNA-seq data clustering and gene selection. ZINBMM conducts a systemic analysis on raw counts, accommodating both batch effects and dropout events. Simulations and the analysis of five scRNA-seq datasets demonstrate the practical applicability of ZINBMM.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?