An entropy-based metric for assessing the purity of single cell populations

Baolin Liu,Chenwei Li,Ziyi Li,Dongfang Wang,Xianwen Ren,Zemin Zhang
DOI: https://doi.org/10.1038/s41467-020-16904-3
IF: 16.6
2020-06-22
Nature Communications
Abstract:Abstract Single-cell RNA sequencing (scRNA-seq) is a versatile tool for discovering and annotating cell types and states, but the determination and annotation of cell subtypes is often subjective and arbitrary. Often, it is not even clear whether a given cluster is uniform. Here we present an entropy-based statistic, ROGUE, to accurately quantify the purity of identified cell clusters. We demonstrate that our ROGUE metric is broadly applicable, and enables accurate, sensitive and robust assessment of cluster purity on a wide range of simulated and real datasets. Applying this metric to fibroblast, B cell and brain data, we identify additional subtypes and demonstrate the application of ROGUE-guided analyses to detect precise signals in specific subpopulations. ROGUE can be applied to all tested scRNA-seq datasets, and has important implications for evaluating the quality of putative clusters, discovering pure cell subtypes and constructing comprehensive, detailed and standardized single cell atlas.
multidisciplinary sciences
What problem does this paper attempt to address?