ESCHR: a hyperparameter-randomized ensemble approach for robust clustering across diverse datasets

Sarah M. Goggin and Eli R. Zunder
DOI: https://doi.org/10.1186/s13059-024-03386-5
IF: 17.906
2024-09-19
Genome Biology
Abstract: Clustering is widely used for single-cell analysis, but current methods are limited in accuracy, robustness, ease of use, and interpretability. To address these limitations, we developed an ensemble clustering method that outperforms other methods at hard clustering without the need for hyperparameter tuning. It also performs soft clustering to characterize continuum-like regions and quantify clustering uncertainty, demonstrated here by mapping the connectivity and intermediate transitions between MNIST handwritten digits and between hypothalamic tanycyte subpopulations. This hyperparameter-randomized ensemble approach improves the accuracy, robustness, ease of use, and interpretability of single-cell clustering, and may prove useful in other fields as well.
genetics & heredity, biotechnology & applied microbiology
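
The abstract describes a hyperparameter-randomized ensemble that yields both hard and soft cluster assignments. The sketch below illustrates that general idea only and is not the ESCHR algorithm: it assumes plain k-means as the base clusterer, randomizes only the number of clusters k, and combines runs through a co-association matrix. The function names (`kmeans`, `co_association`, `consensus_labels`, `soft_memberships`), the 0.5 threshold, and the connected-components consensus step are all hypothetical choices made for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, rng, iters=20):
    """Plain Lloyd's k-means (assumed base clusterer); returns one label per point."""
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center.
        labels = [min(range(k), key=lambda c: sq_dist(pt, centers[c])) for pt in points]
        # Move each non-empty center to the mean of its members.
        for c in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels


def co_association(points, n_runs=50, k_range=(2, 6), seed=0):
    """Fraction of runs in which each pair of points shares a cluster.

    The hyperparameter k is drawn at random for every run.
    """
    rng = random.Random(seed)
    n = len(points)
    counts = [[0] * n for _ in range(n)]
    for _ in range(n_runs):
        k = rng.randint(*k_range)
        labels = kmeans(points, k, rng)
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / n_runs for c in row] for row in counts]


def consensus_labels(coassoc, threshold=0.5):
    """Hard clustering: connected components of the thresholded co-association graph."""
    n = len(coassoc)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and coassoc[i][j] > threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


def soft_memberships(coassoc, hard):
    """Soft clustering: normalized mean co-association of each point with each hard cluster."""
    n_clusters = max(hard) + 1
    result = []
    for row in coassoc:
        scores = []
        for c in range(n_clusters):
            members = [j for j, lab in enumerate(hard) if lab == c]
            scores.append(sum(row[j] for j in members) / len(members))
        total = sum(scores)
        result.append([s / total for s in scores])
    return result
```

In this sketch, a point whose soft membership is spread across several consensus clusters was grouped inconsistently across runs, which is one simple way to read clustering uncertainty or a continuum-like region.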
What problem does this paper attempt to address?