The art of seeing the elephant in the room: 2D embeddings of single-cell data do make sense

Jan Lause,Philipp Berens,Dmitry Kobak
DOI: https://doi.org/10.1371/journal.pcbi.1012403
2024-10-03
PLoS Computational Biology
Abstract:A recent paper claimed that t -SNE and UMAP embeddings of single-cell datasets are "specious" and fail to capture true biological structure. The authors argued that such embeddings are as arbitrary and as misleading as forcing the data into an elephant shape. Here we show that this conclusion was based on inadequate and limited metrics of embedding quality. More appropriate metrics quantifying neighborhood and class preservation reveal the elephant in the room: while t -SNE and UMAP embeddings of single-cell data do not preserve high-dimensional distances, they can nevertheless provide biologically relevant information.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?