The art of seeing the elephant in the room: 2D embeddings of single-cell data do make sense.
Journal
PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922
Informations de publication
Date de publication:
Oct 2024
Oct 2024
Historique:
received:
27
03
2024
accepted:
09
08
2024
medline:
2
10
2024
pubmed:
2
10
2024
entrez:
2
10
2024
Statut:
epublish
Résumé
A recent paper claimed that t-SNE and UMAP embeddings of single-cell datasets are "specious" and fail to capture true biological structure. The authors argued that such embeddings are as arbitrary and as misleading as forcing the data into an elephant shape. Here we show that this conclusion was based on inadequate and limited metrics of embedding quality. More appropriate metrics quantifying neighborhood and class preservation reveal the elephant in the room: while t-SNE and UMAP embeddings of single-cell data do not preserve high-dimensional distances, they can nevertheless provide biologically relevant information.
Identifiants
pubmed: 39356722
doi: 10.1371/journal.pcbi.1012403
pii: PCOMPBIOL-D-24-00510
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e1012403Informations de copyright
Copyright: © 2024 Lause et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. All used datasets are publicly available (see Table A in S1 Text). Our code in Python is available at https://github.com/berenslab/elephant-in-the-room.
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.