Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo.
feature selection
human embryo inner cell mass
single-cell RNA sequencing
Journal
Stem cell reports
ISSN: 2213-6711
Titre abrégé: Stem Cell Reports
Pays: United States
ID NLM: 101611300
Informations de publication
Date de publication:
10 01 2023
10 01 2023
Historique:
received:
08
04
2022
revised:
15
09
2022
accepted:
16
09
2022
pubmed:
15
10
2022
medline:
14
1
2023
entrez:
14
10
2022
Statut:
ppublish
Résumé
A major challenge in single-cell gene expression analysis is to discern meaningful cellular heterogeneity from technical or biological noise. To address this challenge, we present entropy sorting (ES), a mathematical framework that distinguishes genes indicative of cell identity. ES achieves this in an unsupervised manner by quantifying if observed correlations between features are more likely to have occurred due to random chance versus a dependent relationship, without the need for any user-defined significance threshold. On synthetic data, we demonstrate the removal of noisy signals to reveal a higher resolution of gene expression patterns than commonly used feature selection methods. We then apply ES to human pre-implantation embryo single-cell RNA sequencing (scRNA-seq) data. Previous studies failed to unambiguously identify early inner cell mass (ICM), suggesting that the human embryo may diverge from the mouse paradigm. In contrast, ES resolves the ICM and reveals sequential lineage bifurcations as in the classical model. ES thus provides a powerful approach for maximizing information extraction from high-dimensional datasets such as scRNA-seq data.
Identifiants
pubmed: 36240776
pii: S2213-6711(22)00456-8
doi: 10.1016/j.stemcr.2022.09.007
pmc: PMC9859930
pii:
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
47-63Subventions
Organisme : Medical Research Council
ID : G1100526
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/W025310/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P010423/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : G1100526/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/P021573/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : 1943266
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P00072X/2
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/T007044/2
Pays : United Kingdom
Informations de copyright
Copyright © 2022 The Authors. Published by Elsevier Inc. All rights reserved.
Déclaration de conflit d'intérêts
Conflict of interests Sara-Jane Dunn was an employee at Microsoft Research during this study and is currently employed at DeepMind. Microsoft Research provided co-funding for Arthur Radley’s research council studentship and access to computational resources. Neither Microsoft Research nor DeepMind have directed any aspect of the study nor exerted any commercial rights over the results.
Références
Nat Struct Mol Biol. 2013 Sep;20(9):1131-9
pubmed: 23934149
Nat Biotechnol. 2022 Feb;40(2):163-166
pubmed: 35132262
Cold Spring Harb Perspect Med. 2019 Oct 1;9(10):
pubmed: 30617056
Nature. 2016 Sep 1;537(7618):57-62
pubmed: 27556940
Genome Biol. 2019 Dec 23;20(1):295
pubmed: 31870412
Development. 2016 Apr 1;143(7):1063-74
pubmed: 27048685
Cell. 2018 Jul 26;174(3):716-729.e27
pubmed: 29961576
Cell Stem Cell. 2021 Sep 2;28(9):1503-1504
pubmed: 34478626
F1000Res. 2016 Aug 31;5:2122
pubmed: 27909575
Genome Biol. 2020 Aug 27;21(1):218
pubmed: 32854757
Nat Methods. 2018 Jul;15(7):539-542
pubmed: 29941873
Nat Rev Genet. 2019 May;20(5):273-282
pubmed: 30617341
Nat Commun. 2022 Jan 11;13(1):192
pubmed: 35017482
Cell Stem Cell. 2021 Sep 2;28(9):1625-1640.e6
pubmed: 34004179
Cell. 2021 Jun 24;184(13):3573-3587.e29
pubmed: 34062119
Cell Stem Cell. 2021 Jun 3;28(6):1016-1022.e4
pubmed: 33957081
Cell. 2016 May 5;165(4):1012-26
pubmed: 27062923
Dev Biol. 2013 Mar 1;375(1):54-64
pubmed: 23261930
Nature. 2017 Oct 5;550(7674):67-73
pubmed: 28953884
F1000Res. 2018 Nov 2;7:1740
pubmed: 30906525
Nat Rev Nephrol. 2020 Jul;16(7):408-421
pubmed: 32221477
Nat Methods. 2018 Jun;15(6):399-400
pubmed: 29855577
Brief Bioinform. 2019 Jul 19;20(4):1583-1589
pubmed: 29481632
Development. 2018 Feb 7;145(3):
pubmed: 29361568
Nat Commun. 2021 Jun 24;12(1):3942
pubmed: 34168133
PLoS Comput Biol. 2009 Dec;5(12):e1000598
pubmed: 20011106
Bioinformatics. 2007 Oct 1;23(19):2507-17
pubmed: 17720704
Nat Methods. 2009 May;6(5):377-82
pubmed: 19349980
PeerJ. 2019 May 27;7:e6970
pubmed: 31179178
Front Genet. 2021 Feb 24;12:632620
pubmed: 33719342