Branching topology of the human embryo transcriptome revealed by entropy sort feature weighting.
Feature selection
Human development
Single cell
Journal
Development (Cambridge, England)
ISSN: 1477-9129
Titre abrégé: Development
Pays: England
ID NLM: 8701744
Informations de publication
Date de publication:
01 May 2024
01 May 2024
Historique:
received:
26
02
2024
accepted:
24
04
2024
medline:
1
5
2024
pubmed:
1
5
2024
entrez:
1
5
2024
Statut:
aheadofprint
Résumé
Analysis of single cell transcriptomics (scRNA-seq) data is typically performed after sub-setting to highly variable genes (HVGs). Here we show that Entropy Sorting provides an alternative mathematical framework for feature selection. On synthetic datasets, continuous entropy sort feature weighting (cESFW) outperforms HVG selection in distinguishing cell state specific genes. We apply cESFW to six merged scRNA-seq datasets spanning human early embryo development. Without smoothing or augmenting the raw counts matrices, cESFW generates a high-resolution embedding displaying coherent developmental progression from 8-cell to post-implantation stages and delineating 15 distinct cell states. The embedding highlights sequential lineage decisions during blastocyst development while unsupervised clustering identifies branch point populations obscured in previous analyses. The first branching region, where morula cells become specified for inner cell mass or trophectoderm, includes cells previously asserted to lack a developmental trajectory. We quantify the relatedness of different pluripotent stem cell cultures to distinct embryo cell types and identify marker genes of naïve and primed pluripotency. Finally, by revealing genes with dynamic lineage-specific expression we provide markers for staging progression from morula to blastocyst.
Identifiants
pubmed: 38691188
pii: 347105
doi: 10.1242/dev.202832
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : Medical Research Council
ID : MR/W025310/1
Pays : United Kingdom
Organisme : European Research Council
ID : Plastinet, contract 835312
Pays : International
Informations de copyright
© 2024. Published by The Company of Biologists Ltd.