A flexible repertoire of transcription factor binding sites and a diversity threshold determines enhancer activity in embryonic stem cells.
Journal
Genome research
ISSN: 1549-5469
Abbreviated title: Genome Res
Country: United States
NLM ID: 9518021
Publication information
Publication date:
04 2021
History:
received: 2020-10-01
accepted: 2021-02-19
pubmed: 2021-03-14
medline: 2021-12-29
entrez: 2021-03-13
Status:
ppublish
Abstract
Transcriptional enhancers are critical for development and phenotype evolution and are often mutated in disease contexts; however, even in well-studied cell types, the sequence code conferring enhancer activity remains unknown. To examine the enhancer regulatory code for pluripotent stem cells, we identified genomic regions with conserved binding of multiple transcription factors in mouse and human embryonic stem cells (ESCs). Examination of these regions revealed that they contain on average 12.6 conserved transcription factor binding site (TFBS) sequences. Enriched TFBSs are a diverse repertoire of 70 different sequences representing the binding sequences of both known and novel ESC regulators. Using a diverse set of TFBSs from this repertoire was sufficient to construct short synthetic enhancers with activity comparable to native enhancers. Site-directed mutagenesis of conserved TFBSs in endogenous enhancers or TFBS deletion from synthetic sequences revealed a requirement for 10 or more different TFBSs. Furthermore, specific TFBSs, including the POU5F1:SOX2 comotif, are dispensable, despite cobinding the POU5F1 (also known as OCT4), SOX2, and NANOG master regulators of pluripotency. These findings reveal that a TFBS sequence diversity threshold overrides the need for optimized regulatory grammar and individual TFBSs that recruit specific master regulators.
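The diversity threshold described in the abstract can be illustrated with a minimal sketch: count how many different motifs occur in a candidate sequence and compare that count with the "10 or more different TFBSs" requirement. This is not the authors' method. The motif names and sequences below are hypothetical placeholders, not the 70 TFBS sequences from the study, and exact string matching stands in for the position-weight-matrix scanning that real TFBS detection would typically use.

```python
# Threshold taken from the abstract: "10 or more different TFBSs".
DIVERSITY_THRESHOLD = 10

# Hypothetical toy motifs for illustration only (not real TFBS sequences).
TOY_MOTIFS = {
    "toy_01": "AACCGGAC", "toy_02": "AACGTGCA", "toy_03": "AAGGCTCA",
    "toy_04": "AATCGGCA", "toy_05": "AAGTCCGA", "toy_06": "AACTGAGC",
    "toy_07": "AATGCCAG", "toy_08": "AAGCATGC", "toy_09": "AACAGTCG",
    "toy_10": "AATTGCGC", "toy_11": "AAGACGTC", "toy_12": "AACGCATG",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def distinct_motifs(sequence, motifs):
    """Return the names of motifs found on either strand of the sequence."""
    seq = sequence.upper()
    found = set()
    for name, motif in motifs.items():
        # A motif counts once, however many times it occurs.
        if motif in seq or reverse_complement(motif) in seq:
            found.add(name)
    return found


def meets_diversity_threshold(sequence, motifs, threshold=DIVERSITY_THRESHOLD):
    """True if the number of different motifs present reaches the threshold."""
    return len(distinct_motifs(sequence, motifs)) >= threshold


if __name__ == "__main__":
    # Build a toy "synthetic enhancer" from ten different motifs with spacers.
    candidate = "NNNN".join(list(TOY_MOTIFS.values())[:10])
    print(sorted(distinct_motifs(candidate, TOY_MOTIFS)))
    print(meets_diversity_threshold(candidate, TOY_MOTIFS))
```

Counting each motif only once reflects the abstract's emphasis on the number of different TFBSs rather than on repeated copies or on any particular site such as the POU5F1:SOX2 comotif.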
Identifiers
pubmed: 33712417
pii: gr.272468.120
doi: 10.1101/gr.272468.120
pmc: PMC8015845
Chemical substances
Transcription Factors
0
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
564-575
Grants
Agency: CIHR
ID: FRN 153186
Country: Canada
Copyright information
© 2021 Singh et al.; Published by Cold Spring Harbor Laboratory Press.