Regulatory transposable elements in the encyclopedia of DNA elements.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
31 Aug 2024
Historique:
received: 07 10 2023
accepted: 16 08 2024
medline: 1 9 2024
pubmed: 1 9 2024
entrez: 31 8 2024
Statut: epublish

Résumé

Transposable elements (TEs) comprise ~50% of our genome, but knowledge of how TEs affect genome evolution remains incomplete. Leveraging ENCODE4 data, we provide the most comprehensive study to date of TE contributions to the regulatory genome. We find 236,181 (~25%) human candidate cis-regulatory elements (cCREs) are TE-derived, with over 90% lineage-specific since the human-mouse split, accounting for 8-36% of lineage-specific cCREs. Except for SINEs, cCRE-associated transcription factor (TF) motifs in TEs are derived from ancestral TE sequence more than expected by chance. We show that TEs may adopt similar regulatory activities of elements near their integration site. Since human-mouse divergence, TEs have contributed 3-56% of TF binding site turnover events across 30 examined TFs. Finally, TE-derived cCREs are similar to non-TE cCREs in terms of MPRA activity and GWAS variant enrichment. Overall, our results substantiate the notion that TEs have played an important role in shaping the human regulatory genome.

Identifiants

pubmed: 39217141
doi: 10.1038/s41467-024-51921-6
pii: 10.1038/s41467-024-51921-6
doi:

Substances chimiques

DNA Transposable Elements 0
Transcription Factors 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

7594

Subventions

Organisme : U.S. Department of Health & Human Services | National Institutes of Health (NIH)
ID : R01HG007175
Organisme : U.S. Department of Health & Human Services | National Institutes of Health (NIH)
ID : U01HG009391
Organisme : U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
ID : T32HG000045
Organisme : U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
ID : 2R35GM122550-06

Informations de copyright

© 2024. The Author(s).

Références

McClintock, B. The origin and behavior of mutable loci in maize. Proc. Natl Acad. Sci. USA 36, 344 (1950).
pubmed: 15430309 pmcid: 1063197 doi: 10.1073/pnas.36.6.344
Osmanski, A. B. et al. Insights into mammalian TE diversity through the curation of 248 genome assemblies. Science 380, eabn1430 (2023).
pubmed: 37104570 pmcid: 11103246 doi: 10.1126/science.abn1430
Nurk, S. et al. The complete sequence of a human genome. Science 376, 44–53 (2022).
pubmed: 35357919 pmcid: 9186530 doi: 10.1126/science.abj6987
Wells, J. N. & Feschotte, C. A field guide to eukaryotic transposable elements. Annu. Rev. Genet. 54, 539–561 (2020).
pubmed: 32955944 pmcid: 8293684 doi: 10.1146/annurev-genet-040620-022145
Christmas, M. J. et al. Evolutionary constraint and innovation across hundreds of placental mammals. Science 380, eabn3943 (2023).
pubmed: 37104599 pmcid: 10250106 doi: 10.1126/science.abn3943
Rebollo, R., Romanish, M. T. & Mager, D. L. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu. Rev. Genet. 46, 21–42 (2012).
pubmed: 22905872 doi: 10.1146/annurev-genet-110711-155621
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory activities of transposable elements: from conflicts to benefits. Nat. Rev. Genet. 18, 71–86 (2017).
pubmed: 27867194 doi: 10.1038/nrg.2016.139
Bourque, G. et al. Ten things you should know about transposable elements. Genome Biol. 19, 199 (2018).
pubmed: 30454069 pmcid: 6240941 doi: 10.1186/s13059-018-1577-z
Sundaram, V. & Wysocka, J. Transposable elements as a potent source of diverse cis-regulatory sequences in mammalian genomes. Philos. Trans. R. Soc. B 375, 20190347 (2020).
doi: 10.1098/rstb.2019.0347
Fueyo, R., Judd, J., Feschotte, C. & Wysocka, J. Roles of transposable elements in the regulation of mammalian transcription. Nat. Rev. Mol. Cell Biol. 23, 481–497 (2022).
pubmed: 35228718 pmcid: 10470143 doi: 10.1038/s41580-022-00457-y
Lawson, H. A., Liang, Y. & Wang, T. Transposable elements in mammalian chromatin organization. Nat. Rev. Genet. https://doi.org/10.1038/s41576-023-00609-6 (2023).
Wang, T. et al. Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53. Proc. Natl Acad. Sci. USA 104, 18613–18618 (2007).
pubmed: 18003932 pmcid: 2141825 doi: 10.1073/pnas.0703637104
Chuong, E. B., Elde, N. C. & Feschotte, C. Regulatory evolution of innate immunity through co-option of endogenous retroviruses. Science 351, 1083–1087 (2016).
pubmed: 26941318 pmcid: 4887275 doi: 10.1126/science.aad5497
Sundaram, V. et al. Functional cis-regulatory modules encoded by mouse-specific endogenous retrovirus. Nat. Commun. 8, 14550 (2017).
pubmed: 28348391 pmcid: 5379053 doi: 10.1038/ncomms14550
Du, A. Y. et al. Functional characterization of enhancer activity during a long terminal repeat’s evolution. Genome Res 32, 1840–1851 (2022).
pubmed: 36192170 pmcid: 9712623
Zemojtel, T., Kielbasa, S. M., Arndt, P. F., Chung, H. R. & Vingron, M. Methylation and deamination of CpGs generate p53-binding sites on a genomic scale. Trends Genet. 25, 63–66 (2009).
pubmed: 19101055 doi: 10.1016/j.tig.2008.11.005
Zemojtel, T. et al. CpG deamination creates transcription factor–binding sites with high efficiency. Genome Biol. Evol. 3, 1304–1311 (2011).
pubmed: 22016335 pmcid: 3228489 doi: 10.1093/gbe/evr107
Judd, J., Sanderson, H. & Feschotte, C. Evolution of mouse circadian enhancers from transposable elements. Genome Biol. 22, 1–26 (2021).
doi: 10.1186/s13059-021-02409-9
The ENCODE Project Consortium. et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710 (2020).
doi: 10.1038/s41586-020-2493-4
Roadmap Epigenomics Consortium. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
pmcid: 4530010 doi: 10.1038/nature14248
Trizzino, M., Kapusta, A. & Brown, C. D. Transposable elements generate regulatory novelty in a tissue-specific fashion. BMC Genomics 19, 468 (2018).
pubmed: 29914366 pmcid: 6006921 doi: 10.1186/s12864-018-4850-3
Pehrsson, E. C., Choudhary, M. N. K., Sundaram, V. & Wang, T. The epigenomic landscape of transposable elements across normal human development and anatomy. Nat. Commun. 10, 1–16 (2019).
doi: 10.1038/s41467-019-13555-x
Brocks, D. et al. DNMT and HDAC inhibitors induce cryptic transcription start sites encoded in long terminal repeats. Nat. Genet. 49, 1052–1060 (2017).
pubmed: 28604729 pmcid: 6005702 doi: 10.1038/ng.3889
Schmidt, D. et al. Waves of retrotransposon expansion remodel genome organization and CTCF binding in multiple mammalian lineages. Cell 148, 335–348 (2012).
pubmed: 22244452 pmcid: 3368268 doi: 10.1016/j.cell.2011.11.058
Choudhary, M. N. K. et al. Co-opted transposons help perpetuate conserved higher-order chromosomal structures. Genome Biol. 21, 1–14 (2020).
Choudhary, M. N. K., Quaid, K., Xing, X., Schmidt, H. & Wang, T. Widespread contribution of transposable elements to the rewiring of mammalian 3D genomes. Nat. Commun. 14, 1–12 (2023).
doi: 10.1038/s41467-023-36364-9
Simonti, C. N., Pavličev, M. & Capra, J. A. Transposable element exaptation into regulatory regions is rare, influenced by evolutionary age, and subject to pleiotropic constraints. Mol. Biol. Evol. 34, 2856 (2017).
pubmed: 28961735 pmcid: 5850124 doi: 10.1093/molbev/msx219
Diehl, A. G., Ouyang, N. & Boyle, A. P. Transposable elements contribute to cell and species-specific chromatin looping and gene regulation in mammalian genomes. Nat. Commun. 11, 1–18 (2020).
doi: 10.1038/s41467-020-15520-5
Kuhn, R. M., Haussler, D. & James Kent, W. The UCSC genome browser and associated tools. Brief. Bioinform. 14, 144–161 (2013).
pubmed: 22908213 doi: 10.1093/bib/bbs038
Chinwalla, A. T. et al. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562 (2002).
pubmed: 12466850 doi: 10.1038/nature01262
Jordan, I. K., Rogozin, I. B., Glazko, G. V. & Koonin, E. V. Origin of a substantial fraction of human regulatory sequences from transposable elements. Trends Genet. 19, 68–72 (2003).
pubmed: 12547512 doi: 10.1016/S0168-9525(02)00006-9
Van De Lagemaat, L. N., Landry, J. R., Mager, D. L. & Medstrand, P. Transposable elements in mammals promote regulatory variation and diversification of genes with specialized functions. Trends Genet. 19, 530–536 (2003).
pubmed: 14550626 doi: 10.1016/j.tig.2003.08.004
Lowe, C. B., Bejerano, G. & Haussler, D. Thousands of human mobile element fragments undergo strong purifying selection near developmental genes. Proc. Natl Acad. Sci. USA 104, 8005–8010 (2007).
pubmed: 17463089 pmcid: 1876562 doi: 10.1073/pnas.0611223104
Feschotte, C. Transposable elements and the evolution of regulatory networks. Nat. Rev. Genet. 9, 397–405 (2008).
pubmed: 18368054 pmcid: 2596197 doi: 10.1038/nrg2337
Swergold, G. D. Identification, characterization, and cell specificity of a human LINE-1 promoter. Mol. Cell. Biol. 10, 6718–6729 (1990).
pubmed: 1701022 pmcid: 362950
Minakami, R. et al. Identification of an internal cis-element essential for the human Li transcription and a nuclear factor(s) binding to the element. Nucleic Acids Res. 20, 3139–3145 (1992).
pubmed: 1320255 pmcid: 312450 doi: 10.1093/nar/20.12.3139
Alexandrova, E. A. et al. Sense transcripts originated from an internal part of the human retrotransposon LINE-1 5′ UTR. Gene 511, 46–53 (2012).
pubmed: 22982412 doi: 10.1016/j.gene.2012.09.026
Sun, X. et al. Transcription factor profiling reveals molecular choreography and key regulators of human retrotransposon expression. Proc. Natl. Acad. Sci. USA. https://doi.org/10.1073/pnas.1722565115 (2018).
Stefflova, K. et al. Cooperativity and rapid evolution of cobound transcription factors in closely related mammals. Cell 154, 530–540 (2013).
pubmed: 23911320 pmcid: 3732390 doi: 10.1016/j.cell.2013.07.007
Yue, F. et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature 515, 355–364 (2014).
pubmed: 25409824 pmcid: 4266106 doi: 10.1038/nature13992
Cheng, Y. et al. Principles of regulatory information conservation between mouse and human. Nature 515, 371–375 (2014).
pubmed: 25409826 pmcid: 4343047 doi: 10.1038/nature13985
Vierstra, J. et al. Mouse regulatory DNA landscapes reveal global principles of cis-regulatory evolution. Science 346, 1007–1012 (2014).
pubmed: 25411453 pmcid: 4337786 doi: 10.1126/science.1246426
Sundaram, V. et al. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res. 24, 1963–1976 (2014).
pubmed: 25319995 pmcid: 4248313 doi: 10.1101/gr.168872.113
Agarwal, V. et al. Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types. Preprint at bioRxiv https://doi.org/10.1101/2023.03.05.531189 (2023).
The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
doi: 10.1038/nature15393
Sollis, E. et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 51, D977–D985 (2023).
pubmed: 36350656 doi: 10.1093/nar/gkac1010
Vierstra, J. et al. Global reference mapping of human transcription factor footprints. Nature 583, 729–736 (2020).
pubmed: 32728250 pmcid: 7410829 doi: 10.1038/s41586-020-2528-x
Medstrand, P., Van De Lagemaat, L. N. & Mager, D. L. Retroelement distributions in the human genome: variations associated with age and proximity to genes. Genome Res. 12, 1483–1495 (2002).
pubmed: 12368240 pmcid: 187529 doi: 10.1101/gr.388902
Lynch, V. J., Leclerc, R. D., May, G. & Wagner, G. P. Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals. Nat. Genet. 43, 1154–1159 (2011).
pubmed: 21946353 doi: 10.1038/ng.917
Andrews, G. et al. Mammalian evolution of human cis-regulatory elements and transcription factor binding sites. Science 380, eabn7930 (2023).
pubmed: 37104580 doi: 10.1126/science.abn7930
Villar, D. et al. Enhancer evolution across 20 mammalian species. Cell 160, 554–566 (2015).
pubmed: 25635462 pmcid: 4313353 doi: 10.1016/j.cell.2015.01.006
Pace, J. K. & Feschotte, C. The evolutionary history of human DNA transposons: evidence for intense activity in the primate lineage. Genome Res. 17, 422–432 (2007).
pubmed: 17339369 pmcid: 1832089 doi: 10.1101/gr.5826307
Su, M., Han, D., Boyd-Kirkup, J., Yu, X. & Han, J. D. J. Evolution of Alu elements toward enhancers. Cell Rep. 7, 376–385 (2014).
pubmed: 24703844 doi: 10.1016/j.celrep.2014.03.011
Thompson, P. J., Macfarlan, T. S. & Lorincz, M. C. Long terminal repeats: from parasitic elements to building blocks of the transcriptional regulatory repertoire. Mol. Cell 62, 766–776 (2016).
pubmed: 27259207 pmcid: 4910160 doi: 10.1016/j.molcel.2016.03.029
Ito, J. et al. Systematic identification and characterization of regulatory elements derived from human endogenous retroviruses. PLOS Genet 13, e1006883 (2017).
pubmed: 28700586 pmcid: 5529029 doi: 10.1371/journal.pgen.1006883
Payer, L. M. et al. Structural variants caused by Alu insertions are associated with risks for many human diseases. Proc. Natl Acad. Sci. USA 114, E3984–E3992 (2017).
pubmed: 28465436 pmcid: 5441760 doi: 10.1073/pnas.1704117114
Gribble, S. M. et al. Cytogenetics of the chronic myeloid leukemia-derived cell line K562: karyotype clarification by multicolor fluorescence in situ hybridization, comparative genomic hybridization, and locus-specific fluorescence in situ hybridization. Cancer Genet. Cytogenet. 118, 1–8 (2000).
pubmed: 10731582 doi: 10.1016/S0165-4608(99)00169-7
Naumann, S., Reutzel, D., Speicher, M. & Decker, H. J. Complete karyotype characterization of the K562 cell line by combined application of G-banding, multiplex-fluorescence in situ hybridization, fluorescence in situ hybridization, and comparative genomic hybridization. Leuk. Res. 25, 313–322 (2001).
pubmed: 11248328 doi: 10.1016/S0145-2126(00)00125-9
Zhou, B. et al. Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562. Genome Res. 29, 472–484 (2019).
pubmed: 30737237 pmcid: 6396411 doi: 10.1101/gr.234948.118
Sexton, C. E. & Han, M. V. Paired-end mappability of transposable elements in the human genome. Mob. DNA 10, 1–11 (2019).
doi: 10.1186/s13100-019-0172-5
de Koning, A. P. J., Gu, W., Castoe, T. A., Batzer, M. A. & Pollock, D. D. Repetitive elements may comprise over two-thirds of the human genome. PLOS Genet. 7, e1002384 (2011).
pubmed: 22144907 pmcid: 3228813 doi: 10.1371/journal.pgen.1002384
Matsushima, W., Planet, E. & Trono, D. Ancestral genome reconstruction enhances transposable element annotation by identifying degenerate integrants. Cell Genomics 4, 100497 (2024).
pubmed: 38295789 pmcid: 10879028 doi: 10.1016/j.xgen.2024.100497
Smit, A., Hubley, R. & Green, P. RepeatMasker Open-4.0. 2013–2015 http://www.repeatmasker.org (2015).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
pubmed: 20110278 pmcid: 2832824 doi: 10.1093/bioinformatics/btq033
Siepel, A. et al. Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res. 15, 1034–1050 (2005).
pubmed: 16024819 pmcid: 1182216 doi: 10.1101/gr.3715005
Kulakovskiy, I. V. et al. HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis. Nucleic Acids Res. 46, D252–D259 (2018).
pubmed: 29140464 doi: 10.1093/nar/gkx1106
Needleman, S. B. & Wunsch, C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J. Mol. Biol. 48, 443–453 (1970).
pubmed: 5420325 doi: 10.1016/0022-2836(70)90057-4
Favorov, A. et al. Exploring massive, genome scale datasets with the GenometriCorr package. PLOS Comput. Biol. 8, e1002529 (2012).
pubmed: 22693437 pmcid: 3364938 doi: 10.1371/journal.pcbi.1002529
Frankish, A. et al. GENCODE 2021. Nucleic Acids Res. 49, D916–D923 (2021).
pubmed: 33270111 doi: 10.1093/nar/gkaa1087
Sherry, S. T., Ward, M. & Sirotkin, K. dbSNP—database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res. 9, 677–679 (1999).
pubmed: 10447503 doi: 10.1101/gr.9.8.677
Du, A. Y., Chobirko, J. D., Zhuo, X., Feschotte, C. & Wang, T. Regulatory transposable elements in the encyclopedia of DNA elements. twlab/ENCODE_TE. https://doi.org/10.5281/zenodo.12822146 (2024).

Auteurs

Alan Y Du (AY)

Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA.
The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA.

Jason D Chobirko (JD)

Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.

Xiaoyu Zhuo (X)

Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA.
The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA.

Cédric Feschotte (C)

Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA. cf458@cornell.edu.

Ting Wang (T)

Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA. twang@wustl.edu.
The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA. twang@wustl.edu.
McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA. twang@wustl.edu.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH