Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research.
Amino Acid Motifs
Animals
Animals, Domestic
/ genetics
Cattle
/ genetics
Chickens
/ genetics
Chromatin Immunoprecipitation Sequencing
Enhancer Elements, Genetic
/ genetics
Epigenesis, Genetic
Epigenomics
Gene Expression Regulation
/ genetics
Genome
/ genetics
Genome-Wide Association Study
Mice
Organ Specificity
/ genetics
Phylogeny
Polymorphism, Single Nucleotide
Regulatory Sequences, Nucleic Acid
/ genetics
Swine
/ genetics
Transcription Factors
/ genetics
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
23 03 2021
23 03 2021
Historique:
received:
26
10
2020
accepted:
01
03
2021
entrez:
24
3
2021
pubmed:
25
3
2021
medline:
15
4
2021
Statut:
epublish
Résumé
Gene regulatory elements are central drivers of phenotypic variation and thus of critical importance towards understanding the genetics of complex traits. The Functional Annotation of Animal Genomes consortium was formed to collaboratively annotate the functional elements in animal genomes, starting with domesticated animals. Here we present an expansive collection of datasets from eight diverse tissues in three important agricultural species: chicken (Gallus gallus), pig (Sus scrofa), and cattle (Bos taurus). Comparative analysis of these datasets and those from the human and mouse Encyclopedia of DNA Elements projects reveal that a core set of regulatory elements are functionally conserved independent of divergence between species, and that tissue-specific transcription factor occupancy at regulatory elements and their predicted target genes are also conserved. These datasets represent a unique opportunity for the emerging field of comparative epigenomics, as well as the agricultural research community, including species that are globally important food resources.
Identifiants
pubmed: 33758196
doi: 10.1038/s41467-021-22100-8
pii: 10.1038/s41467-021-22100-8
pmc: PMC7988148
doi:
Substances chimiques
Transcription Factors
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
1821Références
Adesogan, A. T., Havelaar, A. H., McKune, S. L., Eilittä, M. & Dahl, G. E. Animal source foods: sustainability problem or malnutrition and sustainability solution? Perspective matters. Glob. Food Secur. 25, 100325 (2020).
doi: 10.1016/j.gfs.2019.100325
Wallis, J. W. et al. A physical map of the chicken genome. Nature 432, 761–764 (2004).
pubmed: 15592415
doi: 10.1038/nature03030
Hindorff, L. A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl Acad. Sci. USA 106, 9362–9367 (2009).
pubmed: 19474294
doi: 10.1073/pnas.0903103106
pmcid: 2687147
Consortium, E. P. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306, 636–640 (2004).
doi: 10.1126/science.1105136
Stamatoyannopoulos, J. A. et al. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol. 13, 1–5 (2012).
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
doi: 10.1038/nature11247
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
pubmed: 22955828
pmcid: 3771521
doi: 10.1126/science.1222794
Kundaje, A. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
pubmed: 25693563
pmcid: 4530010
doi: 10.1038/nature14248
Abascal, F. et al. Perspectives on ENCODE. Nature 583, 693–698 (2020).
doi: 10.1038/s41586-020-2449-8
Gorkin, D. U. et al. An atlas of dynamic chromatin landscapes in mouse fetal development. Nature 583, 744–751 (2020).
pubmed: 32728240
pmcid: 7398618
doi: 10.1038/s41586-020-2093-3
Janes, D. E. et al. Reptiles and mammals have differentially retained long conserved noncoding sequences from the amniote ancestor. Genome Biol. Evol. 3, 102–113 (2011).
pubmed: 21183607
doi: 10.1093/gbe/evq087
Sackton, T. B. et al. Convergent regulatory evolution and loss of flight in paleognathous birds. Science 364, 74 (2019).
pubmed: 30948549
doi: 10.1126/science.aat7244
Lowe, C. B., Clarke, J. A., Baker, A. J., Haussler, D. & Edwards, S. V. Feather development genes and associated regulatory innovation predate the origin of Dinosauria. Mol. Biol. Evol. 32, 23–28 (2015).
pubmed: 25415961
doi: 10.1093/molbev/msu309
Seki, R. et al. Functional roles of Aves class-specific cis-regulatory elements on macroevolution of bird-specific features. Nat. Commun. 8, 14229 (2017).
pubmed: 28165450
pmcid: 5473641
doi: 10.1038/ncomms14229
Lekven, A. C. et al. Analysis of the wnt1 regulatory chromosomal landscape. Dev. Genes Evol. 229, 43–52 (2019).
pubmed: 30825002
pmcid: 6500750
doi: 10.1007/s00427-019-00629-5
Foissac, S. et al. Multi-species annotation of transcriptome and chromatin structure in domesticated animals. BMC Biol. 17, 108 (2019).
pubmed: 31884969
pmcid: 6936065
doi: 10.1186/s12915-019-0726-5
Artemov, A. V. et al. Genome-wide DNA methylation profiling reveals epigenetic adaptation of stickleback to marine and freshwater conditions. Mol. Biol. Evol. 34, 2203–2213 (2017).
pubmed: 28873953
doi: 10.1093/molbev/msx156
Andersson, L. et al. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project. Genome Biol. 16, 57 (2015).
pubmed: 25854118
pmcid: 4373242
doi: 10.1186/s13059-015-0622-4
Tuggle, C. K. et al. GO-FAANG meeting: a gathering on Functional Annotation of Animal Genomes. Anim. Genet. 47, 528–533 (2016).
pubmed: 27453069
pmcid: 5082551
doi: 10.1111/age.12466
Burns, E. N. et al. Generation of an equine biobank to be used for Functional Annotation of Animal Genomes project. Anim. Genet. 49, 564–570 (2018).
pubmed: 30311254
pmcid: 6264908
doi: 10.1111/age.12717
Kingsley, B. N. et al. Functionally annotating regulatory elements in the equine genome using histone mark ChIP-Seq. Genes 11, https://doi.org/10.3390/genes11010003 (2019).
Giuffra, E. & Tuggle, C. K. Functional Annotation of Animal Genomes (FAANG): current achievements and roadmap. Annu. Rev. Anim. Biosci. 7, 65–88 (2019).
pubmed: 30427726
doi: 10.1146/annurev-animal-020518-114913
Halstead, M. M. et al. A comparative analysis of chromatin accessibility in cattle, pig, and mouse tissues. BMC Genom. 21, 698 (2020).
doi: 10.1186/s12864-020-07078-9
Clark, E. L. et al. From FAANG to fork: application of highly annotated genomes to improve farmed animal production. Genome Biol. 21, 285 (2020).
pubmed: 33234160
pmcid: 7686664
doi: 10.1186/s13059-020-02197-8
Stergachis, A. B. et al. Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 515, 365–370 (2014).
pubmed: 25409825
pmcid: 4405208
doi: 10.1038/nature13972
Cheng, Y. et al. Principles of regulatory information conservation between mouse and human. Nature 515, 371–375 (2014).
pubmed: 25409826
pmcid: 4343047
doi: 10.1038/nature13985
Johnson, D. S., Mortazavi, A., Myers, R. M. & Wold, B. Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502 (2007).
pubmed: 17540862
doi: 10.1126/science.1141319
Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).
doi: 10.1016/j.cell.2007.05.009
pubmed: 17512414
Crawford, G. E. et al. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res. 16, 123–131 (2006).
pubmed: 16344561
pmcid: 1356136
doi: 10.1101/gr.4074106
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
pubmed: 24097267
pmcid: 3959825
doi: 10.1038/nmeth.2688
Landt, S. G. et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 22, 1813–1831 (2012).
pubmed: 22955991
pmcid: 3431496
doi: 10.1101/gr.136184.111
Ernst, J. & Kellis, M. ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods 9, 215–216 (2012).
pubmed: 22373907
pmcid: 3577932
doi: 10.1038/nmeth.1906
Hoffman, M. M. et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 41, 827–841 (2012).
pubmed: 23221638
pmcid: 3553955
doi: 10.1093/nar/gks1284
Guenther, M. G., Levine, S. S., Boyer, L. A., Jaenisch, R. & Young, R. A. A chromatin landmark and transcription initiation at most promoters in human cells. Cell 130, 77–88 (2007).
pubmed: 17632057
pmcid: 3200295
doi: 10.1016/j.cell.2007.05.042
Mikkelsen, T. S. et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007).
pubmed: 17603471
pmcid: 2921165
doi: 10.1038/nature06008
Nègre, N. et al. A cis-regulatory map of the Drosophila genome. Nature 471, 527–531 (2011).
pubmed: 21430782
pmcid: 3179250
doi: 10.1038/nature09990
Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA 107, 21931 (2010).
pubmed: 21106759
doi: 10.1073/pnas.1016071107
pmcid: 3003124
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43 (2011).
pubmed: 21441907
pmcid: 3088773
doi: 10.1038/nature09906
Botero-Castro, F., Figuet, E., Tilak, M. K., Nabholz, B. & Galtier, N. Avian Genomes Revisited: Hidden Genes Uncovered and the Rates versus Traits Paradox in Birds. Mol. Biol. Evol. 34, 3123–3131 (2017).
Yue, F. et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature 515, 355–364 (2014).
pubmed: 25409824
pmcid: 4266106
doi: 10.1038/nature13992
He, Q. et al. High conservation of transcription factor binding and evidence for combinatorial regulation across six Drosophila species. Nat. Genet. 43, 414–420 (2011).
pubmed: 21478888
doi: 10.1038/ng.808
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
pubmed: 10592173
pmcid: 102409
doi: 10.1093/nar/28.1.27
Neph, S. et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 489, 83–90 (2012).
pubmed: 22955618
pmcid: 3736582
doi: 10.1038/nature11212
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
pubmed: 20513432
pmcid: 2898526
doi: 10.1016/j.molcel.2010.05.004
Smith, R. P. et al. Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model. Nat. Genet. 45, 1021–1028 (2013).
pubmed: 23892608
pmcid: 3775494
doi: 10.1038/ng.2713
Wu, W. et al. The role of Six1 in the genesis of muscle cell and skeletal muscle development. Int. J. Biol. Sci. 10, 983–989 (2014).
pubmed: 25210496
pmcid: 4159689
doi: 10.7150/ijbs.9442
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
pubmed: 22495300
pmcid: 3356448
doi: 10.1038/nature11082
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
pubmed: 25497547
pmcid: 5635824
doi: 10.1016/j.cell.2014.11.021
Kvon, E. Z. et al. Genome-scale functional characterization of Drosophila developmental enhancers in vivo. Nature 512, 91–95 (2014).
doi: 10.1038/nature13395
pubmed: 24896182
Zhang, Y. et al. Chromatin connectivity maps reveal dynamic promoter–enhancer long-range associations. Nature 504, 306–310 (2013).
pubmed: 24213634
pmcid: 3954713
doi: 10.1038/nature12716
Lettice, L. A. et al. A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum. Mol. Genet. 12, 1725–1735 (2003).
pubmed: 12837695
doi: 10.1093/hmg/ddg180
Karlić, R., Chung, H.-R., Lasserre, J., Vlahoviček, K. & Vingron, M. Histone modification levels are predictive for gene expression. Proc. Natl Acad. Sci. USA 107, 2926 (2010).
pubmed: 20133639
doi: 10.1073/pnas.0909344107
pmcid: 2814872
Zhang, Z. & Zhang, M. Q. Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes. BMC Bioinforma. 12, 155 (2011).
doi: 10.1186/1471-2105-12-155
Xiang, R. et al. Genome variants associated with RNA splicing variations in bovine are extensively shared between tissues. BMC Genom. 19, 521 (2018).
doi: 10.1186/s12864-018-4902-8
Xiang, R. et al. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc. Natl Acad. Sci. USA 116, 19398 (2019).
pubmed: 31501319
doi: 10.1073/pnas.1904159116
pmcid: 6765237
Kern, C. et al. Genome-wide identification of tissue-specific long non-coding RNA in three farm animal species. BMC Genom. 19, 684 (2018).
doi: 10.1186/s12864-018-5037-7
Halstead, M. M. et al. Systematic alteration of ATAC-seq for profiling open chromatin in cryopreserved nuclei preparations from livestock tissues. Sci. Rep. 10, 5230–5230 (2020).
pubmed: 32251359
pmcid: 7089989
doi: 10.1038/s41598-020-61678-9
John, S. et al. Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat. Genet. 43, 264–268 (2011).
pubmed: 21258342
pmcid: 6386452
doi: 10.1038/ng.759
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2012).
pubmed: 23104886
pmcid: 3530905
doi: 10.1093/bioinformatics/bts635
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943
pmcid: 2723002
doi: 10.1093/bioinformatics/btp352
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2014).
pubmed: 25260700
pmcid: 4287950
doi: 10.1093/bioinformatics/btu638
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2009).
pubmed: 19910308
pmcid: 2796818
doi: 10.1093/bioinformatics/btp616
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv: 1303.3997 (2013).
Broad Institute. Picard Toolkit. http://broadinstitute.github.io/picard/ (2019).
Kharchenko, P. V., Tolstorukov, M. Y. & Park, P. J. Design and analysis of ChIP-seq experiments for DNA-binding proteins. Nat. Biotechnol. 26, 1351–1359 (2008).
pubmed: 19029915
pmcid: 2597701
doi: 10.1038/nbt.1508
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
pubmed: 24799436
pmcid: 4086134
doi: 10.1093/nar/gku365
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
pubmed: 18798982
pmcid: 2592715
doi: 10.1186/gb-2008-9-9-r137
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
pubmed: 20110278
pmcid: 2832824
doi: 10.1093/bioinformatics/btq033
Kumar, S., Stecher, G., Suleski, M. & Hedges, S. B. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34, 1812–1819 (2017).
pubmed: 28387841
doi: 10.1093/molbev/msx116
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44 (2008).
doi: 10.1038/nprot.2008.211
Gusmao, E. G., Allhoff, M., Zenke, M. & Costa, I. G. Analysis of computational footprinting methods for DNase sequencing experiments. Nat. methods 13, 303–309 (2016).
pubmed: 26901649
doi: 10.1038/nmeth.3772
Li, Z. et al. Identification of transcription factor binding sites using ATAC-seq. Genome Biol. 20, 45 (2019).
pubmed: 30808370
pmcid: 6391789
doi: 10.1186/s13059-019-1642-2
Eisenberg, E. & Levanon, E. Y. Human housekeeping genes, revisited. Trends Genet. 29, 569–574 (2013).
pubmed: 23810203
doi: 10.1016/j.tig.2013.05.010
Lonfat, N. & Duboule, D. Structure, function and evolution of topologically associating domains (TADs) at HOX loci. FEBS Lett. 589, 2869–2876 (2015).
pubmed: 25913784
doi: 10.1016/j.febslet.2015.04.024
Krefting, J., Andrade-Navarro, M. A. & Ibn-Salem, J. Evolutionary stability of topologically associating domains is associated with conserved gene regulation. BMC Biol. 16, 87 (2018).
pubmed: 30086749
pmcid: 6091198
doi: 10.1186/s12915-018-0556-x
Wang, M. et al. Putative bovine topological association domains and CTCF binding motifs can reduce the search space for causative regulatory variants of complex traits. BMC Genom. 19, 395 (2018).
doi: 10.1186/s12864-018-4800-0
Oti, M., Falck, J., Huynen, M. A. & Zhou, H. CTCF-mediated chromatin loops enclose inducible gene regulatory domains. BMC Genom. 17, 252 (2016).
doi: 10.1186/s12864-016-2516-6
Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018 (2011).
pubmed: 21330290
pmcid: 3065696
doi: 10.1093/bioinformatics/btr064
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
pubmed: 12045153
pmcid: 186604
doi: 10.1101/gr.229102
Kern, C. E. A. Functional Annotations of Three Domestic Animal Genomes Provide Vital Resources for Comparative and Agricultural Research. https://github.com/kernco/functional-annotation , https://doi.org/10.5281/zenodo.4540293 (2021).