Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
23 03 2021
Historique:
received: 26 10 2020
accepted: 01 03 2021
entrez: 24 3 2021
pubmed: 25 3 2021
medline: 15 4 2021
Statut: epublish

Résumé

Gene regulatory elements are central drivers of phenotypic variation and thus of critical importance towards understanding the genetics of complex traits. The Functional Annotation of Animal Genomes consortium was formed to collaboratively annotate the functional elements in animal genomes, starting with domesticated animals. Here we present an expansive collection of datasets from eight diverse tissues in three important agricultural species: chicken (Gallus gallus), pig (Sus scrofa), and cattle (Bos taurus). Comparative analysis of these datasets and those from the human and mouse Encyclopedia of DNA Elements projects reveal that a core set of regulatory elements are functionally conserved independent of divergence between species, and that tissue-specific transcription factor occupancy at regulatory elements and their predicted target genes are also conserved. These datasets represent a unique opportunity for the emerging field of comparative epigenomics, as well as the agricultural research community, including species that are globally important food resources.

Identifiants

pubmed: 33758196
doi: 10.1038/s41467-021-22100-8
pii: 10.1038/s41467-021-22100-8
pmc: PMC7988148
doi:

Substances chimiques

Transcription Factors 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

1821

Références

Adesogan, A. T., Havelaar, A. H., McKune, S. L., Eilittä, M. & Dahl, G. E. Animal source foods: sustainability problem or malnutrition and sustainability solution? Perspective matters. Glob. Food Secur. 25, 100325 (2020).
doi: 10.1016/j.gfs.2019.100325
Wallis, J. W. et al. A physical map of the chicken genome. Nature 432, 761–764 (2004).
pubmed: 15592415 doi: 10.1038/nature03030
Hindorff, L. A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc. Natl Acad. Sci. USA 106, 9362–9367 (2009).
pubmed: 19474294 doi: 10.1073/pnas.0903103106 pmcid: 2687147
Consortium, E. P. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 306, 636–640 (2004).
doi: 10.1126/science.1105136
Stamatoyannopoulos, J. A. et al. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome Biol. 13, 1–5 (2012).
Consortium, E. P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
doi: 10.1038/nature11247
Maurano, M. T. et al. Systematic localization of common disease-associated variation in regulatory DNA. Science 337, 1190–1195 (2012).
pubmed: 22955828 pmcid: 3771521 doi: 10.1126/science.1222794
Kundaje, A. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
pubmed: 25693563 pmcid: 4530010 doi: 10.1038/nature14248
Abascal, F. et al. Perspectives on ENCODE. Nature 583, 693–698 (2020).
doi: 10.1038/s41586-020-2449-8
Gorkin, D. U. et al. An atlas of dynamic chromatin landscapes in mouse fetal development. Nature 583, 744–751 (2020).
pubmed: 32728240 pmcid: 7398618 doi: 10.1038/s41586-020-2093-3
Janes, D. E. et al. Reptiles and mammals have differentially retained long conserved noncoding sequences from the amniote ancestor. Genome Biol. Evol. 3, 102–113 (2011).
pubmed: 21183607 doi: 10.1093/gbe/evq087
Sackton, T. B. et al. Convergent regulatory evolution and loss of flight in paleognathous birds. Science 364, 74 (2019).
pubmed: 30948549 doi: 10.1126/science.aat7244
Lowe, C. B., Clarke, J. A., Baker, A. J., Haussler, D. & Edwards, S. V. Feather development genes and associated regulatory innovation predate the origin of Dinosauria. Mol. Biol. Evol. 32, 23–28 (2015).
pubmed: 25415961 doi: 10.1093/molbev/msu309
Seki, R. et al. Functional roles of Aves class-specific cis-regulatory elements on macroevolution of bird-specific features. Nat. Commun. 8, 14229 (2017).
pubmed: 28165450 pmcid: 5473641 doi: 10.1038/ncomms14229
Lekven, A. C. et al. Analysis of the wnt1 regulatory chromosomal landscape. Dev. Genes Evol. 229, 43–52 (2019).
pubmed: 30825002 pmcid: 6500750 doi: 10.1007/s00427-019-00629-5
Foissac, S. et al. Multi-species annotation of transcriptome and chromatin structure in domesticated animals. BMC Biol. 17, 108 (2019).
pubmed: 31884969 pmcid: 6936065 doi: 10.1186/s12915-019-0726-5
Artemov, A. V. et al. Genome-wide DNA methylation profiling reveals epigenetic adaptation of stickleback to marine and freshwater conditions. Mol. Biol. Evol. 34, 2203–2213 (2017).
pubmed: 28873953 doi: 10.1093/molbev/msx156
Andersson, L. et al. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project. Genome Biol. 16, 57 (2015).
pubmed: 25854118 pmcid: 4373242 doi: 10.1186/s13059-015-0622-4
Tuggle, C. K. et al. GO-FAANG meeting: a gathering on Functional Annotation of Animal Genomes. Anim. Genet. 47, 528–533 (2016).
pubmed: 27453069 pmcid: 5082551 doi: 10.1111/age.12466
Burns, E. N. et al. Generation of an equine biobank to be used for Functional Annotation of Animal Genomes project. Anim. Genet. 49, 564–570 (2018).
pubmed: 30311254 pmcid: 6264908 doi: 10.1111/age.12717
Kingsley, B. N. et al. Functionally annotating regulatory elements in the equine genome using histone mark ChIP-Seq. Genes 11, https://doi.org/10.3390/genes11010003 (2019).
Giuffra, E. & Tuggle, C. K. Functional Annotation of Animal Genomes (FAANG): current achievements and roadmap. Annu. Rev. Anim. Biosci. 7, 65–88 (2019).
pubmed: 30427726 doi: 10.1146/annurev-animal-020518-114913
Halstead, M. M. et al. A comparative analysis of chromatin accessibility in cattle, pig, and mouse tissues. BMC Genom. 21, 698 (2020).
doi: 10.1186/s12864-020-07078-9
Clark, E. L. et al. From FAANG to fork: application of highly annotated genomes to improve farmed animal production. Genome Biol. 21, 285 (2020).
pubmed: 33234160 pmcid: 7686664 doi: 10.1186/s13059-020-02197-8
Stergachis, A. B. et al. Conservation of trans-acting circuitry during mammalian regulatory evolution. Nature 515, 365–370 (2014).
pubmed: 25409825 pmcid: 4405208 doi: 10.1038/nature13972
Cheng, Y. et al. Principles of regulatory information conservation between mouse and human. Nature 515, 371–375 (2014).
pubmed: 25409826 pmcid: 4343047 doi: 10.1038/nature13985
Johnson, D. S., Mortazavi, A., Myers, R. M. & Wold, B. Genome-wide mapping of in vivo protein-DNA interactions. Science 316, 1497–1502 (2007).
pubmed: 17540862 doi: 10.1126/science.1141319
Barski, A. et al. High-resolution profiling of histone methylations in the human genome. Cell 129, 823–837 (2007).
doi: 10.1016/j.cell.2007.05.009 pubmed: 17512414
Crawford, G. E. et al. Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res. 16, 123–131 (2006).
pubmed: 16344561 pmcid: 1356136 doi: 10.1101/gr.4074106
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
pubmed: 24097267 pmcid: 3959825 doi: 10.1038/nmeth.2688
Landt, S. G. et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 22, 1813–1831 (2012).
pubmed: 22955991 pmcid: 3431496 doi: 10.1101/gr.136184.111
Ernst, J. & Kellis, M. ChromHMM: automating chromatin-state discovery and characterization. Nat. Methods 9, 215–216 (2012).
pubmed: 22373907 pmcid: 3577932 doi: 10.1038/nmeth.1906
Hoffman, M. M. et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 41, 827–841 (2012).
pubmed: 23221638 pmcid: 3553955 doi: 10.1093/nar/gks1284
Guenther, M. G., Levine, S. S., Boyer, L. A., Jaenisch, R. & Young, R. A. A chromatin landmark and transcription initiation at most promoters in human cells. Cell 130, 77–88 (2007).
pubmed: 17632057 pmcid: 3200295 doi: 10.1016/j.cell.2007.05.042
Mikkelsen, T. S. et al. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 448, 553–560 (2007).
pubmed: 17603471 pmcid: 2921165 doi: 10.1038/nature06008
Nègre, N. et al. A cis-regulatory map of the Drosophila genome. Nature 471, 527–531 (2011).
pubmed: 21430782 pmcid: 3179250 doi: 10.1038/nature09990
Creyghton, M. P. et al. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl Acad. Sci. USA 107, 21931 (2010).
pubmed: 21106759 doi: 10.1073/pnas.1016071107 pmcid: 3003124
Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43 (2011).
pubmed: 21441907 pmcid: 3088773 doi: 10.1038/nature09906
Botero-Castro, F., Figuet, E., Tilak, M. K., Nabholz, B. & Galtier, N. Avian Genomes Revisited: Hidden Genes Uncovered and the Rates versus Traits Paradox in Birds. Mol. Biol. Evol. 34, 3123–3131 (2017).
Yue, F. et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature 515, 355–364 (2014).
pubmed: 25409824 pmcid: 4266106 doi: 10.1038/nature13992
He, Q. et al. High conservation of transcription factor binding and evidence for combinatorial regulation across six Drosophila species. Nat. Genet. 43, 414–420 (2011).
pubmed: 21478888 doi: 10.1038/ng.808
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
pubmed: 10592173 pmcid: 102409 doi: 10.1093/nar/28.1.27
Neph, S. et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature 489, 83–90 (2012).
pubmed: 22955618 pmcid: 3736582 doi: 10.1038/nature11212
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
pubmed: 20513432 pmcid: 2898526 doi: 10.1016/j.molcel.2010.05.004
Smith, R. P. et al. Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model. Nat. Genet. 45, 1021–1028 (2013).
pubmed: 23892608 pmcid: 3775494 doi: 10.1038/ng.2713
Wu, W. et al. The role of Six1 in the genesis of muscle cell and skeletal muscle development. Int. J. Biol. Sci. 10, 983–989 (2014).
pubmed: 25210496 pmcid: 4159689 doi: 10.7150/ijbs.9442
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
pubmed: 22495300 pmcid: 3356448 doi: 10.1038/nature11082
Rao, S. S. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
pubmed: 25497547 pmcid: 5635824 doi: 10.1016/j.cell.2014.11.021
Kvon, E. Z. et al. Genome-scale functional characterization of Drosophila developmental enhancers in vivo. Nature 512, 91–95 (2014).
doi: 10.1038/nature13395 pubmed: 24896182
Zhang, Y. et al. Chromatin connectivity maps reveal dynamic promoter–enhancer long-range associations. Nature 504, 306–310 (2013).
pubmed: 24213634 pmcid: 3954713 doi: 10.1038/nature12716
Lettice, L. A. et al. A long-range Shh enhancer regulates expression in the developing limb and fin and is associated with preaxial polydactyly. Hum. Mol. Genet. 12, 1725–1735 (2003).
pubmed: 12837695 doi: 10.1093/hmg/ddg180
Karlić, R., Chung, H.-R., Lasserre, J., Vlahoviček, K. & Vingron, M. Histone modification levels are predictive for gene expression. Proc. Natl Acad. Sci. USA 107, 2926 (2010).
pubmed: 20133639 doi: 10.1073/pnas.0909344107 pmcid: 2814872
Zhang, Z. & Zhang, M. Q. Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes. BMC Bioinforma. 12, 155 (2011).
doi: 10.1186/1471-2105-12-155
Xiang, R. et al. Genome variants associated with RNA splicing variations in bovine are extensively shared between tissues. BMC Genom. 19, 521 (2018).
doi: 10.1186/s12864-018-4902-8
Xiang, R. et al. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc. Natl Acad. Sci. USA 116, 19398 (2019).
pubmed: 31501319 doi: 10.1073/pnas.1904159116 pmcid: 6765237
Kern, C. et al. Genome-wide identification of tissue-specific long non-coding RNA in three farm animal species. BMC Genom. 19, 684 (2018).
doi: 10.1186/s12864-018-5037-7
Halstead, M. M. et al. Systematic alteration of ATAC-seq for profiling open chromatin in cryopreserved nuclei preparations from livestock tissues. Sci. Rep. 10, 5230–5230 (2020).
pubmed: 32251359 pmcid: 7089989 doi: 10.1038/s41598-020-61678-9
John, S. et al. Chromatin accessibility pre-determines glucocorticoid receptor binding patterns. Nat. Genet. 43, 264–268 (2011).
pubmed: 21258342 pmcid: 6386452 doi: 10.1038/ng.759
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2012).
pubmed: 23104886 pmcid: 3530905 doi: 10.1093/bioinformatics/bts635
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2014).
pubmed: 25260700 pmcid: 4287950 doi: 10.1093/bioinformatics/btu638
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2009).
pubmed: 19910308 pmcid: 2796818 doi: 10.1093/bioinformatics/btp616
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv: 1303.3997 (2013).
Broad Institute. Picard Toolkit. http://broadinstitute.github.io/picard/ (2019).
Kharchenko, P. V., Tolstorukov, M. Y. & Park, P. J. Design and analysis of ChIP-seq experiments for DNA-binding proteins. Nat. Biotechnol. 26, 1351–1359 (2008).
pubmed: 19029915 pmcid: 2597701 doi: 10.1038/nbt.1508
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).
pubmed: 24799436 pmcid: 4086134 doi: 10.1093/nar/gku365
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
pubmed: 18798982 pmcid: 2592715 doi: 10.1186/gb-2008-9-9-r137
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
pubmed: 20110278 pmcid: 2832824 doi: 10.1093/bioinformatics/btq033
Kumar, S., Stecher, G., Suleski, M. & Hedges, S. B. TimeTree: a resource for timelines, timetrees, and divergence times. Mol. Biol. Evol. 34, 1812–1819 (2017).
pubmed: 28387841 doi: 10.1093/molbev/msx116
Huang, D. W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44 (2008).
doi: 10.1038/nprot.2008.211
Gusmao, E. G., Allhoff, M., Zenke, M. & Costa, I. G. Analysis of computational footprinting methods for DNase sequencing experiments. Nat. methods 13, 303–309 (2016).
pubmed: 26901649 doi: 10.1038/nmeth.3772
Li, Z. et al. Identification of transcription factor binding sites using ATAC-seq. Genome Biol. 20, 45 (2019).
pubmed: 30808370 pmcid: 6391789 doi: 10.1186/s13059-019-1642-2
Eisenberg, E. & Levanon, E. Y. Human housekeeping genes, revisited. Trends Genet. 29, 569–574 (2013).
pubmed: 23810203 doi: 10.1016/j.tig.2013.05.010
Lonfat, N. & Duboule, D. Structure, function and evolution of topologically associating domains (TADs) at HOX loci. FEBS Lett. 589, 2869–2876 (2015).
pubmed: 25913784 doi: 10.1016/j.febslet.2015.04.024
Krefting, J., Andrade-Navarro, M. A. & Ibn-Salem, J. Evolutionary stability of topologically associating domains is associated with conserved gene regulation. BMC Biol. 16, 87 (2018).
pubmed: 30086749 pmcid: 6091198 doi: 10.1186/s12915-018-0556-x
Wang, M. et al. Putative bovine topological association domains and CTCF binding motifs can reduce the search space for causative regulatory variants of complex traits. BMC Genom. 19, 395 (2018).
doi: 10.1186/s12864-018-4800-0
Oti, M., Falck, J., Huynen, M. A. & Zhou, H. CTCF-mediated chromatin loops enclose inducible gene regulatory domains. BMC Genom. 17, 252 (2016).
doi: 10.1186/s12864-016-2516-6
Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018 (2011).
pubmed: 21330290 pmcid: 3065696 doi: 10.1093/bioinformatics/btr064
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
pubmed: 12045153 pmcid: 186604 doi: 10.1101/gr.229102
Kern, C. E. A. Functional Annotations of Three Domestic Animal Genomes Provide Vital Resources for Comparative and Agricultural Research. https://github.com/kernco/functional-annotation , https://doi.org/10.5281/zenodo.4540293 (2021).

Auteurs

Colin Kern (C)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Ying Wang (Y)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Xiaoqin Xu (X)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Zhangyuan Pan (Z)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Michelle Halstead (M)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Ganrea Chanthavixay (G)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Perot Saelao (P)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Susan Waters (S)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Ruidong Xiang (R)

Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Melbourne, VIC, Australia.
Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.

Amanda Chamberlain (A)

Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.

Ian Korf (I)

Genome Center, University of California, Davis, Davis, CA, USA.

Mary E Delany (ME)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Hans H Cheng (HH)

USDA-ARS, Avian Disease and Oncology Laboratory, East Lansing, MI, USA.

Juan F Medrano (JF)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Alison L Van Eenennaam (AL)

Department of Animal Science, University of California, Davis, Davis, CA, USA.

Chris K Tuggle (CK)

Department of Animal Science, Iowa State University, Ames, IA, USA.

Catherine Ernst (C)

Department of Animal Science, Michigan State University, East Lansing, MI, USA.

Paul Flicek (P)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.

Gerald Quon (G)

Department of Molecular and Cellular Biology, University of California, David, Davis, CA, USA.

Pablo Ross (P)

Department of Animal Science, University of California, Davis, Davis, CA, USA. pross@ucdavis.edu.

Huaijun Zhou (H)

Department of Animal Science, University of California, Davis, Davis, CA, USA. hzhou@ucdavis.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH