The DoGA consortium expression atlas of promoters and genes in 100 canine tissues.
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
21 Oct 2024
21 Oct 2024
Historique:
received:
14
03
2023
accepted:
13
09
2024
medline:
22
10
2024
pubmed:
22
10
2024
entrez:
21
10
2024
Statut:
epublish
Résumé
The dog, Canis lupus familiaris, is an important model for studying human diseases. Unlike many model organisms, the dog genome has a comparatively poor functional annotation, which hampers gene discovery for development, morphology, disease, and behavior. To fill this gap, we established a comprehensive tissue biobank for both the dog and wolf samples. The biobank consists of 5485 samples representing 132 tissues from 13 dogs, 12 dog embryos, and 24 wolves. In a subset of 100 tissues from nine dogs and 12 embryos, we characterized gene expression activity for each promoter, including alternative and novel, i.e., previously not annotated, promoter regions, using the 5' targeting RNA sequencing technology STRT2-seq. We identified over 100,000 promoter region candidates in the recent canine genome assembly, CanFam4, including over 45,000 highly reproducible sites with gene expression and respective tissue enrichment levels. We provide a promoter and gene expression atlas with interactive, open data resources, including a data coordination center and genome browser track hubs. We demonstrated the applicability of Dog Genome Annotation (DoGA) data and resources using multiple examples spanning canine embryonic development, morphology and behavior, and diseases across species.
Identifiants
pubmed: 39433728
doi: 10.1038/s41467-024-52798-1
pii: 10.1038/s41467-024-52798-1
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
9082Investigateurs
Carsten Daub
(C)
César L Araujo
(CL)
Ileana B Quintero
(IB)
Milla Salonen
(M)
Riika Sarviaho
(R)
Sruthi Hundi
(S)
Jenni Puurunen
(J)
Sini Sulkama
(S)
Sini Karjalainen
(S)
Henna Pekkarinen
(H)
Ilona Kareinen
(I)
Anna Knuuttila
(A)
Hanna-Maaria Javela
(HM)
Laura Tuomisto
(L)
Heli Nordgren
(H)
Karoliina Hagner
(K)
Tarja Jokinen
(T)
Kaarel Krjutskov
(K)
Auli Saarinen
(A)
Rasha Fahad Aljelaify
(RF)
Fiona Ross
(F)
Irene Stevens
(I)
Jeffrey J Schoenebeck
(JJ)
Heini Niinimäki
(H)
Marko Haapakoski
(M)
Informations de copyright
© 2024. The Author(s).
Références
Lohi, H. et al. Expanded repeat in canine epilepsy. Science 307, 81 (2005).
pubmed: 15637270
doi: 10.1126/science.1102832
Lingaas, F. et al. Bayesian mixed model analysis uncovered 21 risk loci for chronic kidney disease in boxer dogs. PLOS Genet. 19, e1010599 (2023).
pubmed: 36693108
pmcid: 9897549
doi: 10.1371/journal.pgen.1010599
Hytönen, M. K. et al. Molecular characterization of three canine models of human rare bone diseases: caffey, van den ende-gupta, and raine syndromes. PLoS Genet. 12, e1006037 (2016).
pubmed: 27187611
pmcid: 4871343
doi: 10.1371/journal.pgen.1006037
Tengvall, K. et al. Bayesian model and selection signature analyses reveal risk factors for canine atopic dermatitis. Commun. Biol. 5, 1348 (2022).
pubmed: 36482174
pmcid: 9731970
doi: 10.1038/s42003-022-04279-8
Kaukonen, M. et al. A missense variant in IFT122 associated with a canine model of retinitis pigmentosa. Hum. Genet. 140, 1569–1579 (2021).
pubmed: 33606121
pmcid: 8519925
doi: 10.1007/s00439-021-02266-3
Kim, J. H. et al. Genomically complex human angiosarcoma and canine hemangiosarcoma establish convergent angiogenic transcriptional programs driven by novel gene fusions. Mol. Cancer Res. MCR 19, 847–861 (2021).
pubmed: 33649193
doi: 10.1158/1541-7786.MCR-20-0937
Evans, J. M. et al. Multi-omics approach identifies germline regulatory variants associated with hematopoietic malignancies in retriever dog breeds. PLoS Genet. 17, e1009543 (2021).
pubmed: 33983928
pmcid: 8118335
doi: 10.1371/journal.pgen.1009543
Rimbault, M. et al. Derived variants at six genes explain nearly half of size reduction in dog breeds. Genome Res. 23, 1985–1995 (2013).
pubmed: 24026177
pmcid: 3847769
doi: 10.1101/gr.157339.113
Drögemüller, C. et al. A mutation in hirless dogs implicates FOXI3 in ectodermal development. Science 321, 1462–1462 (2008).
pubmed: 18787161
doi: 10.1126/science.1162525
Brown, E. A. et al. FGF4 retrogene on CFA12 is responsible for chondrodystrophy and intervertebral disc disease in dogs. Proc. Natl. Acad. Sci. USA. 114, 11476–11481 (2017).
pubmed: 29073074
pmcid: 5664524
doi: 10.1073/pnas.1709082114
Meadows, J. R. S. et al. Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biol. 24, 187 (2023).
pubmed: 37582787
pmcid: 10426128
doi: 10.1186/s13059-023-03023-7
Dutrow, E. V., Serpell, J. A. & Ostrander, E. A. Domestic dog lineages reveal genetic drivers of behavioral diversification. Cell 185, 4737–4755.e18 (2022).
pubmed: 36493753
pmcid: 10478034
doi: 10.1016/j.cell.2022.11.003
Sarviaho, R. et al. A novel genomic region on chromosome 11 associated with fearfulness in dogs. Transl. Psychiatry 10, 1–10 (2020).
doi: 10.1038/s41398-020-0849-z
Noh, H. J. et al. Integrating evolutionary and regulatory information with a multispecies approach implicates genes and pathways in obsessive-compulsive disorder. Nat. Commun. 8, 774 (2017).
pubmed: 29042551
pmcid: 5645406
doi: 10.1038/s41467-017-00831-x
Lindblad-Toh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803–819 (2005).
pubmed: 16341006
doi: 10.1038/nature04338
Hoeppner, M. P. et al. An improved canine genome and a comprehensive catalogue of coding genes and non-coding transcripts. PLoS One 9, e91172 (2014).
pubmed: 24625832
pmcid: 3953330
doi: 10.1371/journal.pone.0091172
Wang, C. et al. A novel canine reference genome resolves genomic architecture and uncovers transcript complexity. Commun. Biol. 4, 185 (2021).
pubmed: 33568770
pmcid: 7875987
doi: 10.1038/s42003-021-01698-x
Halo, J. V. et al. Long-read assembly of a great dane genome highlights the contribution of GC-rich sequence and mobile elements to canine genomes. Proc. Natl. Acad. Sci. USA. 118, e2016274118 (2021).
pubmed: 33836575
pmcid: 7980453
doi: 10.1073/pnas.2016274118
Jagannathan, V. et al. Dog10K_boxer_tasha_1.0: a long-read assembly of the dog reference genome. Genes 12, 847 (2021).
pubmed: 34070911
pmcid: 8228171
doi: 10.3390/genes12060847
Ballard, J. W. O. et al. The Australasian dingo archetype: de novo chromosome-length genome assembly, DNA methylome, and cranial morphology. bioRxiv https://doi.org/10.1101/2023.01.26.525801 (2023).
Megquier, K. et al. BarkBase: Epigenomic annotation of canine genomes. Genes 10, 433 (2019).
pubmed: 31181663
pmcid: 6627511
doi: 10.3390/genes10060433
van Steenbeek, F. G., Hytönen, M. K., Leegwater, P. A. J. & Lohi, H. The canine era: the rise of a biomedical model. Anim. Genet. 47, 519–527 (2016).
pubmed: 27324307
doi: 10.1111/age.12460
Adiconis, X. et al. Comprehensive comparative analysis of 5’-end RNA-sequencing methods. Nat. Methods 15, 505–511 (2018).
pubmed: 29867192
pmcid: 6075671
doi: 10.1038/s41592-018-0014-2
Fantom Consortium & others. A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
Uhlén, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
pubmed: 25613900
doi: 10.1126/science.1260419
Nicholas, F. W. & Hobbs, M. Online Mendelian Inheritance in Animals (OMIA). https://omia.org/ (2012).
Kaukonen, M. et al. Maternal inheritance of a recessive RBP4 effect in canine congenital eye disease. Cell Rep. 23, 2643–2652 (2018).
pubmed: 29847795
pmcid: 6546432
doi: 10.1016/j.celrep.2018.04.118
Deviatiiarov, R. M. et al. An atlas of transcribed human cardiac promoters and enhancers reveals an important role of regulatory elements in heart failure. Nat. Cardiovasc. Res. 2, 58–75 (2023).
pubmed: 39196209
doi: 10.1038/s44161-022-00182-x
Morrill, K. et al. Ancestry-inclusive dog genomics challenges popular breed stereotypes. Science 376, eabk0639 (2022).
pubmed: 35482869
pmcid: 9675396
doi: 10.1126/science.abk0639
Evans, H. E. & Christensen, G. C. Miller’s Anatomy of the Dog 3rd edn, Vol. 1130 (WB Saunders Co, 1993).
van der Spuy, J. et al. The expression of the Leber congenital amaurosis protein AIPL1 coincides with rod and cone photoreceptor development. Invest. Ophthalmol. Vis. Sci. 44, 5396–5403 (2003).
pubmed: 14638743
doi: 10.1167/iovs.03-0686
Sproll, P. et al. Assembling the jigsaw puzzle: CBX2 isoform 2 and its targets in disorders/differences of sex development. Mol. Genet. Genomic Med. 6, 785–795 (2018).
pubmed: 29998616
pmcid: 6160712
doi: 10.1002/mgg3.445
Severin, J. et al. Interactive visualization and analysis of large-scale sequencing datasets using ZENBU. Nat. Biotechnol. 32, 217–219 (2014).
pubmed: 24727769
doi: 10.1038/nbt.2840
Kent, W. J. et al. The human genome browser at UCSC. Genome Res. 12, 996–1006 (2002).
pubmed: 12045153
pmcid: 186604
doi: 10.1101/gr.229102
Roller, M. et al. LINE retrotransposons characterize mammalian tissue-specific and evolutionarily dynamic regulatory regions. Genome Biol. 22, 62 (2021).
pubmed: 33602314
pmcid: 7890895
doi: 10.1186/s13059-021-02260-y
Son, K. H. et al. Integrative mapping of the dog epigenome: reference annotation for comparative intertissue and cross-species studies. Sci. Adv. 9, eade3399 (2023).
pubmed: 37406108
pmcid: 10321747
doi: 10.1126/sciadv.ade3399
Bannasch, D. L. et al. Dog colour patterns explained by modular promoters of ancient canid origin. Nat. Ecol. Evol. 5, 1415–1423 (2021).
pubmed: 34385618
pmcid: 8484016
doi: 10.1038/s41559-021-01524-x
Kaukonen, M. et al. A putative silencer variant in a spontaneous canine model of retinitis pigmentosa. PLoS Genet. 16, e1008659 (2020).
pubmed: 32150541
pmcid: 7082071
doi: 10.1371/journal.pgen.1008659
Niskanen, J. E. et al. Identification of novel genetic risk factors of dilated cardiomyopathy: from canine to human. Genome Med. 15, 73 (2023).
Kirilenko, B. M. et al. Integrating gene annotation with orthology inference at scale. Science 380, eabn3107 (2023).
pubmed: 37104600
pmcid: 10193443
doi: 10.1126/science.abn3107
Islam, S. et al. Highly multiplexed and strand-specific single-cell RNA 5′ end sequencing. Nat. Protoc. 7, 813–828 (2012).
pubmed: 22481528
doi: 10.1038/nprot.2012.022
Islam, S. et al. Quantitative single-cell RNA-seq with unique molecular identifiers. Nat. Methods 11, 163–166 (2014).
pubmed: 24363023
doi: 10.1038/nmeth.2772
Ezer, S. et al. Generation of RNA sequencing libraries for transcriptome analysis of globin-rich tissues of the domestic dog. STAR Protoc. 2, 100995 (2021).
pubmed: 34950881
pmcid: 8672047
doi: 10.1016/j.xpro.2021.100995
Feld, M. et al. The pruritus- and TH2-associated cytokine IL-31 promotes growth of sensory nerves. J. Allergy Clin. Immunol. 138, 500–508.e24 (2016).
pubmed: 27212086
doi: 10.1016/j.jaci.2016.02.020
Körber, I. et al. Gene-expression profiling suggests impaired signaling via the interferon pathway in Cstb-/- Microglia. PLoS One 11, e0158195 (2016).
pubmed: 27355630
pmcid: 4927094
doi: 10.1371/journal.pone.0158195
Hakonen, E. et al. MANF protects human pancreatic beta cells against stress-induced cell death. Diabetologia 61, 2202–2214 (2018).
pubmed: 30032427
pmcid: 6133171
doi: 10.1007/s00125-018-4687-y
Katayama, S. et al. Delineating the healthy human skin UV response and early induction of interferon pathway in cutaneous lupus erythematosus. J. Invest. Dermatol. 139, 2058–2061.e4 (2019).
pubmed: 30974166
doi: 10.1016/j.jid.2019.02.035
Vakkilainen, S. et al. The human long non-coding RNA gene RMRP has pleiotropic effects and regulates cell-cycle progression at G2. Sci. Rep. 9, 13758 (2019).
pubmed: 31551465
pmcid: 6760211
doi: 10.1038/s41598-019-50334-6
Katayama, S. et al. Acute wheeze-specific gene module shows correlation with vitamin D and asthma medication. Eur. Respir. J. 55, 1901330 (2020).
pubmed: 31619476
doi: 10.1183/13993003.01330-2019
Koel, M. et al. Human endometrial cell-type-specific RNA sequencing provides new insights into the embryo-endometrium interplay. Hum. Reprod. Open 2022, hoac043 (2022).
pubmed: 36339249
pmcid: 9632455
doi: 10.1093/hropen/hoac043
Wedenoja, S. et al. Fetal HLA-G mediated immune tolerance and interferon response in preeclampsia. EBioMedicine 59, 102872 (2020).
pubmed: 32680723
pmcid: 7502669
doi: 10.1016/j.ebiom.2020.102872
Lauter, G. et al. Differentiation of ciliated human midbrain-derived LUHMES neurons. J. Cell Sci. 133, jcs249789 (2020).
pubmed: 33115758
doi: 10.1242/jcs.249789
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
pubmed: 25751142
pmcid: 4655817
doi: 10.1038/nmeth.3317
Frith, M. C. et al. A code for transcription initiation in mammalian genomes. Genome Res. 18, 1–12 (2008).
pubmed: 18032727
pmcid: 2134772
doi: 10.1101/gr.6831208
Amezquita, R. A. et al. Orchestrating single-cell analysis with bioconductor. Nat. Methods 17, 137–145 (2020).
pubmed: 31792435
doi: 10.1038/s41592-019-0654-x
Patel, H. & others. nf-core/atacseq: nf-core/atacseq v1. 2.2—Iron Ossifrage. https://github.com/nf-core/atacseq/releases (2022).
Ewels, P. et al. The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 38, 276–278 (2022).
Chang, W. et al. Shiny: Web Application Framework for R. https://shiny.posit.co (2022).
Heinonen, T. et al. A loss-of-function variant in canine GLRA1 associates with a neurological disorder resembling human hyperekplexia. Hum. Genet. 142, 1221–1230 (2023).
Tretyakov, K. Pyliftover: Python Library for Lftover of Genomic oordinates. https://pypi.org/project/pyliftover/ (2019).