A multi-tissue atlas of regulatory variants in cattle.
Journal
Nature genetics
ISSN: 1546-1718
Titre abrégé: Nat Genet
Pays: United States
ID NLM: 9216904
Informations de publication
Date de publication:
09 2022
09 2022
Historique:
received:
23
11
2020
accepted:
07
07
2022
pubmed:
12
8
2022
medline:
16
9
2022
entrez:
11
8
2022
Statut:
ppublish
Résumé
Characterization of genetic regulatory variants acting on livestock gene expression is essential for interpreting the molecular mechanisms underlying traits of economic value and for increasing the rate of genetic gain through artificial selection. Here we build a Cattle Genotype-Tissue Expression atlas (CattleGTEx) as part of the pilot phase of the Farm animal GTEx (FarmGTEx) project for the research community based on 7,180 publicly available RNA-sequencing (RNA-seq) samples. We describe the transcriptomic landscape of more than 100 tissues/cell types and report hundreds of thousands of genetic associations with gene expression and alternative splicing for 23 distinct tissues. We evaluate the tissue-sharing patterns of these genetic regulatory effects, and functionally annotate them using multiomics data. Finally, we link gene expression in different tissues to 43 economically important traits using both transcriptome-wide association and colocalization analyses to decipher the molecular regulatory mechanisms underpinning such agronomic traits in cattle.
Identifiants
pubmed: 35953587
doi: 10.1038/s41588-022-01153-5
pii: 10.1038/s41588-022-01153-5
pmc: PMC7613894
mid: EMS157770
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
1438-1447Subventions
Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/D/10002070
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/D/30002275
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P015514/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/R025851/1
Pays : United Kingdom
Informations de copyright
© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.
Références
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
pubmed: 30445434
doi: 10.1093/nar/gky1120
Hu, Z. L., Park, C. A. & Reecy, J. M. Building a livestock genetic and genomic information knowledgebase through integrative developments of Animal QTLdb and CorrDB. Nucleic Acids Res. 47, D701–D710 (2019).
pubmed: 30407520
doi: 10.1093/nar/gky1084
GTEx Consortium The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
doi: 10.1126/science.aaz1776
Fang, L. et al. Comprehensive analyses of 723 transcriptomes enhance genetic and biological interpretations for complex traits in cattle. Genome Res. 30, 790–801 (2020).
pubmed: 32424068
pmcid: 7263193
doi: 10.1101/gr.250704.119
Xiang, R. et al. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc. Natl Acad. Sci. USA 116, 19398–19408 (2019).
pubmed: 31501319
pmcid: 6765237
doi: 10.1073/pnas.1904159116
Prowse-Wilkins, C. P. et al. Putative causal variants are enriched in annotated functional regions from six bovine tissues. Front. Genet. 12, 664379 (2021).
pubmed: 34249087
pmcid: 8260860
doi: 10.3389/fgene.2021.664379
Xiang, R. et al. Genome variants associated with RNA splicing variations in bovine are extensively shared between tissues. BMC Genomics 19, 521 (2018).
pubmed: 29973141
pmcid: 6032541
doi: 10.1186/s12864-018-4902-8
Cesar, A. S. M. et al. Identification of putative regulatory regions and transcription factors associated with intramuscular fat content traits. BMC Genomics 19, 499 (2018).
pubmed: 29945546
pmcid: 6020320
doi: 10.1186/s12864-018-4871-y
Littlejohn, M. D. et al. Sequence-based association analysis reveals an MGST1 eQTL with pleiotropic effects on bovine milk composition. Sci. Rep. 6, 25376 (2016).
pubmed: 27146958
pmcid: 4857175
doi: 10.1038/srep25376
Hayes, B. J. & Daetwyler, H. D. 1000 Bull Genomes Project to map simple and complex genetic traits in cattle: applications and outcomes. Annu. Rev. Anim. Biosci. 7, 89–102 (2019).
pubmed: 30508490
doi: 10.1146/annurev-animal-020518-115024
Deelen, P. et al. Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels. Genome Med. 7, 30 (2015).
pubmed: 25954321
pmcid: 4423486
doi: 10.1186/s13073-015-0152-4
Hombach, S. & Kretz, M. Non-coding RNAs: classification, biology and functioning. Adv. Exp. Med. Biol. 937, 3–17 (2016).
pubmed: 27573892
doi: 10.1007/978-3-319-42059-2_1
GTEx Consortium et al.Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
pmcid: 4547484
doi: 10.1126/science.1262110
Tenesa, A. et al. Estimation of linkage disequilibrium in a sample of the United Kingdom dairy cattle population using unphased genotypes. J. Anim. Sci. 81, 617–623 (2003).
pubmed: 12661641
doi: 10.2527/2003.813617x
Fang, L. et al. Functional annotation of the cattle genome through systematic discovery and characterization of chromatin states and butyrate-induced variations. BMC Biol. 17, 68 (2019).
pubmed: 31419979
pmcid: 6698049
doi: 10.1186/s12915-019-0687-8
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
pubmed: 22495300
pmcid: 3356448
doi: 10.1038/nature11082
Rosen, B.D. et al. De novo assembly of the cattle reference genome with single-molecule sequencing. Gigascience 9, giaa021 (2020).
pubmed: 32191811
pmcid: 7081964
doi: 10.1093/gigascience/giaa021
Jiang, J. et al. Functional annotation and Bayesian fine-mapping reveals candidate genes for important agronomic traits in Holstein bulls. Commun. Biol. 2, 212 (2019).
pubmed: 31240250
pmcid: 6582147
doi: 10.1038/s42003-019-0454-y
Freebern, E. et al. GWAS and fine-mapping of livability and six disease traits in Holstein cattle. BMC Genomics 21, 41 (2020).
pubmed: 31931710
pmcid: 6958677
doi: 10.1186/s12864-020-6461-z
Fang, L. et al. Genetic and epigenetic architecture of paternal origin contribute to gestation length in cattle. Commun. Biol. 2, 100 (2019).
pubmed: 30886909
pmcid: 6418173
doi: 10.1038/s42003-019-0341-6
Gao, Y. et al. Single-cell transcriptomic analyses of dairy cattle ruminal epithelial cells during weaning. Genomics 113, 2045–2055 (2021).
pubmed: 33933592
doi: 10.1016/j.ygeno.2021.04.039
Kim-Hellmuth, S. et al. Cell type-specific genetic regulation of gene expression across human tissues. Science 369, eaaz8528 (2020).
pubmed: 32913075
pmcid: 8051643
doi: 10.1126/science.aaz8528
Clark, E. L. et al. From FAANG to fork: application of highly annotated genomes to improve farmed animal production. Genome Biol. 21, 285 (2020).
pubmed: 33234160
pmcid: 7686664
doi: 10.1186/s13059-020-02197-8
Xiang, R. D. Genome-wide fine-mapping identifies pleiotropic and functional variants that predict many traits across global cattle populations. Nat. Commun. 12, 860 (2021).
pubmed: 33558518
pmcid: 7870883
doi: 10.1038/s41467-021-21001-0
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
pubmed: 24695404
pmcid: 4103590
doi: 10.1093/bioinformatics/btu170
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
pubmed: 27560171
pmcid: 5032908
doi: 10.1038/nprot.2016.095
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
pubmed: 24227677
doi: 10.1093/bioinformatics/btt656
Li, Y. I. et al. Annotation-free quantification of RNA splicing using LeafCutter. Nat. Genet. 50, 151–158 (2018).
pubmed: 29229983
doi: 10.1038/s41588-017-0004-9
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11.10.1–11.10.33 (2013).
doi: 10.1002/0471250953.bi1110s43
Browning, B. L., Zhou, Y. & Browning, S. R. A one-penny imputed genome from next-generation reference panels. Am. J. Hum. Genet. 103, 338–348 (2018).
pubmed: 30100085
pmcid: 6128308
doi: 10.1016/j.ajhg.2018.07.015
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
pubmed: 25722852
pmcid: 4342193
doi: 10.1186/s13742-015-0047-8
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
pubmed: 17194218
pmcid: 1713260
doi: 10.1371/journal.pgen.0020190
Castel, S. E., Mohammadi, P., Chung, W. K., Shen, Y. & Lappalainen, T. Rare variant phasing and haplotypic expression from RNA sequencing with phASER. Nat. Commun. 7, 12817 (2016).
pubmed: 27605262
pmcid: 5025529
doi: 10.1038/ncomms12817
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
pubmed: 21493656
pmcid: 3102221
doi: 10.1093/bioinformatics/btr167
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
pubmed: 19451168
pmcid: 2705234
doi: 10.1093/bioinformatics/btp324
Ramirez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun. 9, 189 (2018).
pubmed: 29335486
pmcid: 5768762
doi: 10.1038/s41467-017-02525-w
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
pubmed: 26619908
pmcid: 4665391
doi: 10.1186/s13059-015-0831-x
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286
pmcid: 3322381
doi: 10.1038/nmeth.1923
Song, Q. et al. A reference methylome database and analysis pipeline to facilitate integrative and comparative epigenomics. PLoS ONE 8, e81148 (2013).
pubmed: 24324667
pmcid: 3855694
doi: 10.1371/journal.pone.0081148
Liu, H. et al. Systematic identification and annotation of human methylation marks based on bisulfite sequencing methylomes reveals distinct roles of cell type-specific hypomethylation in the regulation of cell identity genes. Nucleic Acids Res. 44, 75–94 (2016).
pubmed: 26635396
doi: 10.1093/nar/gkv1332
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
pubmed: 22343431
pmcid: 3398141
doi: 10.1038/nprot.2011.457
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
pubmed: 26708335
doi: 10.1093/bioinformatics/btv722
Mohammadi, P., Castel, S. E., Brown, A. A. & Lappalainen, T. Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change. Genome Res. 27, 1872–1884 (2017).
pubmed: 29021289
pmcid: 5668944
doi: 10.1101/gr.216747.116
Wen, X., Lee, Y., Luca, F. & Pique-Regi, R. Efficient integrative multi-SNP association analysis via deterministic approximation of posteriors. Am. J. Hum. Genet. 98, 1114–1129 (2016).
pubmed: 27236919
pmcid: 4908152
doi: 10.1016/j.ajhg.2016.03.029
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
pubmed: 20616382
pmcid: 2922887
doi: 10.1093/bioinformatics/btq340
Urbut, S. M., Wang, G., Carbonetto, P. & Stephens, M. Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions. Nat. Genet. 51, 187–195 (2019).
pubmed: 30478440
doi: 10.1038/s41588-018-0268-8
Derrien, T. et al. Fast computation and applications of genome mappability. PLoS ONE 7, e30377 (2012).
pubmed: 22276185
pmcid: 3261895
doi: 10.1371/journal.pone.0030377
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
pubmed: 22492648
pmcid: 3348564
doi: 10.1093/bioinformatics/bts163
Saha, A. & Battle, A. False positives in trans-eQTL and co-expression analyses arising from RNA-sequencing alignment errors. F1000Res. 7, 1860 (2018).
pubmed: 30613398
doi: 10.12688/f1000research.17145.1
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46, 100–106 (2014).
pubmed: 24473328
pmcid: 3989144
doi: 10.1038/ng.2876
Barbeira, A. N. et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9, 1825 (2018).
pubmed: 29739930
pmcid: 5940825
doi: 10.1038/s41467-018-03621-1
Barbeira, A. N. et al. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 15, e1007889 (2019).
pubmed: 30668570
pmcid: 6358100
doi: 10.1371/journal.pgen.1007889
Wen, X., Pique-Regi, R. & Luca, F. Integrating molecular QTL data into genome-wide genetic association analysis: probabilistic assessment of enrichment and colocalization. PLoS Genet. 13, e1006646 (2017).
pubmed: 28278150
pmcid: 5363995
doi: 10.1371/journal.pgen.1006646
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
pubmed: 24830394
pmcid: 4022491
doi: 10.1371/journal.pgen.1004383
Heger, A., Webber, C., Goodson, M., Ponting, C. P. & Lunter, G. GAT: a simulation framework for testing the association of genomic intervals. Bioinformatics 29, 2046–2048 (2013).
pubmed: 23782611
pmcid: 3722528
doi: 10.1093/bioinformatics/btt343
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
pubmed: 22455463
pmcid: 3339379
doi: 10.1089/omi.2011.0118
Liu, S. et al. A multi-tissue atlas of regulatory variants in cattle. Code resource at github website. GitHub: https://github.com/shuliliu/cattleGTEx ; Zenodo: https://doi.org/10.5281/zenodo.6510550