A multi-tissue atlas of regulatory variants in cattle.


Journal

Nature genetics
ISSN: 1546-1718
Titre abrégé: Nat Genet
Pays: United States
ID NLM: 9216904

Informations de publication

Date de publication:
09 2022
Historique:
received: 23 11 2020
accepted: 07 07 2022
pubmed: 12 8 2022
medline: 16 9 2022
entrez: 11 8 2022
Statut: ppublish

Résumé

Characterization of genetic regulatory variants acting on livestock gene expression is essential for interpreting the molecular mechanisms underlying traits of economic value and for increasing the rate of genetic gain through artificial selection. Here we build a Cattle Genotype-Tissue Expression atlas (CattleGTEx) as part of the pilot phase of the Farm animal GTEx (FarmGTEx) project for the research community based on 7,180 publicly available RNA-sequencing (RNA-seq) samples. We describe the transcriptomic landscape of more than 100 tissues/cell types and report hundreds of thousands of genetic associations with gene expression and alternative splicing for 23 distinct tissues. We evaluate the tissue-sharing patterns of these genetic regulatory effects, and functionally annotate them using multiomics data. Finally, we link gene expression in different tissues to 43 economically important traits using both transcriptome-wide association and colocalization analyses to decipher the molecular regulatory mechanisms underpinning such agronomic traits in cattle.

Identifiants

pubmed: 35953587
doi: 10.1038/s41588-022-01153-5
pii: 10.1038/s41588-022-01153-5
pmc: PMC7613894
mid: EMS157770
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

1438-1447

Subventions

Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/D/10002070
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BBS/E/D/30002275
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P015514/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/R025851/1
Pays : United Kingdom

Informations de copyright

© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.

Références

Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2019).
pubmed: 30445434 doi: 10.1093/nar/gky1120
Hu, Z. L., Park, C. A. & Reecy, J. M. Building a livestock genetic and genomic information knowledgebase through integrative developments of Animal QTLdb and CorrDB. Nucleic Acids Res. 47, D701–D710 (2019).
pubmed: 30407520 doi: 10.1093/nar/gky1084
GTEx Consortium The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
doi: 10.1126/science.aaz1776
Fang, L. et al. Comprehensive analyses of 723 transcriptomes enhance genetic and biological interpretations for complex traits in cattle. Genome Res. 30, 790–801 (2020).
pubmed: 32424068 pmcid: 7263193 doi: 10.1101/gr.250704.119
Xiang, R. et al. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc. Natl Acad. Sci. USA 116, 19398–19408 (2019).
pubmed: 31501319 pmcid: 6765237 doi: 10.1073/pnas.1904159116
Prowse-Wilkins, C. P. et al. Putative causal variants are enriched in annotated functional regions from six bovine tissues. Front. Genet. 12, 664379 (2021).
pubmed: 34249087 pmcid: 8260860 doi: 10.3389/fgene.2021.664379
Xiang, R. et al. Genome variants associated with RNA splicing variations in bovine are extensively shared between tissues. BMC Genomics 19, 521 (2018).
pubmed: 29973141 pmcid: 6032541 doi: 10.1186/s12864-018-4902-8
Cesar, A. S. M. et al. Identification of putative regulatory regions and transcription factors associated with intramuscular fat content traits. BMC Genomics 19, 499 (2018).
pubmed: 29945546 pmcid: 6020320 doi: 10.1186/s12864-018-4871-y
Littlejohn, M. D. et al. Sequence-based association analysis reveals an MGST1 eQTL with pleiotropic effects on bovine milk composition. Sci. Rep. 6, 25376 (2016).
pubmed: 27146958 pmcid: 4857175 doi: 10.1038/srep25376
Hayes, B. J. & Daetwyler, H. D. 1000 Bull Genomes Project to map simple and complex genetic traits in cattle: applications and outcomes. Annu. Rev. Anim. Biosci. 7, 89–102 (2019).
pubmed: 30508490 doi: 10.1146/annurev-animal-020518-115024
Deelen, P. et al. Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels. Genome Med. 7, 30 (2015).
pubmed: 25954321 pmcid: 4423486 doi: 10.1186/s13073-015-0152-4
Hombach, S. & Kretz, M. Non-coding RNAs: classification, biology and functioning. Adv. Exp. Med. Biol. 937, 3–17 (2016).
pubmed: 27573892 doi: 10.1007/978-3-319-42059-2_1
GTEx Consortium et al.Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
pmcid: 4547484 doi: 10.1126/science.1262110
Tenesa, A. et al. Estimation of linkage disequilibrium in a sample of the United Kingdom dairy cattle population using unphased genotypes. J. Anim. Sci. 81, 617–623 (2003).
pubmed: 12661641 doi: 10.2527/2003.813617x
Fang, L. et al. Functional annotation of the cattle genome through systematic discovery and characterization of chromatin states and butyrate-induced variations. BMC Biol. 17, 68 (2019).
pubmed: 31419979 pmcid: 6698049 doi: 10.1186/s12915-019-0687-8
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
pubmed: 22495300 pmcid: 3356448 doi: 10.1038/nature11082
Rosen, B.D. et al. De novo assembly of the cattle reference genome with single-molecule sequencing. Gigascience 9, giaa021 (2020).
pubmed: 32191811 pmcid: 7081964 doi: 10.1093/gigascience/giaa021
Jiang, J. et al. Functional annotation and Bayesian fine-mapping reveals candidate genes for important agronomic traits in Holstein bulls. Commun. Biol. 2, 212 (2019).
pubmed: 31240250 pmcid: 6582147 doi: 10.1038/s42003-019-0454-y
Freebern, E. et al. GWAS and fine-mapping of livability and six disease traits in Holstein cattle. BMC Genomics 21, 41 (2020).
pubmed: 31931710 pmcid: 6958677 doi: 10.1186/s12864-020-6461-z
Fang, L. et al. Genetic and epigenetic architecture of paternal origin contribute to gestation length in cattle. Commun. Biol. 2, 100 (2019).
pubmed: 30886909 pmcid: 6418173 doi: 10.1038/s42003-019-0341-6
Gao, Y. et al. Single-cell transcriptomic analyses of dairy cattle ruminal epithelial cells during weaning. Genomics 113, 2045–2055 (2021).
pubmed: 33933592 doi: 10.1016/j.ygeno.2021.04.039
Kim-Hellmuth, S. et al. Cell type-specific genetic regulation of gene expression across human tissues. Science 369, eaaz8528 (2020).
pubmed: 32913075 pmcid: 8051643 doi: 10.1126/science.aaz8528
Clark, E. L. et al. From FAANG to fork: application of highly annotated genomes to improve farmed animal production. Genome Biol. 21, 285 (2020).
pubmed: 33234160 pmcid: 7686664 doi: 10.1186/s13059-020-02197-8
Xiang, R. D. Genome-wide fine-mapping identifies pleiotropic and functional variants that predict many traits across global cattle populations. Nat. Commun. 12, 860 (2021).
pubmed: 33558518 pmcid: 7870883 doi: 10.1038/s41467-021-21001-0
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
pubmed: 24695404 pmcid: 4103590 doi: 10.1093/bioinformatics/btu170
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
pubmed: 27560171 pmcid: 5032908 doi: 10.1038/nprot.2016.095
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
pubmed: 24227677 doi: 10.1093/bioinformatics/btt656
Li, Y. I. et al. Annotation-free quantification of RNA splicing using LeafCutter. Nat. Genet. 50, 151–158 (2018).
pubmed: 29229983 doi: 10.1038/s41588-017-0004-9
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 43, 11.10.1–11.10.33 (2013).
doi: 10.1002/0471250953.bi1110s43
Browning, B. L., Zhou, Y. & Browning, S. R. A one-penny imputed genome from next-generation reference panels. Am. J. Hum. Genet. 103, 338–348 (2018).
pubmed: 30100085 pmcid: 6128308 doi: 10.1016/j.ajhg.2018.07.015
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
pubmed: 25722852 pmcid: 4342193 doi: 10.1186/s13742-015-0047-8
Patterson, N., Price, A. L. & Reich, D. Population structure and eigenanalysis. PLoS Genet. 2, e190 (2006).
pubmed: 17194218 pmcid: 1713260 doi: 10.1371/journal.pgen.0020190
Castel, S. E., Mohammadi, P., Chung, W. K., Shen, Y. & Lappalainen, T. Rare variant phasing and haplotypic expression from RNA sequencing with phASER. Nat. Commun. 7, 12817 (2016).
pubmed: 27605262 pmcid: 5025529 doi: 10.1038/ncomms12817
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
pubmed: 21493656 pmcid: 3102221 doi: 10.1093/bioinformatics/btr167
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
pubmed: 19451168 pmcid: 2705234 doi: 10.1093/bioinformatics/btp324
Ramirez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun. 9, 189 (2018).
pubmed: 29335486 pmcid: 5768762 doi: 10.1038/s41467-017-02525-w
Servant, N. et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 16, 259 (2015).
pubmed: 26619908 pmcid: 4665391 doi: 10.1186/s13059-015-0831-x
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286 pmcid: 3322381 doi: 10.1038/nmeth.1923
Song, Q. et al. A reference methylome database and analysis pipeline to facilitate integrative and comparative epigenomics. PLoS ONE 8, e81148 (2013).
pubmed: 24324667 pmcid: 3855694 doi: 10.1371/journal.pone.0081148
Liu, H. et al. Systematic identification and annotation of human methylation marks based on bisulfite sequencing methylomes reveals distinct roles of cell type-specific hypomethylation in the regulation of cell identity genes. Nucleic Acids Res. 44, 75–94 (2016).
pubmed: 26635396 doi: 10.1093/nar/gkv1332
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
pubmed: 22343431 pmcid: 3398141 doi: 10.1038/nprot.2011.457
Ongen, H., Buil, A., Brown, A. A., Dermitzakis, E. T. & Delaneau, O. Fast and efficient QTL mapper for thousands of molecular phenotypes. Bioinformatics 32, 1479–1485 (2016).
pubmed: 26708335 doi: 10.1093/bioinformatics/btv722
Mohammadi, P., Castel, S. E., Brown, A. A. & Lappalainen, T. Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change. Genome Res. 27, 1872–1884 (2017).
pubmed: 29021289 pmcid: 5668944 doi: 10.1101/gr.216747.116
Wen, X., Lee, Y., Luca, F. & Pique-Regi, R. Efficient integrative multi-SNP association analysis via deterministic approximation of posteriors. Am. J. Hum. Genet. 98, 1114–1129 (2016).
pubmed: 27236919 pmcid: 4908152 doi: 10.1016/j.ajhg.2016.03.029
Willer, C. J., Li, Y. & Abecasis, G. R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191 (2010).
pubmed: 20616382 pmcid: 2922887 doi: 10.1093/bioinformatics/btq340
Urbut, S. M., Wang, G., Carbonetto, P. & Stephens, M. Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions. Nat. Genet. 51, 187–195 (2019).
pubmed: 30478440 doi: 10.1038/s41588-018-0268-8
Derrien, T. et al. Fast computation and applications of genome mappability. PLoS ONE 7, e30377 (2012).
pubmed: 22276185 pmcid: 3261895 doi: 10.1371/journal.pone.0030377
Shabalin, A. A. Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics 28, 1353–1358 (2012).
pubmed: 22492648 pmcid: 3348564 doi: 10.1093/bioinformatics/bts163
Saha, A. & Battle, A. False positives in trans-eQTL and co-expression analyses arising from RNA-sequencing alignment errors. F1000Res. 7, 1860 (2018).
pubmed: 30613398 doi: 10.12688/f1000research.17145.1
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46, 100–106 (2014).
pubmed: 24473328 pmcid: 3989144 doi: 10.1038/ng.2876
Barbeira, A. N. et al. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat. Commun. 9, 1825 (2018).
pubmed: 29739930 pmcid: 5940825 doi: 10.1038/s41467-018-03621-1
Barbeira, A. N. et al. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 15, e1007889 (2019).
pubmed: 30668570 pmcid: 6358100 doi: 10.1371/journal.pgen.1007889
Wen, X., Pique-Regi, R. & Luca, F. Integrating molecular QTL data into genome-wide genetic association analysis: probabilistic assessment of enrichment and colocalization. PLoS Genet. 13, e1006646 (2017).
pubmed: 28278150 pmcid: 5363995 doi: 10.1371/journal.pgen.1006646
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014).
pubmed: 24830394 pmcid: 4022491 doi: 10.1371/journal.pgen.1004383
Heger, A., Webber, C., Goodson, M., Ponting, C. P. & Lunter, G. GAT: a simulation framework for testing the association of genomic intervals. Bioinformatics 29, 2046–2048 (2013).
pubmed: 23782611 pmcid: 3722528 doi: 10.1093/bioinformatics/btt343
Yu, G., Wang, L. G., Han, Y. & He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
pubmed: 22455463 pmcid: 3339379 doi: 10.1089/omi.2011.0118
Liu, S. et al. A multi-tissue atlas of regulatory variants in cattle. Code resource at github website. GitHub: https://github.com/shuliliu/cattleGTEx ; Zenodo: https://doi.org/10.5281/zenodo.6510550

Auteurs

Shuli Liu (S)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.
National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China.
School of Life Sciences, Westlake University, Hangzhou, China.

Yahui Gao (Y)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.
Department of Animal and Avian Sciences, University of Maryland, College Park, MD, USA.

Oriol Canela-Xandri (O)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.

Sheng Wang (S)

State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.

Ying Yu (Y)

National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China.

Wentao Cai (W)

Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China.

Bingjie Li (B)

Scotland's Rural College (SRUC), Roslin Institute Building, Midlothian, UK.

Ruidong Xiang (R)

Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, Victoria, Australia.
Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia.

Amanda J Chamberlain (AJ)

Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, Victoria, Australia.

Erola Pairo-Castineira (E)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.
The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK.

Kenton D'Mellow (K)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.

Konrad Rawlik (K)

The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK.

Charley Xia (C)

The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK.

Yuelin Yao (Y)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.

Pau Navarro (P)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.

Dominique Rocha (D)

INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas, France.

Xiujin Li (X)

Guangdong Provincial Key Laboratory of Waterfowl Healthy Breeding, College of Animal Science & Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, China.

Ze Yan (Z)

National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China.

Congjun Li (C)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.

Benjamin D Rosen (BD)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.

Curtis P Van Tassell (CP)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.

Paul M Vanraden (PM)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.

Shengli Zhang (S)

National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China.

Li Ma (L)

Department of Animal and Avian Sciences, University of Maryland, College Park, MD, USA.

John B Cole (JB)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA.

George E Liu (GE)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA. George.Liu@usda.gov.

Albert Tenesa (A)

MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK. Albert.Tenesa@ed.ac.uk.
The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, UK. Albert.Tenesa@ed.ac.uk.

Lingzhao Fang (L)

Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, USA. lingzhao.fang@qgg.au.dk.
MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK. lingzhao.fang@qgg.au.dk.
Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark. lingzhao.fang@qgg.au.dk.

Articles similaires

Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice
Animals Tail Swine Behavior, Animal Animal Husbandry

Classifications MeSH