Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology.

QTL epi-genetics functional genomics multi-omics integration trait-similarity network transcriptomics

Journal

GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872

Informations de publication

Date de publication:
02 Jan 2024
Historique:
received: 11 02 2023
revised: 29 07 2023
accepted: 27 03 2024
medline: 17 4 2024
pubmed: 17 4 2024
entrez: 16 4 2024
Statut: ppublish

Résumé

The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues. A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5' untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue-tissue interconnection involved in different traits and construct the first bovine trait similarity network. These validated results show significant improvement over current bovine genome annotations.

Sections du résumé

BACKGROUND BACKGROUND
The accurate identification of the functional elements in the bovine genome is a fundamental requirement for high-quality analysis of data informing both genome biology and genomic selection. Functional annotation of the bovine genome was performed to identify a more complete catalog of transcript isoforms across bovine tissues.
RESULTS RESULTS
A total of 160,820 unique transcripts (50% protein coding) representing 34,882 unique genes (60% protein coding) were identified across tissues. Among them, 118,563 transcripts (73% of the total) were structurally validated by independent datasets (PacBio isoform sequencing data, Oxford Nanopore Technologies sequencing data, de novo assembled transcripts from RNA sequencing data) and comparison with Ensembl and NCBI gene sets. In addition, all transcripts were supported by extensive data from different technologies such as whole transcriptome termini site sequencing, RNA Annotation and Mapping of Promoters for the Analysis of Gene Expression, chromatin immunoprecipitation sequencing, and assay for transposase-accessible chromatin using sequencing. A large proportion of identified transcripts (69%) were unannotated, of which 86% were produced by annotated genes and 14% by unannotated genes. A median of two 5' untranslated regions were expressed per gene. Around 50% of protein-coding genes in each tissue were bifunctional and transcribed both coding and noncoding isoforms. Furthermore, we identified 3,744 genes that functioned as noncoding genes in fetal tissues but as protein-coding genes in adult tissues. Our new bovine genome annotation extended more than 11,000 annotated gene borders compared to Ensembl or NCBI annotations. The resulting bovine transcriptome was integrated with publicly available quantitative trait loci data to study tissue-tissue interconnection involved in different traits and construct the first bovine trait similarity network.
CONCLUSIONS CONCLUSIONS
These validated results show significant improvement over current bovine genome annotations.

Identifiants

pubmed: 38626724
pii: 7646333
doi: 10.1093/gigascience/giae019
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Agriculture and Food Research Initiative Competitive
ID : 2018-67015-27500
Organisme : U.S. Department of Agriculture
Organisme : National Institute of Food and Agriculture

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press GigaScience.

Auteurs

Hamid Beiki (H)

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

Brenda M Murdoch (BM)

Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA.

Carissa A Park (CA)

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

Chandlar Kern (C)

Department of Animal Science, Pennsylvania State University, PA 16802, USA.

Denise Kontechy (D)

Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA.

Gabrielle Becker (G)

Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA.

Gonzalo Rincon (G)

Zoetis, Parsippany-Troy Hills, NJ 07054, USA.

Honglin Jiang (H)

Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA.

Huaijun Zhou (H)

Department of Animal Science, University of California, Davis, CA 95616, USA.

Jacob Thorne (J)

Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA.

James E Koltes (JE)

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

Jennifer J Michal (JJ)

Department of Animal Science, Washington State University, WA 99164, USA.

Kimberly Davenport (K)

Department of Animal and Veterinary and Food Science, University of Idaho, ID 83844, USA.

Monique Rijnkels (M)

Department of Veterinary Integrative Biosciences, Texas A&M University, TX 77843, USA.

Pablo J Ross (PJ)

Department of Animal Science, University of California, Davis, CA 95616, USA.

Rui Hu (R)

Department of Animal and Poultry Sciences, Virginia Tech, VA 24060, USA.

Sarah Corum (S)

Zoetis, Parsippany-Troy Hills, NJ 07054, USA.

Stephanie McKay (S)

University of Missouri, Columbia, MO 65211, USA.

Wansheng Liu (W)

Department of Animal Science, Pennsylvania State University, PA 16802, USA.

Wenzhi Ma (W)

Department of Animal Science, Pennsylvania State University, PA 16802, USA.

Xiaohui Zhang (X)

Department of Animal Science, Washington State University, WA 99164, USA.

Xiaoqing Xu (X)

Department of Animal Science, University of California, Davis, CA 95616, USA.

Xuelei Han (X)

Department of Animal Science, Washington State University, WA 99164, USA.

Zhihua Jiang (Z)

Department of Animal Science, Washington State University, WA 99164, USA.

Zhi-Liang Hu (ZL)

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

James M Reecy (JM)

Department of Animal Science, Iowa State University, Ames, IA 50011, USA.

Classifications MeSH