BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq.
Barley
Differential alternative splicing
Differential gene expression
Reference transcript dataset
Transcriptome
Journal
BMC Genomics
ISSN: 1471-2164
Abbreviated title: BMC Genomics
Country: England
NLM ID: 100965258
Publication information
Publication date:
11 Dec 2019
History:
received: 06 Jun 2019
accepted: 29 Oct 2019
entrez: 13 Dec 2019
pubmed: 13 Dec 2019
medline: 14 Apr 2020
Status:
epublish
Abstract
BACKGROUND: The time required to analyse RNA-seq data varies considerably, due to discrete steps for computational assembly, quantification of gene expression and splicing analysis. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but they require a high-quality, comprehensive reference transcript dataset (RTD), which is rarely available in plants.
RESULTS: A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts - BaRTv1.0) has been generated. BaRTv1.0 was constructed from a range of tissues, cultivars and abiotic treatments, with transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al. Nature; 544: 427-433, 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al. Plant Physiol; 156: 20-28, 2011) were used to determine transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissues. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also made to overcome inaccurate quantification due to variation in the 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data from five barley organs/tissues. This analysis identified 20,972 significant differentially expressed genes, 2,791 differentially alternatively spliced genes and 2,768 transcripts with differential transcript usage.
CONCLUSIONS: A high-confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, less fragmented and have improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.
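The abstract distinguishes differential gene expression from differential transcript usage. The sketch below illustrates that distinction under stated assumptions: it is not the authors' pipeline, the transcript and gene identifiers are hypothetical, and the TPM values are made up for illustration rather than taken from BaRTv1.0. It sums transcript-level abundances (of the kind a tool such as Salmon or Kallisto reports) into gene totals, then computes each transcript's share of its gene's expression in two samples.

```python
from collections import defaultdict


def gene_totals(tpm, tx2gene):
    """Sum transcript-level TPM values into gene-level totals."""
    totals = defaultdict(float)
    for tx, value in tpm.items():
        totals[tx2gene[tx]] += value
    return dict(totals)


def isoform_usage(tpm, tx2gene):
    """Return each transcript's share of its gene's total expression."""
    totals = gene_totals(tpm, tx2gene)
    usage = {}
    for tx, value in tpm.items():
        total = totals[tx2gene[tx]]
        # Guard against unexpressed genes to avoid division by zero.
        usage[tx] = value / total if total > 0 else 0.0
    return usage


# Hypothetical identifiers and made-up TPM values, for illustration only.
tx2gene = {"geneA.1": "geneA", "geneA.2": "geneA", "geneB.1": "geneB"}
leaf = {"geneA.1": 30.0, "geneA.2": 10.0, "geneB.1": 5.0}
root = {"geneA.1": 10.0, "geneA.2": 30.0, "geneB.1": 5.0}

usage_leaf = isoform_usage(leaf, tx2gene)
usage_root = isoform_usage(root, tx2gene)

# Change in isoform share between the two samples.
delta = {tx: usage_root[tx] - usage_leaf[tx] for tx in tx2gene}

print(gene_totals(leaf, tx2gene))
print(gene_totals(root, tx2gene))
print(delta)
```

In this toy input the gene-level total for geneA is the same in both samples while the two isoforms swap shares, which is the kind of change a gene-level comparison alone would miss. A real analysis would add replicates and a statistical test; this sketch only shows the bookkeeping.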
Identifiers
pubmed: 31829136
doi: 10.1186/s12864-019-6243-7
pii: 10.1186/s12864-019-6243-7
pmc: PMC6907147
Chemical substances
Plant Proteins
0
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
968
Grants
Agency: BBSRC
ID: BB/I00663X/1
Agency: BBSRC
ID: BB/G016232/1
Agency: BBSRC
ID: BB/K01613X/1
Agency: ERC
ID: 669182
References
Curr Opin Plant Biol. 2015 Apr;24:125-35
pubmed: 25835141
Nucleic Acids Res. 2007 Jan;35(Database issue):D883-7
pubmed: 17145706
Plant Cell. 2009 Jul;21(7):2045-57
pubmed: 19602621
Curr Opin Plant Biol. 2015 Oct;27:97-103
pubmed: 26190743
Nature. 2012 Nov 29;491(7426):711-6
pubmed: 23075845
Genom Data. 2017 May 22;13:15-17
pubmed: 28626638
Protoplasma. 2013 Jun;250(3):639-50
pubmed: 22961303
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712
RNA. 2015 Sep;21(9):1521-31
pubmed: 26179515
Nat Genet. 2016 Sep;48(9):1024-30
pubmed: 27428750
New Phytol. 2017 Jan;213(2):525-530
pubmed: 27659901
Front Plant Sci. 2018 Jan 09;8:2212
pubmed: 29375595
Genome Res. 2012 Jun;22(6):1184-95
pubmed: 22391557
Front Plant Sci. 2018 Apr 18;9:500
pubmed: 29720989
Biotechnol Adv. 2014 Jan-Feb;32(1):137-57
pubmed: 24084493
Physiol Plant. 2018 May;163(1):18-29
pubmed: 29111595
Methods Mol Biol. 2016;1415:245-62
pubmed: 27115637
Theor Appl Genet. 2014 Oct;127(10):2095-103
pubmed: 25212109
RNA Biol. 2021 Nov;18(11):1574-1587
pubmed: 33345702
Nucleic Acids Res. 2017 May 19;45(9):5061-5073
pubmed: 28402429
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Nat Biotechnol. 2016 May;34(5):525-7
pubmed: 27043002
Plant J. 2008 Mar;53(6):1035-48
pubmed: 18088312
Plant Physiol. 2011 May;156(1):20-8
pubmed: 21415278
Plant Cell. 2014 Sep;26(9):3472-87
pubmed: 25248552
New Phytol. 2015 Oct;208(1):96-101
pubmed: 26111100
J Hered. 2005 Nov-Dec;96(6):654-62
pubmed: 16251510
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
Nat Methods. 2013 Dec;10(12):1185-91
pubmed: 24185836
Mol Biol Evol. 2015 Oct;32(10):2726-37
pubmed: 26116860
Plant Cell. 2013 Oct;25(10):3640-56
pubmed: 24179132
Nature. 2011 Aug 28;477(7365):419-23
pubmed: 21874022
Life Sci Alliance. 2019 Jan 17;2(1):
pubmed: 30655364
BMC Plant Biol. 2019 Apr 11;19(1):134
pubmed: 30971212
BMC Genomics. 2008 Apr 10;9:159
pubmed: 18402682
Nature. 2010 Jan 28;463(7280):457-63
pubmed: 20110989
Planta. 2014 Jan;239(1):127-38
pubmed: 24097263
Bioinformatics. 2016 Jan 1;32(1):43-9
pubmed: 26519505
Annu Rev Biochem. 2015;84:291-323
pubmed: 25784052
Nucleic Acids Res. 2012 Mar;40(6):2454-69
pubmed: 22127866
Plant Cell. 2018 Jul;30(7):1424-1444
pubmed: 29764987
PLoS One. 2016 Mar 31;11(3):e0152824
pubmed: 27031341
Bioinformatics. 2005 May 1;21(9):1859-75
pubmed: 15728110
Plant Cell. 2013 Oct;25(10):3657-83
pubmed: 24179125
Genome Biol. 2012 Feb 22;13(2):143
pubmed: 22356731
Front Plant Sci. 2018 Aug 15;9:1174
pubmed: 30158945
Cell. 1999 Nov 12;99(4):355-66
pubmed: 10571178
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
J Plant Physiol. 2016 Feb 1;191:127-39
pubmed: 26788957
BMC Genomics. 2017 Oct 11;18(1):772
pubmed: 29020934
Plant Sci. 2012 Apr;185-186:40-9
pubmed: 22325865
Nature. 2017 Apr 26;544(7651):427-433
pubmed: 28447635
Bioinformatics. 2015 Dec 15;31(24):3938-45
pubmed: 26338770
FEBS Lett. 2015 Nov 30;589(23):3564-75
pubmed: 26454178
New Phytol. 2015 May;206(3):913-931
pubmed: 25605349
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
Nat Methods. 2017 Apr;14(4):417-419
pubmed: 28263959
Front Plant Sci. 2019 Feb 28;10:235
pubmed: 30891054
Front Bioeng Biotechnol. 2015 Mar 26;3:33
pubmed: 25859541
Front Plant Sci. 2018 Aug 21;9:1209
pubmed: 30186296
PLoS One. 2016 Dec 13;11(12):e0168028
pubmed: 27959947
Trends Plant Sci. 2018 Feb;23(2):140-150
pubmed: 29074233