Enhanced genome assembly and a new official gene set for Tribolium castaneum.
Gene annotation
Gene prediction
Gene set OGS3
Genome
Genome assembly Tcas5.2
Reannotation
RefSeq genome
Tribolium castaneum
miRNA
microRNA
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
14 Jan 2020
14 Jan 2020
Historique:
received:
30
08
2019
accepted:
12
12
2019
entrez:
16
1
2020
pubmed:
16
1
2020
medline:
9
9
2020
Statut:
epublish
Résumé
The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established and the resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, and with a small set of RNA sequence data limiting annotation quality. Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads of different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to Genbank and accepted as a RefSeq genome by NCBI. The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.
Sections du résumé
BACKGROUND
BACKGROUND
The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established and the resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, and with a small set of RNA sequence data limiting annotation quality.
RESULTS
RESULTS
Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads of different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to Genbank and accepted as a RefSeq genome by NCBI.
CONCLUSIONS
CONCLUSIONS
The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.
Identifiants
pubmed: 31937263
doi: 10.1186/s12864-019-6394-6
pii: 10.1186/s12864-019-6394-6
pmc: PMC6961396
doi:
Substances chimiques
MicroRNAs
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
47Subventions
Organisme : Deutsche Forschungsgemeinschaft
ID : SFB680
Organisme : NIGMS NIH HHS
ID : P20 GM103418
Pays : United States
Organisme : Deutsche Forschungsgemeinschaft
ID : PA2044/1
Organisme : Deutsche Forschungsgemeinschaft
ID : KL656/7
Organisme : Deutsche Forschungsgemeinschaft
ID : BU1443/6
Organisme : Deutsche Forschungsgemeinschaft
ID : WI1797/5
Organisme : Deutsche Forschungsgemeinschaft
ID : FOR1234
Organisme : Deutsche Forschungsgemeinschaft
ID : STA1009/10
Organisme : Deutsche Forschungsgemeinschaft
ID : BU1443/7
Organisme : Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (CH)
ID : PP00P3_170664
Organisme : Max-Planck-Gesellschaft
ID : Department Herbert Jäckle
Organisme : Deutsche Forschungsgemeinschaft
ID : SFB 680 / A12
Organisme : Deutsche Forschungsgemeinschaft
ID : Scho1058/4
Références
Nature. 1999 Nov 25;402(6760):370-1
pubmed: 10586872
Genome Res. 2016 Jan;26(1):85-96
pubmed: 26518483
Nucleic Acids Res. 2015 Jan;43(Database issue):D720-5
pubmed: 25378303
Arthropod Struct Dev. 2010 Nov;39(6):399-410
pubmed: 20800703
Genome Biol. 2012 Jun 25;13(6):R56
pubmed: 22731987
Curr Biol. 2002 Feb 5;12(3):R85-6
pubmed: 11839285
Bioinformatics. 2006 Nov 1;22(21):2688-90
pubmed: 16928733
Proc Biol Sci. 2018 Aug 22;285(1885):
pubmed: 30135167
Genome Biol. 2003;5(1):R1
pubmed: 14709173
Development. 2018 Apr 5;145(7):
pubmed: 29540499
Science. 2007 Dec 21;318(5858):1913-6
pubmed: 18096805
Nucleic Acids Res. 2014 Jan;42(Database issue):D68-73
pubmed: 24275495
Bioinformatics. 2008 Mar 1;24(5):637-44
pubmed: 18218656
BMC Dev Biol. 2010 May 19;10:53
pubmed: 20482875
Nucleic Acids Res. 2017 Jan 4;45(D1):D663-D671
pubmed: 27799470
Annu Rev Entomol. 2002;47:669-99
pubmed: 11729088
Insect Mol Biol. 2003 Oct;12(5):433-40
pubmed: 12974948
PLoS Genet. 2013;9(7):e1003596
pubmed: 23874211
Science. 2012 Apr 20;336(6079):338-41
pubmed: 22403177
Dev Genes Evol. 2016 Jun;226(3):209-19
pubmed: 27056385
BMC Biol. 2009 Nov 05;7:73
pubmed: 19891766
Bioinformatics. 2003 Nov 1;19(16):2122-30
pubmed: 14594718
BMC Genomics. 2014 Jan 30;15:86
pubmed: 24479613
Science. 2005 Nov 25;310(5752):1325-6
pubmed: 16311335
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
Nucleic Acids Res. 2015 Jan;43(Database issue):D250-6
pubmed: 25428351
BMC Dev Biol. 2013 Jun 18;13:25
pubmed: 23777260
Nature. 2008 Apr 24;452(7190):949-55
pubmed: 18362917
Dev Biol. 2008 Jan 15;313(2):471-91
pubmed: 18082679
Evol Dev. 1999 Jul-Aug;1(1):11-5
pubmed: 11324015
Development. 2015 Aug 15;142(16):2832-9
pubmed: 26160901
Dev Genes Evol. 2004 Nov;214(11):575-8
pubmed: 15365833
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
PLoS Genet. 2015 Jun 04;11(6):e1005245
pubmed: 26042831
BMC Genomics. 2015 Sep 03;16:674
pubmed: 26334912
Development. 2019 Jan 4;146(1):
pubmed: 30552128
BMC Genomics. 2015 Sep 29;16:734
pubmed: 26416786
Cold Spring Harb Protoc. 2009 Aug;2009(8):pdb.emo126
pubmed: 20147228
Curr Biol. 2004 Aug 24;14(16):R639-40
pubmed: 15324675
Dev Genes Evol. 2008 Apr;218(3-4):119-26
pubmed: 18392874
J Comput Biol. 2006 Mar;13(2):336-50
pubmed: 16597244
Nucleic Acids Res. 2018 Jan 4;46(D1):D831-D835
pubmed: 29069517
Nucleic Acids Res. 2014 Jan;42(Database issue):D780-8
pubmed: 24234449
BMC Biol. 2017 Jul 31;15(1):62
pubmed: 28756775
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W569-72
pubmed: 22695796
Genesis. 2001 May;30(1):12-20
pubmed: 11353513
Dev Cell. 2004 Sep;7(3):301-12
pubmed: 15363406
Bioinformatics. 2010 Apr 1;26(7):873-81
pubmed: 20147302
Development. 2001 Jan;128(2):287-97
pubmed: 11124123
Genome Biol. 2008 Jan 17;9(1):R10
pubmed: 18201385
Cell. 2009 Jan 23;136(2):215-33
pubmed: 19167326
Nature. 2005 Feb 10;433(7026):643-7
pubmed: 15703749
Proc Natl Acad Sci U S A. 2005 Aug 9;102(32):11337-42
pubmed: 16076951
Front Zool. 2012 Aug 06;9(1):15
pubmed: 22866820
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Elife. 2015 Aug 12;4:null
pubmed: 26267216
Genome Biol Evol. 2010;2:686-96
pubmed: 20817720
BMC Bioinformatics. 2004 Aug 19;5:113
pubmed: 15318951
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D590-8
pubmed: 16381938
Bioinformatics. 2009 Aug 1;25(15):1972-3
pubmed: 19505945
Nat Rev Mol Cell Biol. 2014 Aug;15(8):509-24
pubmed: 25027649
Insect Biochem Mol Biol. 2012 Apr;42(4):264-76
pubmed: 22212827
Genetics. 2005 Jun;170(2):741-7
pubmed: 15834150
Nat Commun. 2015 Jul 28;6:7822
pubmed: 26215380
BMC Genomics. 2014 Dec 18;15:1141
pubmed: 25523483
Curr Opin Insect Sci. 2016 Oct;17:1-9
pubmed: 27720067
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45
pubmed: 26553804
Nat Methods. 2013 Jan;10(1):71-3
pubmed: 23160280
Dev Genes Evol. 2012 Sep;222(5):287-98
pubmed: 22890852
Arthropod Struct Dev. 2014 Nov;43(6):605-13
pubmed: 25242057
Nucleic Acids Res. 2010 Jan;38(Database issue):D437-42
pubmed: 19820115