Enhanced genome assembly and a new official gene set for Tribolium castaneum.


Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
14 Jan 2020
Historique:
received: 30 08 2019
accepted: 12 12 2019
entrez: 16 1 2020
pubmed: 16 1 2020
medline: 9 9 2020
Statut: epublish

Résumé

The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established and the resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, and with a small set of RNA sequence data limiting annotation quality. Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads of different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to Genbank and accepted as a RefSeq genome by NCBI. The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.

Sections du résumé

BACKGROUND BACKGROUND
The red flour beetle Tribolium castaneum has emerged as an important model organism for the study of gene function in development and physiology, for ecological and evolutionary genomics, for pest control and a plethora of other topics. RNA interference (RNAi), transgenesis and genome editing are well established and the resources for genome-wide RNAi screening have become available in this model. All these techniques depend on a high quality genome assembly and precise gene models. However, the first version of the genome assembly was generated by Sanger sequencing, and with a small set of RNA sequence data limiting annotation quality.
RESULTS RESULTS
Here, we present an improved genome assembly (Tcas5.2) and an enhanced genome annotation resulting in a new official gene set (OGS3) for Tribolium castaneum, which significantly increase the quality of the genomic resources. By adding large-distance jumping library DNA sequencing to join scaffolds and fill small gaps, the gaps in the genome assembly were reduced and the N50 increased to 4753kbp. The precision of the gene models was enhanced by the use of a large body of RNA-Seq reads of different life history stages and tissue types, leading to the discovery of 1452 novel gene sequences. We also added new features such as alternative splicing, well defined UTRs and microRNA target predictions. For quality control, 399 gene models were evaluated by manual inspection. The current gene set was submitted to Genbank and accepted as a RefSeq genome by NCBI.
CONCLUSIONS CONCLUSIONS
The new genome assembly (Tcas5.2) and the official gene set (OGS3) provide enhanced genomic resources for genetic work in Tribolium castaneum. The much improved information on transcription start sites supports transgenic and gene editing approaches. Further, novel types of information such as splice variants and microRNA target genes open additional possibilities for analysis.

Identifiants

pubmed: 31937263
doi: 10.1186/s12864-019-6394-6
pii: 10.1186/s12864-019-6394-6
pmc: PMC6961396
doi:

Substances chimiques

MicroRNAs 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

47

Subventions

Organisme : Deutsche Forschungsgemeinschaft
ID : SFB680
Organisme : NIGMS NIH HHS
ID : P20 GM103418
Pays : United States
Organisme : Deutsche Forschungsgemeinschaft
ID : PA2044/1
Organisme : Deutsche Forschungsgemeinschaft
ID : KL656/7
Organisme : Deutsche Forschungsgemeinschaft
ID : BU1443/6
Organisme : Deutsche Forschungsgemeinschaft
ID : WI1797/5
Organisme : Deutsche Forschungsgemeinschaft
ID : FOR1234
Organisme : Deutsche Forschungsgemeinschaft
ID : STA1009/10
Organisme : Deutsche Forschungsgemeinschaft
ID : BU1443/7
Organisme : Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (CH)
ID : PP00P3_170664
Organisme : Max-Planck-Gesellschaft
ID : Department Herbert Jäckle
Organisme : Deutsche Forschungsgemeinschaft
ID : SFB 680 / A12
Organisme : Deutsche Forschungsgemeinschaft
ID : Scho1058/4

Références

Nature. 1999 Nov 25;402(6760):370-1
pubmed: 10586872
Genome Res. 2016 Jan;26(1):85-96
pubmed: 26518483
Nucleic Acids Res. 2015 Jan;43(Database issue):D720-5
pubmed: 25378303
Arthropod Struct Dev. 2010 Nov;39(6):399-410
pubmed: 20800703
Genome Biol. 2012 Jun 25;13(6):R56
pubmed: 22731987
Curr Biol. 2002 Feb 5;12(3):R85-6
pubmed: 11839285
Bioinformatics. 2006 Nov 1;22(21):2688-90
pubmed: 16928733
Proc Biol Sci. 2018 Aug 22;285(1885):
pubmed: 30135167
Genome Biol. 2003;5(1):R1
pubmed: 14709173
Development. 2018 Apr 5;145(7):
pubmed: 29540499
Science. 2007 Dec 21;318(5858):1913-6
pubmed: 18096805
Nucleic Acids Res. 2014 Jan;42(Database issue):D68-73
pubmed: 24275495
Bioinformatics. 2008 Mar 1;24(5):637-44
pubmed: 18218656
BMC Dev Biol. 2010 May 19;10:53
pubmed: 20482875
Nucleic Acids Res. 2017 Jan 4;45(D1):D663-D671
pubmed: 27799470
Annu Rev Entomol. 2002;47:669-99
pubmed: 11729088
Insect Mol Biol. 2003 Oct;12(5):433-40
pubmed: 12974948
PLoS Genet. 2013;9(7):e1003596
pubmed: 23874211
Science. 2012 Apr 20;336(6079):338-41
pubmed: 22403177
Dev Genes Evol. 2016 Jun;226(3):209-19
pubmed: 27056385
BMC Biol. 2009 Nov 05;7:73
pubmed: 19891766
Bioinformatics. 2003 Nov 1;19(16):2122-30
pubmed: 14594718
BMC Genomics. 2014 Jan 30;15:86
pubmed: 24479613
Science. 2005 Nov 25;310(5752):1325-6
pubmed: 16311335
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
Nucleic Acids Res. 2015 Jan;43(Database issue):D250-6
pubmed: 25428351
BMC Dev Biol. 2013 Jun 18;13:25
pubmed: 23777260
Nature. 2008 Apr 24;452(7190):949-55
pubmed: 18362917
Dev Biol. 2008 Jan 15;313(2):471-91
pubmed: 18082679
Evol Dev. 1999 Jul-Aug;1(1):11-5
pubmed: 11324015
Development. 2015 Aug 15;142(16):2832-9
pubmed: 26160901
Dev Genes Evol. 2004 Nov;214(11):575-8
pubmed: 15365833
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
PLoS Genet. 2015 Jun 04;11(6):e1005245
pubmed: 26042831
BMC Genomics. 2015 Sep 03;16:674
pubmed: 26334912
Development. 2019 Jan 4;146(1):
pubmed: 30552128
BMC Genomics. 2015 Sep 29;16:734
pubmed: 26416786
Cold Spring Harb Protoc. 2009 Aug;2009(8):pdb.emo126
pubmed: 20147228
Curr Biol. 2004 Aug 24;14(16):R639-40
pubmed: 15324675
Dev Genes Evol. 2008 Apr;218(3-4):119-26
pubmed: 18392874
J Comput Biol. 2006 Mar;13(2):336-50
pubmed: 16597244
Nucleic Acids Res. 2018 Jan 4;46(D1):D831-D835
pubmed: 29069517
Nucleic Acids Res. 2014 Jan;42(Database issue):D780-8
pubmed: 24234449
BMC Biol. 2017 Jul 31;15(1):62
pubmed: 28756775
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W569-72
pubmed: 22695796
Genesis. 2001 May;30(1):12-20
pubmed: 11353513
Dev Cell. 2004 Sep;7(3):301-12
pubmed: 15363406
Bioinformatics. 2010 Apr 1;26(7):873-81
pubmed: 20147302
Development. 2001 Jan;128(2):287-97
pubmed: 11124123
Genome Biol. 2008 Jan 17;9(1):R10
pubmed: 18201385
Cell. 2009 Jan 23;136(2):215-33
pubmed: 19167326
Nature. 2005 Feb 10;433(7026):643-7
pubmed: 15703749
Proc Natl Acad Sci U S A. 2005 Aug 9;102(32):11337-42
pubmed: 16076951
Front Zool. 2012 Aug 06;9(1):15
pubmed: 22866820
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Elife. 2015 Aug 12;4:null
pubmed: 26267216
Genome Biol Evol. 2010;2:686-96
pubmed: 20817720
BMC Bioinformatics. 2004 Aug 19;5:113
pubmed: 15318951
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D590-8
pubmed: 16381938
Bioinformatics. 2009 Aug 1;25(15):1972-3
pubmed: 19505945
Nat Rev Mol Cell Biol. 2014 Aug;15(8):509-24
pubmed: 25027649
Insect Biochem Mol Biol. 2012 Apr;42(4):264-76
pubmed: 22212827
Genetics. 2005 Jun;170(2):741-7
pubmed: 15834150
Nat Commun. 2015 Jul 28;6:7822
pubmed: 26215380
BMC Genomics. 2014 Dec 18;15:1141
pubmed: 25523483
Curr Opin Insect Sci. 2016 Oct;17:1-9
pubmed: 27720067
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45
pubmed: 26553804
Nat Methods. 2013 Jan;10(1):71-3
pubmed: 23160280
Dev Genes Evol. 2012 Sep;222(5):287-98
pubmed: 22890852
Arthropod Struct Dev. 2014 Nov;43(6):605-13
pubmed: 25242057
Nucleic Acids Res. 2010 Jan;38(Database issue):D437-42
pubmed: 19820115

Auteurs

Nicolae Herndon (N)

Department of Computer Science, East Carolina University, Greenville, NC, 27858, USA.

Jennifer Shelton (J)

Division of Biology, Kansas State University, Manhattan, KS, 66506, USA.

Lizzy Gerischer (L)

Institut für Mathematik und Informatik, Universität Greifswald, Greifswald, Germany.

Panos Ioannidis (P)

Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211, Geneva, Switzerland.

Maria Ninova (M)

Faculty of Biology, Medicine and Health, University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK.

Jürgen Dönitz (J)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Robert M Waterhouse (RM)

Department of Ecology and Evolution, University of Lausanne and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.

Chun Liang (C)

Department of Biology, Miami University, Oxford, OH, 45056, USA.

Carsten Damm (C)

Institut für Informatik, Fakultät für Mathematik und Informatik, Georg-August-Universität Göttingen, Goldschmidtstr. 7, 37077, Göttingen, Germany.

Janna Siemanowski (J)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Peter Kitzmann (P)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Julia Ulrich (J)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Stefan Dippel (S)

Göttinger Graduiertenschule fur Neurowissenschaften Biophysik und Molekulare Biowissenschaften, Georg-August-Universität Göttingen, Göttingen, Germany.

Georg Oberhofer (G)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Yonggang Hu (Y)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Jonas Schwirz (J)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Magdalena Schacht (M)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Sabrina Lehmann (S)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Alice Montino (A)

Department of Evolutionary Developmental Genetics, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Nico Posnien (N)

Department of Developmental Biology, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Daniela Gurska (D)

Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674, Cologne, Germany.

Thorsten Horn (T)

Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674, Cologne, Germany.

Jan Seibert (J)

Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674, Cologne, Germany.

Iris M Vargas Jentzsch (IM)

Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674, Cologne, Germany.

Kristen A Panfilio (KA)

School of Life Sciences, University of Warwick, Gibbet Hill Campus, Coventry, CV4 7AL, UK.

Jianwei Li (J)

Department Developmental Biology, GZMB, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Ernst A Wimmer (EA)

Department of Developmental Biology, University of Göttingen, Justus-von-Liebig-Weg 11, 37077, Göttingen, Germany.

Dominik Stappert (D)

Institute of Zoology: Developmental Biology, University of Cologne, Zülpicher Weg 47b, 50674, Cologne, Germany.

Siegfried Roth (S)

Institute of Zoology: Developmental Biology, University of Cologne, Zülpicher Weg 47b, 50674, Cologne, Germany.

Reinhard Schröder (R)

Institut für Biowissenschaften, Universität Rostock, Albert-Einstein-Str. 3, 18059, Rostock, Germany.

Yoonseong Park (Y)

Department of Entomology, Kansas State University, Manhattan, KS, 66506, USA.

Michael Schoppmeier (M)

Department of Biology, Divison of Developmental Biology, Friedrich-Alexander-University of Erlangen-Nürnberg, Staudtstr. 5, 91058, Erlangen, Germany.

Ho-Ryun Chung (HR)

Department of Computational Molecular Biology, Max-Planck-Institute for Molecular Genetics, Ihnenstraße 63-73, 14195, Berlin, Germany.

Martin Klingler (M)

Department of Biology, Division of Developmental Biology, Friedrich-Alexander-University of Erlangen-Nürnberg, Staudtstr. 5, 91058, Erlangen, Germany.

Sebastian Kittelmann (S)

Oxford Brookes University, Centre for Functional Genomics, Gipsy Lane, Oxford, OX3 0BP, UK.

Markus Friedrich (M)

Department of Anatomy and Cell Biology, Wayne State University, Detroit, MI, 48202, USA.

Rui Chen (R)

Baylor College of Medicine, Houston, Texas, USA.

Boran Altincicek (B)

Institute of Crop Science and Resource Conservation (INRES-Phytomedicine), Rheinische Friedrich-Wilhelms-University of Bonn, Bonn, Germany.

Andreas Vilcinskas (A)

Institute for Insect Biotechnology, Justus-Liebig University of Giessen, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany.

Evgeny Zdobnov (E)

Department of Genetic Medicine and Development, University of Geneva Medical School and Swiss Institute of Bioinformatics, 1211, Geneva, Switzerland.

Sam Griffiths-Jones (S)

Faculty of Biology, Medicine and Health, University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK.

Matthew Ronshaugen (M)

Faculty of Biology, Medicine and Health, University of Manchester, Michael Smith Building, Oxford Road, Manchester, M13 9PT, UK.

Mario Stanke (M)

Institut für Mathematik und Informatik, Universität Greifswald, Greifswald, Germany. mario.stanke@uni-greifswald.de.

Sue J Brown (SJ)

Division of Biology, Kansas State University, Manhattan, KS, 66506, USA. sjbrown@ksu.edu.

Gregor Bucher (G)

Georg-August-Universität Göttingen, Göttingen, Germany. gbucher1@uni-goettingen.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH