Optical maps refine the bread wheat Triticum aestivum cv. Chinese Spring genome assembly.


Journal

The Plant journal : for cell and molecular biology
ISSN: 1365-313X
Titre abrégé: Plant J
Pays: England
ID NLM: 9207397

Informations de publication

Date de publication:
07 2021
Historique:
revised: 12 04 2021
received: 17 11 2020
accepted: 19 04 2021
pubmed: 25 4 2021
medline: 14 10 2021
entrez: 24 4 2021
Statut: ppublish

Résumé

Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and > 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.

Identifiants

pubmed: 33893684
doi: 10.1111/tpj.15289
pmc: PMC8360199
doi:

Substances chimiques

DNA Transposable Elements 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

303-314

Informations de copyright

© 2021 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.

Références

Nucleic Acids Res. 2012 Apr;40(7):e49
pubmed: 22217600
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Bioinformatics. 2016 Jun 1;32(11):1749-51
pubmed: 26826718
Trends Genet. 2000 Jun;16(6):276-7
pubmed: 10827456
Plant J. 2018 Aug;95(3):487-503
pubmed: 29770515
Nat Genet. 2019 May;51(5):885-895
pubmed: 30962619
Science. 2017 Jul 7;357(6346):93-97
pubmed: 28684525
Science. 2018 Aug 17;361(6403):
pubmed: 30115783
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712
Genetics. 2020 Oct;216(2):599-608
pubmed: 32796007
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
PLoS One. 2013;8(2):e55864
pubmed: 23405223
Science. 2014 Jul 18;345(6194):1251788
pubmed: 25035500
Front Genet. 2020 Aug 18;11:891
pubmed: 33014014
Genome Biol. 2019 Dec 18;20(1):284
pubmed: 31849336
Hortic Res. 2019 Mar 25;6:55
pubmed: 30937174
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Sci Rep. 2018 Mar 26;8(1):5181
pubmed: 29581476
Plant Biotechnol J. 2014 Aug;12(6):787-96
pubmed: 24646323
Nature. 2017 Apr 26;544(7651):427-433
pubmed: 28447635
Gigascience. 2017 Nov 1;6(11):1-7
pubmed: 29069494
Nature. 2017 Nov 23;551(7681):498-502
pubmed: 29143815
Genome Biol. 2014;15(12):546
pubmed: 25476263
Plant Biotechnol J. 2010 Feb;8(2):196-210
pubmed: 20078842
PLoS One. 2014 Nov 19;9(11):e112963
pubmed: 25409509
Science. 2008 Oct 3;322(5898):101-4
pubmed: 18832645
PLoS One. 2013 Jun 19;8(6):e66428
pubmed: 23840465
Science. 2007 Jun 29;316(5833):1862-6
pubmed: 17600208
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W265-8
pubmed: 17485477
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
G3 (Bethesda). 2019 Mar 7;9(3):619-624
pubmed: 30622124
Front Plant Sci. 2017 Oct 20;8:1798
pubmed: 29104581
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Genome Biol. 2018 Aug 17;19(1):103
pubmed: 30115100
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
PLoS One. 2018 Jan 2;13(1):e0186329
pubmed: 29293495
Nat Rev Genet. 2007 Dec;8(12):973-82
pubmed: 17984973

Auteurs

Tingting Zhu (T)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Le Wang (L)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Hélène Rimbert (H)

GDEC, Université Clermont Auvergne, INRAE, Clermont-Ferrand, 63000, France.

Juan C Rodriguez (JC)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Karin R Deal (KR)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Romain De Oliveira (R)

GDEC, Université Clermont Auvergne, INRAE, Clermont-Ferrand, 63000, France.

Frédéric Choulet (F)

GDEC, Université Clermont Auvergne, INRAE, Clermont-Ferrand, 63000, France.

Gabriel Keeble-Gagnère (G)

Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia.

Josquin Tibbits (J)

Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia.

Jane Rogers (J)

International Wheat Genome Sequencing Consortium, Eau Claire, WI, 54701, USA.

Kellye Eversole (K)

International Wheat Genome Sequencing Consortium, Eau Claire, WI, 54701, USA.

Rudi Appels (R)

Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia.
International Wheat Genome Sequencing Consortium, Eau Claire, WI, 54701, USA.

Yong Q Gu (YQ)

Crop Improvement and Genetics Research Unit, USDA-ARS, Albany, CA, 94710, USA.

Martin Mascher (M)

Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany.

Jan Dvorak (J)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Ming-Cheng Luo (MC)

Department of Plant Sciences, University of California, Davis, CA, 95616, USA.

Articles similaires

Genome Size Genome, Plant Magnoliopsida Evolution, Molecular Arabidopsis
Triticum Transcription Factors Gene Expression Regulation, Plant Plant Proteins Salt Stress
Zea mays Triticum China Seasons Crops, Agricultural
Genome, Plant Medicago sativa Crops, Agricultural Genomics Polyploidy

Classifications MeSH