Chromosome-level draft genome of a diploid plum (Prunus salicina).
Chromosome-level
Genome
Plum
Prunus
Journal
GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872
Informations de publication
Date de publication:
10 12 2020
10 12 2020
Historique:
received:
26
06
2020
revised:
28
08
2020
accepted:
29
10
2020
entrez:
10
12
2020
pubmed:
11
12
2020
medline:
26
10
2021
Statut:
ppublish
Résumé
Plums are one of the most economically important Rosaceae fruit crops and comprise dozens of species distributed across the world. Until now, only limited genomic information has been available for the genetic studies and breeding programs of plums. Prunus salicina, an important diploid plum species, plays a predominant role in modern commercial plum production. Here we selected P. salicina for whole-genome sequencing and present a chromosome-level genome assembly through the combination of Pacific Biosciences sequencing, Illumina sequencing, and Hi-C technology. The assembly had a total size of 284.2 Mb, with contig N50 of 1.78 Mb and scaffold N50 of 32.32 Mb. A total of 96.56% of the assembled sequences were anchored onto 8 pseudochromosomes, and 24,448 protein-coding genes were identified. Phylogenetic analysis showed that P. salicina had a close relationship with Prunus mume and Prunus armeniaca, with P. salicina diverging from their common ancestor ∼9.05 million years ago. During P. salicina evolution 146 gene families were expanded, and some cell wall-related GO terms were significantly enriched. It was noteworthy that members of the DUF579 family, a new class involved in xylan biosynthesis, were significantly expanded in P. salicina, which provided new insight into the xylan metabolism in plums. We constructed the first high-quality chromosome-level plum genome using Pacific Biosciences, Illumina, and Hi-C technologies. This work provides a valuable resource for facilitating plum breeding programs and studying the genetic diversity mechanisms of plums and Prunus species.
Sections du résumé
BACKGROUND
Plums are one of the most economically important Rosaceae fruit crops and comprise dozens of species distributed across the world. Until now, only limited genomic information has been available for the genetic studies and breeding programs of plums. Prunus salicina, an important diploid plum species, plays a predominant role in modern commercial plum production. Here we selected P. salicina for whole-genome sequencing and present a chromosome-level genome assembly through the combination of Pacific Biosciences sequencing, Illumina sequencing, and Hi-C technology.
FINDINGS
The assembly had a total size of 284.2 Mb, with contig N50 of 1.78 Mb and scaffold N50 of 32.32 Mb. A total of 96.56% of the assembled sequences were anchored onto 8 pseudochromosomes, and 24,448 protein-coding genes were identified. Phylogenetic analysis showed that P. salicina had a close relationship with Prunus mume and Prunus armeniaca, with P. salicina diverging from their common ancestor ∼9.05 million years ago. During P. salicina evolution 146 gene families were expanded, and some cell wall-related GO terms were significantly enriched. It was noteworthy that members of the DUF579 family, a new class involved in xylan biosynthesis, were significantly expanded in P. salicina, which provided new insight into the xylan metabolism in plums.
CONCLUSIONS
We constructed the first high-quality chromosome-level plum genome using Pacific Biosciences, Illumina, and Hi-C technologies. This work provides a valuable resource for facilitating plum breeding programs and studying the genetic diversity mechanisms of plums and Prunus species.
Identifiants
pubmed: 33300949
pii: 6029397
doi: 10.1093/gigascience/giaa130
pmc: PMC7727024
pii:
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© The Author(s) 2020. Published by Oxford University Press on behalf of [GigaScience].
Références
Plant J. 2020 Jan;101(2):455-472
pubmed: 31529539
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
DNA Res. 2017 Oct 1;24(5):499-508
pubmed: 28541388
Plant J. 2011 May;66(3):401-13
pubmed: 21251108
Nat Genet. 2019 Mar;51(3):541-547
pubmed: 30804557
Mol Biol Evol. 2007 Aug;24(8):1586-91
pubmed: 17483113
J Mol Biol. 1997 Apr 25;268(1):78-94
pubmed: 9149143
PLoS One. 2018 Dec 3;13(12):e0208032
pubmed: 30507961
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W309-12
pubmed: 15215400
Gigascience. 2012 Dec 27;1(1):18
pubmed: 23587118
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Genome Biol. 2020 Dec 29;21(1):306
pubmed: 33372615
Gigascience. 2019 Dec 1;8(12):
pubmed: 31816089
Cell Syst. 2018 Feb 28;6(2):256-258.e1
pubmed: 29428417
Bioinformatics. 2014 May 1;30(9):1236-40
pubmed: 24451626
Cytogenet Genome Res. 2005;110(1-4):462-7
pubmed: 16093699
Curr Protoc Bioinformatics. 2007 Jun;Chapter 4:Unit 4.3
pubmed: 18428791
Gigascience. 2020 Mar 1;9(3):
pubmed: 32141509
Hortic Res. 2019 Nov 18;6:128
pubmed: 31754435
Nucleic Acids Res. 1997 Mar 1;25(5):955-64
pubmed: 9023104
Bioinformatics. 2005 Jun;21 Suppl 1:i351-8
pubmed: 15961478
Genome Res. 2004 May;14(5):988-95
pubmed: 15123596
Mol Biol Evol. 2017 Feb 1;34(2):262-281
pubmed: 27856652
Front Plant Sci. 2018 Jun 07;9:692
pubmed: 29930561
Nat Commun. 2012;3:1318
pubmed: 23271652
Plant J. 2011 May;66(3):387-400
pubmed: 21288268
Hortic Res. 2019 Apr 5;6:58
pubmed: 30962943
Nucleic Acids Res. 2003 Jan 1;31(1):439-41
pubmed: 12520045
Nucleic Acids Res. 1999 Jan 15;27(2):573-80
pubmed: 9862982
PLoS One. 2014 Apr 03;9(4):e92644
pubmed: 24699266
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Hortic Res. 2020 Aug 1;7(1):122
pubmed: 32821405
Science. 2019 Jun 14;364(6445):1095-1098
pubmed: 31197015
BMC Bioinformatics. 2018 Nov 29;19(1):460
pubmed: 30497373
BMC Evol Biol. 2007 Nov 08;7:214
pubmed: 17996036
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
Nat Commun. 2019 Apr 2;10(1):1494
pubmed: 30940818
Proc Natl Acad Sci U S A. 2012 Jul 10;109(28):E1980-9
pubmed: 22733783
Bioinformatics. 2007 May 1;23(9):1061-7
pubmed: 17332020
Methods Mol Biol. 2007;396:59-70
pubmed: 18025686
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30
pubmed: 24288371
Nucleic Acids Res. 2000 Jan 1;28(1):45-8
pubmed: 10592178
Nat Genet. 2010 Oct;42(10):833-9
pubmed: 20802477
G3 (Bethesda). 2019 Jul 9;9(7):2051-2060
pubmed: 31126974
Curr Opin Biotechnol. 2014 Apr;26:100-7
pubmed: 24679265
Gigascience. 2020 Dec 10;9(12):
pubmed: 33300949
Curr Protoc Bioinformatics. 2009 Mar;Chapter 4:4.10.1-4.10.14
pubmed: 19274634
Plant J. 2016 Sep;87(6):535-47
pubmed: 27228578
Mol Biol Evol. 2015 Jan;32(1):268-74
pubmed: 25371430
Nat Genet. 2013 May;45(5):487-94
pubmed: 23525075
Nat Methods. 2015 Apr;12(4):357-60
pubmed: 25751142
Plant Biotechnol J. 2020 Feb;18(2):581-595
pubmed: 31368610
Bioinformatics. 2006 May 15;22(10):1269-71
pubmed: 16543274
PLoS One. 2014 Nov 19;9(11):e112963
pubmed: 25409509
Nat Methods. 2016 Dec;13(12):1050-1054
pubmed: 27749838
Nat Biotechnol. 2013 Dec;31(12):1119-25
pubmed: 24185095
Genome Res. 2013 Feb;23(2):396-408
pubmed: 23149293
Genome Biol. 2008 Jan 11;9(1):R7
pubmed: 18190707
F1000Res. 2015 Nov 20;4:1310
pubmed: 26835000
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W265-8
pubmed: 17485477
BMC Biol. 2006 Dec 07;4:41
pubmed: 17156431
Nucleic Acids Res. 2000 Jan 1;28(1):27-30
pubmed: 10592173
Plant Direct. 2019 Feb 12;3(2):e00117
pubmed: 31245760
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
Nat Methods. 2013 Jun;10(6):563-9
pubmed: 23644548
Nat Genet. 2017 Jul;49(7):1099-1106
pubmed: 28581499
Genome Biol. 2015 Aug 06;16:157
pubmed: 26243257
Bioinformatics. 2004 Nov 1;20(16):2878-9
pubmed: 15145805
Bioinformatics. 2013 Nov 15;29(22):2933-5
pubmed: 24008419
Nat Genet. 2011 Feb;43(2):109-16
pubmed: 21186353
BMC Bioinformatics. 2004 May 14;5:59
pubmed: 15144565