The Capparis spinosa var. herbacea genome provides the first genomic instrument for a diversity and evolution study of the Capparaceae family.

Capparis spinosa var. herbacea genome assembly population evolution

Journal

GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872

Informations de publication

Date de publication:
30 10 2022
Historique:
received: 23 03 2022
revised: 17 06 2022
accepted: 13 10 2022
entrez: 31 10 2022
pubmed: 1 11 2022
medline: 2 11 2022
Statut: ppublish

Résumé

The caper bush Capparis spinosa L., one of the most economically important species of Capparaceae, is a xerophytic shrub that is well adapted to drought and harsh environments. However, genetic studies on this species are limited because of the lack of its reference genome. We sequenced and assembled the Capparis spinosa var. herbacea (Willd.) genome using data obtained from the combination of PacBio circular consensus sequencing and high-throughput chromosome conformation capture. The final genome assembly was approximately 274.53 Mb (contig N50 length of 9.36 Mb, scaffold N50 of 15.15 Mb), 99.23% of which was assigned to 21 chromosomes. In the whole-genome sequence, tandem repeats accounted for 19.28%, and transposable element sequences accounted for 43.98%. The proportion of tandem repeats in the C. spinosa var. herbacea genome was much higher than the average of 8.55% in plant genomes. A total of 21,577 protein-coding genes were predicted, with 98.82% being functionally annotated. The result of species divergence times showed that C. spinosa var. herbacea and Tarenaya hassleriana separated from a common ancestor 43.31 million years ago. This study reported a high-quality reference genome assembly and genome features for the Capparaceae family. The assembled C. spinosa var. herbacea genome might provide a system for studying the diversity, speciation, and evolution of this family and serve as an important resource for understanding the mechanism of drought and high-temperature resistance.

Sections du résumé

BACKGROUND
The caper bush Capparis spinosa L., one of the most economically important species of Capparaceae, is a xerophytic shrub that is well adapted to drought and harsh environments. However, genetic studies on this species are limited because of the lack of its reference genome.
FINDINGS
We sequenced and assembled the Capparis spinosa var. herbacea (Willd.) genome using data obtained from the combination of PacBio circular consensus sequencing and high-throughput chromosome conformation capture. The final genome assembly was approximately 274.53 Mb (contig N50 length of 9.36 Mb, scaffold N50 of 15.15 Mb), 99.23% of which was assigned to 21 chromosomes. In the whole-genome sequence, tandem repeats accounted for 19.28%, and transposable element sequences accounted for 43.98%. The proportion of tandem repeats in the C. spinosa var. herbacea genome was much higher than the average of 8.55% in plant genomes. A total of 21,577 protein-coding genes were predicted, with 98.82% being functionally annotated. The result of species divergence times showed that C. spinosa var. herbacea and Tarenaya hassleriana separated from a common ancestor 43.31 million years ago.
CONCLUSIONS
This study reported a high-quality reference genome assembly and genome features for the Capparaceae family. The assembled C. spinosa var. herbacea genome might provide a system for studying the diversity, speciation, and evolution of this family and serve as an important resource for understanding the mechanism of drought and high-temperature resistance.

Identifiants

pubmed: 36310248
pii: 6780308
doi: 10.1093/gigascience/giac106
pmc: PMC9618406
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press GigaScience.

Références

BMC Bioinformatics. 2018 May 30;19(1):189
pubmed: 29843602
Bioinformatics. 2018 Sep 1;34(17):i884-i890
pubmed: 30423086
Mol Biol Rep. 2019 Apr;46(2):2209-2219
pubmed: 30771147
Nat Methods. 2015 Jan;12(1):59-60
pubmed: 25402007
Am J Bot. 2020 Aug;107(8):1148-1164
pubmed: 32830865
Genome Biol. 2015 Dec 01;16:259
pubmed: 26619908
BMC Bioinformatics. 2004 May 14;5:59
pubmed: 15144565
Genome Biol. 2004;5(2):R7
pubmed: 14759257
Nucleic Acids Res. 2018 Nov 30;46(21):e126
pubmed: 30107434
Bioinformatics. 2017 Sep 01;33(17):2759-2761
pubmed: 28472236
Nat Protoc. 2016 Sep;11(9):1650-67
pubmed: 27560171
Mitochondrion. 2014 Nov;19 Pt B:135-43
pubmed: 24907441
Syst Biol. 2007 Aug;56(4):564-77
pubmed: 17654362
Methods Mol Biol. 2016;1374:23-54
pubmed: 26519399
Nucleic Acids Res. 2019 Jan 8;47(D1):D155-D162
pubmed: 30423142
Plant Physiol. 2018 Feb;176(2):1410-1422
pubmed: 29233850
Curr Protoc Bioinformatics. 2009 Mar;Chapter 4:Unit 4.10
pubmed: 19274634
Methods Mol Biol. 2019;1962:1-14
pubmed: 31020551
Nat Commun. 2020 Nov 16;11(1):5817
pubmed: 33199703
Nutrients. 2018 Jan 24;10(2):
pubmed: 29364841
Innovation (Camb). 2021 Jul 01;2(3):100141
pubmed: 34557778
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7
pubmed: 25392425
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Nat Biotechnol. 2011 May 15;29(7):644-52
pubmed: 21572440
Ann Bot. 2005 Jan;95(1):99-110
pubmed: 15596459
Bioinformatics. 2017 Aug 15;33(16):2583-2585
pubmed: 28398459
Nucleic Acids Res. 2012 Jan;40(Database issue):D565-70
pubmed: 22123736
Nucleic Acids Res. 2011 Jan;39(Database issue):D225-9
pubmed: 21109532
Front Plant Sci. 2017 Oct 25;8:1845
pubmed: 29118777
BMC Genomics. 2020 Aug 13;21(1):556
pubmed: 32791963
Gigascience. 2020 Mar 1;9(3):
pubmed: 32141509
Genome Biol. 2020 Sep 14;21(1):245
pubmed: 32928274
Trends Plant Sci. 2017 Jan;22(1):53-65
pubmed: 27666516
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654
pubmed: 34320186
Nucleic Acids Res. 1999 Jan 15;27(2):573-80
pubmed: 9862982
Sci Rep. 2019 Jul 18;9(1):10411
pubmed: 31320697
Hortic Res. 2021 Jan 31;8(1):35
pubmed: 33517348
Proc Natl Acad Sci U S A. 2020 Apr 28;117(17):9451-9457
pubmed: 32300014
Nat Biotechnol. 2015 Mar;33(3):290-5
pubmed: 25690850
J Agric Food Chem. 2002 Feb 27;50(5):1168-71
pubmed: 11853498
Nucleic Acids Res. 2015 Jul 13;43(12):e78
pubmed: 25870408
Genome Res. 2009 Sep;19(9):1639-45
pubmed: 19541911
Bioinformatics. 2007 May 1;23(9):1061-7
pubmed: 17332020
BMC Bioinformatics. 2008 Jan 14;9:18
pubmed: 18194517
Bioinformatics. 2011 Mar 15;27(6):757-63
pubmed: 21216780
Genome Biol Evol. 2022 May 3;14(5):
pubmed: 35482027
Mol Biol Evol. 2017 Jul 1;34(7):1812-1819
pubmed: 28387841
Yi Chuan. 2018 Jan 20;40(1):44-56
pubmed: 29367192
J Ethnopharmacol. 2012 Jun 26;142(1):48-52
pubmed: 22579676
Nat Commun. 2020 Mar 18;11(1):1432
pubmed: 32188846
Nucleic Acids Res. 2003 Jan 1;31(1):365-70
pubmed: 12520024
Science. 2010 Jan 1;327(5961):92-4
pubmed: 20044577
PLoS One. 2016 Oct 5;11(10):e0163962
pubmed: 27706213
Genome Biol. 2019 Nov 14;20(1):238
pubmed: 31727128
J Agric Food Chem. 2005 Sep 7;53(18):7136-41
pubmed: 16131121
Biochem Biophys Res Commun. 2012 Mar 23;419(4):779-81
pubmed: 22390928
Nucleic Acids Res. 2019 Dec 2;47(21):10994-11006
pubmed: 31584084
Nat Methods. 2021 Feb;18(2):170-175
pubmed: 33526886
Comput Appl Biosci. 1997 Oct;13(5):555-6
pubmed: 9367129
Mol Biol Evol. 2015 Jan;32(1):268-74
pubmed: 25371430
Nucleic Acids Res. 2021 Jan 8;49(D1):D412-D419
pubmed: 33125078
J Plant Res. 2007 Mar;120(2):219-28
pubmed: 17024517
Bioinformatics. 2020 May 1;36(9):2896-2898
pubmed: 31971576
J Exp Bot. 2020 Jul 6;71(14):3966-3985
pubmed: 32293686
Genome Biol. 2019 Feb 21;20(1):38
pubmed: 30791939
Curr Protoc Bioinformatics. 2014 Sep 08;47:11.12.1-34
pubmed: 25199790
Nat Biotechnol. 2013 Dec;31(12):1119-25
pubmed: 24185095
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W265-8
pubmed: 17485477
Genome Biol. 2008 Jan 11;9(1):R7
pubmed: 18190707
Nucleic Acids Res. 2019 Jan 8;47(D1):D419-D426
pubmed: 30407594
Food Chem Toxicol. 2018 Aug;118:181-189
pubmed: 29751072
Nucleic Acids Res. 2000 Jan 1;28(1):27-30
pubmed: 10592173
Mol Biol Evol. 2013 Aug;30(8):1987-97
pubmed: 23709260
Bioinformatics. 2019 Dec 15;35(24):5321-5322
pubmed: 31292621
Nat Methods. 2017 Jun;14(6):587-589
pubmed: 28481363
Plants (Basel). 2021 Jun 17;10(6):
pubmed: 34204211
Biomed Pharmacother. 2017 Mar;87:171-179
pubmed: 28056421
Zhongguo Zhong Yao Za Zhi. 2008 Nov;33(21):2453-8
pubmed: 19149246
Bioinformatics. 2013 Nov 15;29(22):2933-5
pubmed: 24008419
Biomed Res Int. 2016;2016:5792708
pubmed: 27314028
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712

Auteurs

Lei Wang (L)

State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China.
University of Chinese Academy of Sciences, Beijing 100049, China.

Liqiang Fan (L)

Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, Henan 455000, China.
Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou 450000, China.

Zhenyong Zhao (Z)

State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China.
University of Chinese Academy of Sciences, Beijing 100049, China.

Zhibin Zhang (Z)

Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, Henan 455000, China.
Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou 450000, China.

Li Jiang (L)

State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China.
University of Chinese Academy of Sciences, Beijing 100049, China.

Mao Chai (M)

Institute of Cotton Research of the Chinese Academy of Agricultural Sciences, Anyang, Henan 455000, China.
Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou University, Zhengzhou 450000, China.

Changyan Tian (C)

State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China.
University of Chinese Academy of Sciences, Beijing 100049, China.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins
Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family

Classifications MeSH