Chromosome-scale genome assembly of the mangrove climber species Dalbergia candenatensis.

Chromosomes, Plant Genome, Plant Molecular Sequence Annotation Rhizophoraceae Wetlands Dalbergia / genetics

Journal

Scientific data

ISSN: 2052-4463

Titre abrégé: Sci Data

Pays: England

ID NLM: 101640192

Informations de publication

Date de publication:
31 Oct 2024

Historique:

received: 15 07 2024

accepted: 23 10 2024

medline: 1 11 2024

pubmed: 1 11 2024

entrez: 1 11 2024

Statut: epublish

Résumé

Consisting of trees, climbers and herbs exclusively in the intertidal environments, mangrove forest is one of the most extreme and vulnerable ecosystems of our planet and has long been of great interest for biologists and ecologists. Here, we first assembled the chromosome-scale genome of a climber mangrove plant, Dalbergia candenatensis. The assembled genome size is approximately 474.55 Mb, with a scaffold N50 of 48.1 Mb, a complete BUSCO score of 98.4%, and a high LTR Assembly Index value of 21. The genome contained 283.46 Mb (59.74%) repetitive sequences, and 29,554 protein-coding genes were predicted, of which 87.54% were functionally annotated in five databases. The high-quality genome assembly and annotation presented herein provide a valuable genomic resource that will expedite genomic and evolutionary studies of mangrove plants and facilitate the elucidation of molecular mechanisms underlying the salt- and water-logging-tolerance of mangrove plants.

Identifiants

DOI: 10.1038/s41597-024-04032-2 PMID: 39482322

pubmed: 39482322

doi: 10.1038/s41597-024-04032-2

pii: 10.1038/s41597-024-04032-2

doi:

Types de publication

Dataset Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

1187

Subventions

Organisme : National Natural Science Foundation of China (National Science Foundation of China)

ID : 32070222

Organisme : National Natural Science Foundation of China (National Science Foundation of China)

ID : 32170232

Organisme : National Natural Science Foundation of China (National Science Foundation of China)

ID : 32271613

Informations de copyright

Références

Tomlinson, P. B. The botany of mangrove. (Cambridge University Press, 2016).

Lyu, H., He, Z., Wu, C. I. & Shi, S. Convergent adaptive evolution in marginal environments: unloading transposable elements as a common strategy among mangrove genomes. New Phytol. 217, 428–438 (2018).

pubmed: 28960318 doi: 10.1111/nph.14784

Feng, X. et al. Genomic insights into molecular adaptation to intertidal environments in the mangrove Aegiceras corniculatum. New Phytol. 231, 2346–2358 (2021).

pubmed: 34115401 doi: 10.1111/nph.17551

Wang, Y. & Gu, J. Ecological responses, adaptation and mechanisms of mangrove wetland ecosystem to global climate change and anthropogenic activities Int. Biodeterior. Biodegrad. 162, 105248 (2021).

doi: 10.1016/j.ibiod.2021.105248

FAO. The world’s mangroves 2000–2020. (2023).

Duke, N. C. et al. A world without mangroves. Science 317, 41–42 (2007).

pubmed: 17615322 doi: 10.1126/science.317.5834.41b

Ma, D. et al. Chromosome-level assembly of the mangrove plant Aegiceras corniculatum genome generated through Illumina, PacBio and Hi-C sequencing technologies. Mol. Ecol. Resour. 21, 1593–1607 (2021).

pubmed: 33550674 doi: 10.1111/1755-0998.13347

Klitgård, B. B. & Lavin, M. in Legumes of the world (eds Lewis, G., Schrire, B., Mackinder, B. & Lock, M.) 307-335 (Royal Botanical Garden, Kew, 2005).

Li, S. Dalbergia in Asia. (Science Press, 2017).

Qin, M. et al. Comparative analysis of complete plastid genome reveals powerful barcode regions for identifying wood of Dalbergia odorifera and D. tonkinensis (Leguminosae). J. Syst. Evol. 60, 73–84 (2022).

doi: 10.1111/jse.12598

Lavin, M. et al. The Dalbergioid legumes (Fabaceae): delimitation of a pantropical monophyletic clade. Am. J. Bot. 88, 503–533 (2001).

pubmed: 11250829 doi: 10.2307/2657116

Yang, J. et al. Chromosome-scale genomes of five Hongmu species in Leguminosae. Sci. Data 10, 710 (2023).

pubmed: 37848504 pmcid: 10582184 doi: 10.1038/s41597-023-02593-2

Sprent, J. I. Legume nodulation: a global perspective. (Wiley-Blackwell, 2009).

Huang, H. Genomic insights into adaptation to mangrove habitat in Dalbergia candenatensis Master thesis, University of Chinese Academy of Sciences, (2023).

Hunga, T. H. et al. Range-wide differential adaptation and genomic offset in critically endangered Asian rosewoods. Proc. Natl. Acad. Sci. USA 120, e2301603120 (2023).

doi: 10.1073/pnas.2301603120

Hong, Z. et al. The chromosome-level draft genome of Dalbergia odorifera. GigaScience 9, giaa084 (2020).

pubmed: 32808664 pmcid: 7433187 doi: 10.1093/gigascience/giaa084

Sahu, S. K. et al. Chromosome-scale genome of Indian rosewood (Dalbergia sissoo). Front Plant Sci. 14, 1218515 (2023).

pubmed: 37662156 pmcid: 10470032 doi: 10.3389/fpls.2023.1218515

Anisuzzman, M., Hasan, M. M., Acharzo, A. K., Das, A. K. & Rahman, S. In vivo and in vitro evaluation of pharmacological potentials of secondary bioactive metabolites of Dalbergia candenatensis leaves. Evid. Based Complementary Altern. Med. 2017, 5034827 (2017).

doi: 10.1155/2017/5034827

Hamburger, M. O., Cordell, G. A., Tantivatana, P. & Ruangrungsi, N. Traditional medicinal plants of Thailand, VIII. Isoflavonoids of Dalbergia candenatensis. J. Nat. Prod. 50, 696–699 (1987).

pubmed: 3430167 doi: 10.1021/np50052a020

Cheenpracha, S., Karalai, C., Ponglimanont, C. & Kanjana-Opas, A. Candenatenins A-F, phenolic compounds from the heartwood of Dalbergia candenatensis. J. Nat. Prod. 72, 1395–1398 (2009).

pubmed: 19653666 doi: 10.1021/np900077h

Sahu, S. K., Thangaraj, M. & Kathiresan, K. DNA extraction protocol for plants with high levels of secondary metabolites and polysaccharides without using liquid nitrogen and phenol. ISRN Mol. Biol. 2012, 205049 (2012).

pubmed: 27335662 doi: 10.5402/2012/205049

Chen, S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. Imeta 2, e107 (2023).

pubmed: 38868435 doi: 10.1002/imt2.107

Marçais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764–770 (2011).

pubmed: 21217122 pmcid: 3051319 doi: 10.1093/bioinformatics/btr011

Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).

pubmed: 28369201 doi: 10.1093/bioinformatics/btx153

Cheng, H., Concepcion, G. T., Feng, X., Zhang, H. & Li, H. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods 18, 170–175 (2021).

pubmed: 33526886 doi: 10.1038/s41592-020-01056-5

Goto, S., Tsuda, Y., Koike, Y., Chunlan, L. & Ide, Y. Effects of landscape and demographic history on genetic variation in Picea glehnii at the regional scale. Ecol. Res. 24, 1267–1277 (2009).

doi: 10.1007/s11284-009-0611-8

Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015).

pubmed: 25697820 doi: 10.1093/bioinformatics/btv098

Walker, B. J. et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9, e112963 (2014).

pubmed: 25409509 doi: 10.1371/journal.pone.0112963

Zhang, X., Zhang, S., Zhao, Q., Ming, R. & Tang, H. Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data. Nat. Plants 5, 833–845 (2019).

pubmed: 31383970 doi: 10.1038/s41477-019-0487-8

Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).

pubmed: 27467249 doi: 10.1016/j.cels.2016.07.002

Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl. Acad. Sci. USA 117, 9451–9457 (2020).

pubmed: 32300014 doi: 10.1073/pnas.1921046117

Bao, W., Kojima, K. K. & Kohany, O. Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob. DNA 6, 11 (2015).

pubmed: 26045719 doi: 10.1186/s13100-015-0041-9

Storer, J., Hubley, R., Rosen, J., Wheeler, T. J. & Smit, A. F. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob. DNA 12, 2 (2021).

pubmed: 33436076 doi: 10.1186/s13100-020-00230-y

Tarailo-Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinformatics Chapter 4, 4.10.11–14.10.14 (2009).

Yan, H., Bombarely, A. & Li, S. DeepTE: a computational method for de novo classification of transposons with convolutional neural network. Bioinformatics 36, 4269–4275 (2020).

pubmed: 32415954 doi: 10.1093/bioinformatics/btaa519

Stanke, M., Schöffmann, O., Morgenstern, B. & Waack, S. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinform. 7, 62 (2006).

doi: 10.1186/1471-2105-7-62

Guigó, R., Knudsen, S., Drake, N. & Smith, T. Prediction of gene structure. J. Mol. Biol. 226, 141–157 (1992).

pubmed: 1619647 doi: 10.1016/0022-2836(92)90130-C

Korf, I. Gene finding in novel genomes. BMC Bioinform. 5, 59 (2004).

doi: 10.1186/1471-2105-5-59

Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).

pubmed: 15145805 doi: 10.1093/bioinformatics/bth315

Lomsadze, A., Ter-Hovhannisyan, V., Chernoff, Y. O. & Borodovsky, M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 33, 6494–6506 (2005).

pubmed: 16314312 doi: 10.1093/nar/gki937

Keilwagen, J., Hartung, F., Paulini, M., Twardziok, S. O. & Grau, J. Combining RNA-seq data and homology-based gene prediction for plants, animals and fungi. BMC Bioinform. 19, 189 (2018).

doi: 10.1186/s12859-018-2203-5

The Arabidopsis Genome Initiative. Analysis of the genome sequence of the fowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000).

doi: 10.1038/35048692

Goff, S. A. et al. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science 296, 92–100 (2002).

Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).

pubmed: 21572440 doi: 10.1038/nbt.1883

Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).

pubmed: 14500829 doi: 10.1093/nar/gkg770

Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).

pubmed: 18190707 doi: 10.1186/gb-2008-9-1-r7

Huerta-Cepas, J. et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47, D309–d314 (2019).

doi: 10.1093/nar/gky1085 pubmed: 30418610

Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25, 25–29 (2000).

pubmed: 10802651 pmcid: 3037419 doi: 10.1038/75556

Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–d114 (2012).

pubmed: 22080510 doi: 10.1093/nar/gkr988

Coudert, E. et al. Annotation of biologically relevant ligands in UniProtKB using ChEBI. Bioinformatics 39, btac793 (2023).

pubmed: 36484697 doi: 10.1093/bioinformatics/btac793

Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).

pubmed: 25402007 doi: 10.1038/nmeth.3176

Tang, H. et al. Synteny and collinearity in plant genomes. Science 320, 486–488 (2008).

pubmed: 18436778 doi: 10.1126/science.1153917

Chen, C. et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant 13, 1194–1202 (2020).

pubmed: 32585190 doi: 10.1016/j.molp.2020.06.009

NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP513077 (2024).

Shi, M., Zhang, Y., Huang, H. & Tu, T. Dalbergia candenatensis isolate MS-2024a, whole genome shotgun sequencing project. GenBank https://identifiers.org/ncbi/insdc:JBHFQC000000000 (2024).

Shi, M. et al. Chromosome-scale genome assembly of the mangrove climber species Dalbergia candenatensis. Figshare https://doi.org/10.6084/m9.figshare.26170126 (2024).

Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO Update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654 (2021).

pubmed: 34320186 doi: 10.1093/molbev/msab199

Ou, S. & Jiang, N. LTR_retriever: A highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2017).

pubmed: 29233850 pmcid: 5813529 doi: 10.1104/pp.17.01310

Li, H. Minimap2: Pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).

pubmed: 29750242 pmcid: 6137996 doi: 10.1093/bioinformatics/bty191

Kim, D., Langmead, B. & Salzberg, S. L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).

pubmed: 25751142 pmcid: 4655817 doi: 10.1038/nmeth.3317

Chromosome-scale genome assembly of the mangrove climber species Dalbergia candenatensis.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Informations de copyright

Références

Auteurs

Miaomiao Shi (M)

Yu Zhang (Y)

Huiwen Huang (H)

Shiran Gu (S)

Xiangping Wang (X)

Shijin Li (S)

Zhongtao Zhao (Z)

Tieyao Tu (T)

Articles similaires

Exploring the complexity of genome size reduction in angiosperms.

A stepwise guide for pangenome development in crop plants: an alfalfa (Medicago sativa) case study.

Fine mapping of a major QTL, qECQ8, for rice taste quality.

Genome-wide analysis of the Amorphophallus konjac AkCSLA gene family and its functional characterization in drought tolerance of transgenic arabidopsis.

Classifications MeSH