Chromosomal level genome assemblies of two Malus crabapple cultivars Flame and Royalty.


Journal

Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192

Informations de publication

Date de publication:
13 Feb 2024
Historique:
received: 07 11 2023
accepted: 05 02 2024
medline: 15 2 2024
pubmed: 14 2 2024
entrez: 13 2 2024
Statut: epublish

Résumé

Malus hybrid 'Flame' and Malus hybrid 'Royalty' are representative ornamental crabapples, rich in flavonoids and serving as the preferred materials for studying the coloration mechanism. We generated two sets of high-quality chromosome-level and haplotype-resolved genome of 'Flame' with sizes of 688.2 Mb and 675.7 Mb, and those of 'Royalty' with sizes of 674.1 Mb and 663.6 Mb, all anchored to 17 chromosomes and with a high BUSCO completeness score nearly 99.0%. A total of 47,833 and 47,307 protein-coding genes were annotated in the two haplotype genomes of 'Flame', and the numbers of 'Royalty' were 46,305 and 46,920 individually. The assembled high-quality genomes offer new resources for studying the origin and adaptive evolution of crabapples and the molecular basis of the accumulation of flavonoids and anthocyanins, facilitating molecular breeding of Malus plants.

Identifiants

pubmed: 38351118
doi: 10.1038/s41597-024-03049-x
pii: 10.1038/s41597-024-03049-x
pmc: PMC10864326
doi:

Substances chimiques

Anthocyanins 0
Flavonoids 0

Types de publication

Dataset Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

201

Subventions

Organisme : Beijing University of Agriculture (BUA)
ID : Start-up fund

Informations de copyright

© 2024. The Author(s).

Références

Wang, Z., Wang, W., Zhang, J., Song, T. & Yao, Y. Genetic diversity and phylogenetic relationships analysis of major ornamental crabapple species. Journal of Fruit Science 31, 1005–1016 (2014).
Tian, J. et al. The Balance of Expression of Dihydroflavonol 4-reductase and Flavonol Synthase Regulates Flavonoid Biosynthesis and Red Foliage Coloration in Crabapples. Sci Rep 5, 12228 (2015).
pubmed: 26192267 pmcid: 4507444 doi: 10.1038/srep12228
Li, H. et al. MdMYB8 is associated with flavonol biosynthesis via the activation of the MdFLS promoter in the fruits of Malus crabapple. Hort. Res. 7 (2020).
He, X. & Liu, R. H. Phytochemicals of apple peels: isolation, structure elucidation, and their antiproliferative and antioxidant activities. J Agr Food Chem 56, 9905–9910 (2008).
doi: 10.1021/jf8015255
Boyer, J. & Liu, R. H. Apple phytochemicals and their health benefits. Nutr. J. 3, 1–15 (2004).
doi: 10.1186/1475-2891-3-5
Lu, Y. et al. Flavonoid accumulation plays an important role in the rust resistance of Malus plant leaves. Front Plant Sci 8, 1286 (2017).
pubmed: 28769974 pmcid: 5514348 doi: 10.3389/fpls.2017.01286
Liu, F., Wang, M. & Wang, M. Phenolic compounds and antioxidant activities of flowers, leaves and fruits of five crabapple cultivars (Malus Mill. species). Sci. Hortic. 235, 460–467 (2018).
doi: 10.1016/j.scienta.2018.02.051
Wang, Y.-R. et al. Different coloration patterns between the red-and white-fleshed fruits of malus crabapples. Sci. Hortic. 194, 26–33 (2015).
doi: 10.1016/j.scienta.2015.07.041
Jiang, R., Tian, J., Song, T., Zhang, J. & Yao, Y. The Malus crabapple transcription factor McMYB10 regulates anthocyanin biosynthesis during petal coloration. Sci. Hortic. 166, 42–49 (2014).
doi: 10.1016/j.scienta.2013.12.002
Tian, J. et al. Mc MYB 10 regulates coloration via activating McF3’H and later structural genes in ever‐red leaf crabapple. Plant Biotechnol. J. 13, 948–961 (2015).
pubmed: 25641214 doi: 10.1111/pbi.12331
Tian, J. et al. Characteristics of dihydroflavonol 4-reductase gene promoters from different leaf colored Malus crabapple cultivars. Hort. Res. 4 (2017).
Li, K.-T. et al. McMYB10 modulates the expression of a Ubiquitin Ligase, McCOP1 during leaf coloration in crabapple. Front Plant Sci 9, 704 (2018).
pubmed: 29915606 pmcid: 5994411 doi: 10.3389/fpls.2018.00704
Tian, J. et al. McMYB12 transcription factors co-regulate proanthocyanidin and anthocyanin biosynthesis in Malus crabapple. Sci. Rep. 7, 43715 (2017).
pubmed: 28255171 pmcid: 5334656 doi: 10.1038/srep43715
Tai, D., Tian, J., Zhang, J., Song, T. & Yao, Y. A Malus crabapple chalcone synthase gene, McCHS, regulates red petal color and flavonoid biosynthesis. PLoS One 9, e110570 (2014).
pubmed: 25357207 pmcid: 4214706 doi: 10.1371/journal.pone.0110570
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
pubmed: 19815776 pmcid: 2858594 doi: 10.1126/science.1181369
Marcais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764-770 (2011).
pubmed: 21217122 pmcid: 3051319 doi: 10.1093/bioinformatics/btr011
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
pubmed: 28369201 pmcid: 5870704 doi: 10.1093/bioinformatics/btx153
Cheng, H. et al. Haplotype-resolved assembly of diploid genomes without parental data. Nat Biotechnol. Nat. Biotechnol 40, 1332–1335 (2022).
pubmed: 35332338 doi: 10.1038/s41587-022-01261-x
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
pubmed: 27467249 pmcid: 5846465 doi: 10.1016/j.cels.2016.07.002
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
pubmed: 28336562 pmcid: 5635820 doi: 10.1126/science.aal3327
Robinson, J. T. et al. Juicebox. js provides a cloud-based visualization system for Hi-C data. Cell Syst. 6, 256–258. e251 (2018).
pubmed: 29428417 pmcid: 6047755 doi: 10.1016/j.cels.2018.01.001
Alonge, M. et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 23, 258 (2022).
pubmed: 36522651 pmcid: 9753292 doi: 10.1186/s13059-022-02823-7
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
pubmed: 9862982 pmcid: 148217 doi: 10.1093/nar/27.2.573
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
pubmed: 16093699 doi: 10.1159/000084979
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005).
pubmed: 15961478 doi: 10.1093/bioinformatics/bti1018
Zhang, L. et al. A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat. Commun. 10, 1494 (2019).
pubmed: 30940818 pmcid: 6445120 doi: 10.1038/s41467-019-09518-x
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP200472 (2019).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP200468 (2019).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
pubmed: 31375807 pmcid: 7605509 doi: 10.1038/s41587-019-0201-4
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
pubmed: 25690850 pmcid: 4643835 doi: 10.1038/nbt.3122
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 6, 31 (2005).
doi: 10.1186/1471-2105-6-31
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–W439 (2006).
pubmed: 16845043 pmcid: 1538822 doi: 10.1093/nar/gkl200
Brůna, T., Hoff, K. J., Lomsadze, A., Stanke, M. & Borodovsky, M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR genomics and bioinformatics 3, lqaa108 (2021).
pubmed: 33575650 pmcid: 7787252 doi: 10.1093/nargab/lqaa108
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinform. 12, 491 (2011).
doi: 10.1186/1471-2105-12-491
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717 doi: 10.1093/bioinformatics/btv351
Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
pubmed: 33828273 pmcid: 8026399 doi: 10.1038/s41592-021-01101-x
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
pubmed: 26553804 doi: 10.1093/nar/gkv1189
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).
pubmed: 12520024 pmcid: 165542 doi: 10.1093/nar/gkg095
Consortium, U. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
doi: 10.1093/nar/gky1049
Mergner, J. et al. Mass-spectrometry-based draft of the Arabidopsis proteome. Nature 579, 409–414 (2020).
pubmed: 32188942 doi: 10.1038/s41586-020-2094-2
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
pubmed: 24451626 pmcid: 3998142 doi: 10.1093/bioinformatics/btu031
Mulder, N. & Apweiler, R. InterPro and InterProScan: tools for protein sequence classification and comparison. Methods Mol Biol. 396, 59–70 (2007).
pubmed: 18025686 doi: 10.1007/978-1-59745-515-2_5
Ashburner, M. et al. Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
pubmed: 10802651 pmcid: 3037419 doi: 10.1038/75556
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676 (2005).
pubmed: 16081474 doi: 10.1093/bioinformatics/bti610
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
pubmed: 10592173 pmcid: 102409 doi: 10.1093/nar/28.1.27
Zheng, Y. et al. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol. Plant 9, 1667–1670 (2016).
pubmed: 27717919 doi: 10.1016/j.molp.2016.09.014
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP465516 (2023).
Peng, H.-X. Haplotype-resolved genome assembly and annotation of Malus hybrid cultivar Flame and Malus hybrid cultivar Royalty. figshare https://doi.org/10.6084/m9.figshare.24276916.v1 (2023).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036218565.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036220445.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036320615.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036320635.1 (2024).

Auteurs

Hua Li (H)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Xuyang Zhai (X)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Haixu Peng (H)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

You Qing (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Yulin Deng (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Shijie Zhou (S)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Tairui Bei (T)

Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China.

Ji Tian (J)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Jie Zhang (J)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Yujing Hu (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Xiaoxiao Qin (X)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Yanfen Lu (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Yuncong Yao (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China.

Sen Wang (S)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China. wangsen@moilab.net.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China. wangsen@moilab.net.

Yi Zheng (Y)

Beijing Key Laboratory for Agriculture Application and New Technique, College of Plant Science and Technology, Beijing University of Agriculture, Beijing, 102206, China. yz@moilab.net.
Bioinformatics Center, Beijing University of Agriculture, Beijing, 102206, China. yz@moilab.net.

Articles similaires

Fragaria Light Plant Leaves Osmosis Stress, Physiological
Genome Size Genome, Plant Magnoliopsida Evolution, Molecular Arabidopsis
Humans Flavonoids Female Apoptosis Granulosa Cells

Classifications MeSH