Chromosomal level genome assemblies of two Malus crabapple cultivars Flame and Royalty.
Journal
Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192
Informations de publication
Date de publication:
13 Feb 2024
13 Feb 2024
Historique:
received:
07
11
2023
accepted:
05
02
2024
medline:
15
2
2024
pubmed:
14
2
2024
entrez:
13
2
2024
Statut:
epublish
Résumé
Malus hybrid 'Flame' and Malus hybrid 'Royalty' are representative ornamental crabapples, rich in flavonoids and serving as the preferred materials for studying the coloration mechanism. We generated two sets of high-quality chromosome-level and haplotype-resolved genome of 'Flame' with sizes of 688.2 Mb and 675.7 Mb, and those of 'Royalty' with sizes of 674.1 Mb and 663.6 Mb, all anchored to 17 chromosomes and with a high BUSCO completeness score nearly 99.0%. A total of 47,833 and 47,307 protein-coding genes were annotated in the two haplotype genomes of 'Flame', and the numbers of 'Royalty' were 46,305 and 46,920 individually. The assembled high-quality genomes offer new resources for studying the origin and adaptive evolution of crabapples and the molecular basis of the accumulation of flavonoids and anthocyanins, facilitating molecular breeding of Malus plants.
Identifiants
pubmed: 38351118
doi: 10.1038/s41597-024-03049-x
pii: 10.1038/s41597-024-03049-x
pmc: PMC10864326
doi:
Substances chimiques
Anthocyanins
0
Flavonoids
0
Types de publication
Dataset
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
201Subventions
Organisme : Beijing University of Agriculture (BUA)
ID : Start-up fund
Informations de copyright
© 2024. The Author(s).
Références
Wang, Z., Wang, W., Zhang, J., Song, T. & Yao, Y. Genetic diversity and phylogenetic relationships analysis of major ornamental crabapple species. Journal of Fruit Science 31, 1005–1016 (2014).
Tian, J. et al. The Balance of Expression of Dihydroflavonol 4-reductase and Flavonol Synthase Regulates Flavonoid Biosynthesis and Red Foliage Coloration in Crabapples. Sci Rep 5, 12228 (2015).
pubmed: 26192267
pmcid: 4507444
doi: 10.1038/srep12228
Li, H. et al. MdMYB8 is associated with flavonol biosynthesis via the activation of the MdFLS promoter in the fruits of Malus crabapple. Hort. Res. 7 (2020).
He, X. & Liu, R. H. Phytochemicals of apple peels: isolation, structure elucidation, and their antiproliferative and antioxidant activities. J Agr Food Chem 56, 9905–9910 (2008).
doi: 10.1021/jf8015255
Boyer, J. & Liu, R. H. Apple phytochemicals and their health benefits. Nutr. J. 3, 1–15 (2004).
doi: 10.1186/1475-2891-3-5
Lu, Y. et al. Flavonoid accumulation plays an important role in the rust resistance of Malus plant leaves. Front Plant Sci 8, 1286 (2017).
pubmed: 28769974
pmcid: 5514348
doi: 10.3389/fpls.2017.01286
Liu, F., Wang, M. & Wang, M. Phenolic compounds and antioxidant activities of flowers, leaves and fruits of five crabapple cultivars (Malus Mill. species). Sci. Hortic. 235, 460–467 (2018).
doi: 10.1016/j.scienta.2018.02.051
Wang, Y.-R. et al. Different coloration patterns between the red-and white-fleshed fruits of malus crabapples. Sci. Hortic. 194, 26–33 (2015).
doi: 10.1016/j.scienta.2015.07.041
Jiang, R., Tian, J., Song, T., Zhang, J. & Yao, Y. The Malus crabapple transcription factor McMYB10 regulates anthocyanin biosynthesis during petal coloration. Sci. Hortic. 166, 42–49 (2014).
doi: 10.1016/j.scienta.2013.12.002
Tian, J. et al. Mc MYB 10 regulates coloration via activating McF3’H and later structural genes in ever‐red leaf crabapple. Plant Biotechnol. J. 13, 948–961 (2015).
pubmed: 25641214
doi: 10.1111/pbi.12331
Tian, J. et al. Characteristics of dihydroflavonol 4-reductase gene promoters from different leaf colored Malus crabapple cultivars. Hort. Res. 4 (2017).
Li, K.-T. et al. McMYB10 modulates the expression of a Ubiquitin Ligase, McCOP1 during leaf coloration in crabapple. Front Plant Sci 9, 704 (2018).
pubmed: 29915606
pmcid: 5994411
doi: 10.3389/fpls.2018.00704
Tian, J. et al. McMYB12 transcription factors co-regulate proanthocyanidin and anthocyanin biosynthesis in Malus crabapple. Sci. Rep. 7, 43715 (2017).
pubmed: 28255171
pmcid: 5334656
doi: 10.1038/srep43715
Tai, D., Tian, J., Zhang, J., Song, T. & Yao, Y. A Malus crabapple chalcone synthase gene, McCHS, regulates red petal color and flavonoid biosynthesis. PLoS One 9, e110570 (2014).
pubmed: 25357207
pmcid: 4214706
doi: 10.1371/journal.pone.0110570
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
pubmed: 19815776
pmcid: 2858594
doi: 10.1126/science.1181369
Marcais, G. & Kingsford, C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27, 764-770 (2011).
pubmed: 21217122
pmcid: 3051319
doi: 10.1093/bioinformatics/btr011
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204 (2017).
pubmed: 28369201
pmcid: 5870704
doi: 10.1093/bioinformatics/btx153
Cheng, H. et al. Haplotype-resolved assembly of diploid genomes without parental data. Nat Biotechnol. Nat. Biotechnol 40, 1332–1335 (2022).
pubmed: 35332338
doi: 10.1038/s41587-022-01261-x
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
pubmed: 27467249
pmcid: 5846465
doi: 10.1016/j.cels.2016.07.002
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
pubmed: 28336562
pmcid: 5635820
doi: 10.1126/science.aal3327
Robinson, J. T. et al. Juicebox. js provides a cloud-based visualization system for Hi-C data. Cell Syst. 6, 256–258. e251 (2018).
pubmed: 29428417
pmcid: 6047755
doi: 10.1016/j.cels.2018.01.001
Alonge, M. et al. Automated assembly scaffolding using RagTag elevates a new tomato system for high-throughput genome editing. Genome Biol. 23, 258 (2022).
pubmed: 36522651
pmcid: 9753292
doi: 10.1186/s13059-022-02823-7
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
pubmed: 9862982
pmcid: 148217
doi: 10.1093/nar/27.2.573
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
pubmed: 16093699
doi: 10.1159/000084979
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21, i351–i358 (2005).
pubmed: 15961478
doi: 10.1093/bioinformatics/bti1018
Zhang, L. et al. A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat. Commun. 10, 1494 (2019).
pubmed: 30940818
pmcid: 6445120
doi: 10.1038/s41467-019-09518-x
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP200472 (2019).
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP200468 (2019).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
pubmed: 31375807
pmcid: 7605509
doi: 10.1038/s41587-019-0201-4
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
pubmed: 25690850
pmcid: 4643835
doi: 10.1038/nbt.3122
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 6, 31 (2005).
doi: 10.1186/1471-2105-6-31
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–W439 (2006).
pubmed: 16845043
pmcid: 1538822
doi: 10.1093/nar/gkl200
Brůna, T., Hoff, K. J., Lomsadze, A., Stanke, M. & Borodovsky, M. BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR genomics and bioinformatics 3, lqaa108 (2021).
pubmed: 33575650
pmcid: 7787252
doi: 10.1093/nargab/lqaa108
Holt, C. & Yandell, M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinform. 12, 491 (2011).
doi: 10.1186/1471-2105-12-491
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717
doi: 10.1093/bioinformatics/btv351
Buchfink, B., Reuter, K. & Drost, H.-G. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nat. Methods 18, 366–368 (2021).
pubmed: 33828273
pmcid: 8026399
doi: 10.1038/s41592-021-01101-x
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
pubmed: 26553804
doi: 10.1093/nar/gkv1189
Boeckmann, B. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31, 365–370 (2003).
pubmed: 12520024
pmcid: 165542
doi: 10.1093/nar/gkg095
Consortium, U. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 47, D506–D515 (2019).
doi: 10.1093/nar/gky1049
Mergner, J. et al. Mass-spectrometry-based draft of the Arabidopsis proteome. Nature 579, 409–414 (2020).
pubmed: 32188942
doi: 10.1038/s41586-020-2094-2
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
pubmed: 24451626
pmcid: 3998142
doi: 10.1093/bioinformatics/btu031
Mulder, N. & Apweiler, R. InterPro and InterProScan: tools for protein sequence classification and comparison. Methods Mol Biol. 396, 59–70 (2007).
pubmed: 18025686
doi: 10.1007/978-1-59745-515-2_5
Ashburner, M. et al. Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
pubmed: 10802651
pmcid: 3037419
doi: 10.1038/75556
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676 (2005).
pubmed: 16081474
doi: 10.1093/bioinformatics/bti610
Kanehisa, M. & Goto, S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000).
pubmed: 10592173
pmcid: 102409
doi: 10.1093/nar/28.1.27
Zheng, Y. et al. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol. Plant 9, 1667–1670 (2016).
pubmed: 27717919
doi: 10.1016/j.molp.2016.09.014
NCBI Sequence Read Archive https://identifiers.org/ncbi/insdc.sra:SRP465516 (2023).
Peng, H.-X. Haplotype-resolved genome assembly and annotation of Malus hybrid cultivar Flame and Malus hybrid cultivar Royalty. figshare https://doi.org/10.6084/m9.figshare.24276916.v1 (2023).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036218565.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036220445.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036320615.1 (2024).
NCBI GenBank https://identifiers.org/ncbi/insdc.gca:GCA_036320635.1 (2024).