Epigenetic patterns within the haplotype phased fig (Ficus carica L.) genome.
Ficus carica L.
N4-methylcytosine
N6-methyladenine
genome assembly
single-molecule real-time sequencing
Journal
The Plant journal : for cell and molecular biology
ISSN: 1365-313X
Titre abrégé: Plant J
Pays: England
ID NLM: 9207397
Informations de publication
Date de publication:
05 2020
05 2020
Historique:
received:
22
06
2019
revised:
13
11
2019
accepted:
26
11
2019
pubmed:
7
12
2019
medline:
2
2
2021
entrez:
7
12
2019
Statut:
ppublish
Résumé
Due to DNA heterozygosity and repeat content, assembly of non-model plant genomes is challenging. Herein, we report a high-quality genome reference of one of the oldest known domesticated species, fig (Ficus carica L.), using Pacific Biosciences single-molecule, real-time sequencing. The fig genome is ~333 Mbp in size, of which 80% has been anchored to 13 chromosomes. Genome-wide analysis of N
Substances chimiques
N-methyladenosine
CLE6G00625
Adenosine
K72T3FS567
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
600-614Informations de copyright
© 2019 The Authors The Plant Journal © 2019 John Wiley & Sons Ltd.
Références
Al-Dous, E.K., George, B., Al-Mahmoud, M.E. et al. (2011) De novo genome sequencing and comparative genomics of date palm (Phoenix dactylifera). Nat. Biotechnol. 29, 521.
Ansorge, W.J. (2016) Next-generation DNA sequencing (II): techniques, applications. Next Gener. Seq. Appl. 1, 1-10.
Bierne, H., Hamon, M. and Cossart, P. (2012) Epigenetics and bacterial infections. Cold Spring Harb. Perspect. Med. 2, a010272.
Bolger, A.M., Lohse, M. and Usadel, B. (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30, 2114-2120.
Boratyn, G.M., Camacho, C., Cooper, P.S. et al. (2013) BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 41, W29-W33.
Bradnam, K.R., Fass, J.N., Alexandrov, A. et al. (2013) Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. GigaScience, 2, 10.
Buti, M., Moretto, M., Barghini, E. et al. (2017) The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry). GigaScience, 7, giy010.
Chaisson, M.J. and Tesler, G. (2012) Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics, 13, 238.
Chin, C.S., Alexander, D.H., Marks, P. et al. (2013) Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods, 10, 563.
Chin, C.S., Peluso, P., Sedlazeck, F.J. et al. (2016) Phased diploid genome assembly with single-molecule real-time sequencing. Nat. Methods, 13, 1050.
Conesa, A., Götz, S., García-Gómez, J.M., Terol, J., Talón, M. and Robles, M. (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics, 21, 3674-3676.
Daccord, N., Celton, J.M., Linsmith, G. et al. (2017) High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development. Nature Genet. 49, 1099.
Diez, C.M., Roessler, K. and Gaut, B.S. (2014) Epigenetics and plant genome evolution. Curr. Opin. Plant. Biol. 18, 1-8.
Dong, A.X., Xin, H.B., Li, Z.J. et al. (2018) High-quality assembly of the reference genome for scarlet sage, Salvia splendens, an economically important ornamental plant. GigaScience, 7, giy068.
Eddy, S.R. (1998) Profile hidden Markov models. Bioinformatics, 14, 755-763.
Ellinghaus, D., Kurtz, S. and Willhoeft, U. (2008) LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics, 9, 18.
Feng, S., Cokus, S.J., Zhang, X. et al. (2010) Conservation and divergence of methylation patterning in plants and animals. Proc. Natl Acad. Sci. USA, 107, 8689-8694.
Fu, Y., Luo, G.Z., Chen, K. et al. (2015) N6-methyldeoxyadenosine marks active transcription start sites in Chlamydomonas. Cell, 161, 879-892.
Garrison, E. and Marth, G. (2012) Haplotype-based variant detection from short-read sequencing. arXiv, 1207.3907.
Grabherr, M.G., Haas, B.J., Yassour, M. et al. (2011) Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nature Biotechnol. 29, 644.
Han, Y. and Wessler, S.R. (2010) MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences. Nucleic Acids Res. 38, e199.
Holt, C. and Yandell, M. (2011) MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics, 12, 491.
Huson, D.H., Auch, A.F., Qi, J. and Schuster, S.C. (2007) MEGAN analysis of metagenomic data. Genome Res. 17, 377-386.
Jones, P., Binns, D., Chang, H.Y. et al. (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics, 30, 1236-1240.
Kalvari, I., Argasinska, J., Quinones-Olvera, N., Nawrocki, E.P., Rivas, E., Eddy, S.R., Bateman, A., Finn, R.D. and Petrov, A.I. (2017) Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families. Nucleic Acids Res. 46, D335-D342.
Kearse, M., Moir, R., Wilson, A. et al. (2012) Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics, 28, 1647-1649.
Kislev, M.E., Hartmann, A. and Bar-Yosef, O. (2006) Early domesticated fig in the Jordan Valley. Science, 312, 1372-1374.
Koren, S., Walenz, B.P., Berlin, K., Miller, J.R., Bergman, N.H. and Phillippy, A.M. (2017) Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722-736.
Korlach, J., Gedman, G., Kingan, S.B., Chin, C.S., Howard, J.T., Audet, J.N., Cantin, L. and Jarvis, E.D. (2017) De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads. GigaScience, 6, 1-16.
Koziol, M.J., Bradshaw, C.R., Allen, G.E., Costa, A.S., Frezza, C. and Gurdon, J.B. (2016) Identification of methylated deoxyadenosines in vertebrates reveals diversity in DNA modifications. Nat. Struct. Mol. Biol. 23, 24.
Kronenberg, Z.N., Hall, R.J., Hiendleder, S., Smith, T.P., Sullivan, S.T., Williams, J.L. and Kingan, S.B. (2018) FALCON-phase: integrating PacBio and Hi-C data for phased diploid genomes. bioRxiv [Preprint]. https://doi.org/10.1101/327064.
Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., Jones, S.J. and Marra, M.A. (2009) Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639-1645.
Kurtz, S., Phillippy, A., Delcher, A.L., Smoot, M., Shumway, M., Antonescu, C. and Salzberg, S.L. (2004) Versatile and open software for comparing large genomes. Genome Biol. 5, R12.
Lam, K.K., LaButti, K., Khalak, A. and Tse, D. (2015) FinisherSC: a repeat-aware tool for upgrading de novo assembly using long reads. Bioinformatics, 31, 3207-3209.
Li, H. and Durbin, R. (2010) Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics, 26, 589-595.
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G. and Durbin, R. (2009) The sequence alignment/map format and SAMtools. Bioinformatics, 25, 2078-2079.
Liang, Z., Shen, L., Cui, X. et al. (2018) DNA N6-adenine methylation in Arabidopsis thaliana. Dev. Cell, 45, 406-416.
Liu, M.J., Zhao, J., Cai, Q.L. et al. (2014) The complex jujube genome provides insights into fruit tree biology. Nat. Commun. 5, 5315.
Loureiro, J., Rodriguez, E., Doležel, J. and Santos, C. (2007) Two new nuclear isolation buffers for plant DNA flow cytometry: a test with 37 species. Ann. Bot. 100, 875-888.
Low, W.Y., Tearle, R., Bickhart, D.M. et al. (2019) Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity. Nat. Commun. 10, 260.
Lowe, T.M. and Chan, P.P. (2016) tRNAscan-SE On-line: integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 44, W54-W57.
Machanick, P. and Bailey, T.L. (2011) MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics, 27, 1696-1697.
Mafrica, R., Marchese, A., Bruno, M., Costa, F., Fretto, S., Marra, F.P., Pangallo, S., Quartararo, A. and Caruso, T. (2015) Morphological and molecular variability within the fig cultivar 'Dottato' in the Italian protected designation origin area “Fichi di Cosenza”. Acta Hortic. 1173, 29-34.
Mao, H. and Wang, H. (2017) SINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets. Bioinformatics, 33, 743-745.
Mascagni, F., Cavallini, A., Giordani, T. and Natali, L. (2017) Different histories of two highly variable LTR retrotransposons in sunflower species. Gene, 634, 5-14.
Mascagni, F., Vangelisti, A., Giordani, T., Cavallini, A. and Natali, L. (2018) Specific LTR-retrotransposons show copy number variations between wild and cultivated sunflowers. Genes, 9, 433.
Mayer, C. (2006-2010) Phobos 3.3.11. Available from: https://www.rub.de/ecoevo/cm/cm_phobos.htm.
Meuwissen, T., Hayes, B. and Goddard, M. (2013) Accelerating improvement of livestock with genomic selection. Annu. Rev. Anim. Biosci. 1, 221-237.
Ming, R., Hou, S., Feng, Y. et al. (2008) The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature, 452, 991.
Mori, K., Shirasawa, K., Nogata, H., Hirata, C., Tashiro, K., Habu, T., Kim, S., Himeno, S., Kuhara, S. and Ikegami, H. (2017) Identification of RAN1 orthologue associated with sex determination through whole genome sequencing analysis in fig (Ficus carica L.). Sci. Rep. 7, 41124.
Murray, I.A., Clark, T.A., Morgan, R.D., Boitano, M., Anton, B.P., Luong, K., Fomenkov, A., Turner, S.W., Korlach, J. and Roberts, R.J. (2012) The methylomes of six bacteria. Nucleic Acids Res. 40, 11450-11462.
Neumann, P., Navrátilová, A., Koblížková, A., Kejnovský, E., Hřibová, E., Hobza, R., Widmer, A., Doležel, J. and Macas, J. (2011) Plant centromeric retrotransposons: a structural and cytogenetic perspective. Mob. DNA, 2, 4.
Quinlan, A.R. and Hall, I.M. (2010) BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics, 26, 841-842.
Reinders, J., Wulff, B.B., Mirouze, M., Marí-Ordóñez, A., Dapp, M., Rozhon, W., Bucher, E., Theiler, G. and Paszkowski, J. (2009) Compromised stability of DNA methylation and transposon immobilization in mosaic Arabidopsis epigenomes. Genes Dev. 23, 939-950.
Rhoads, A. and Au, K.F. (2015) PacBio sequencing and its applications. Genomics Proteomics Bioinformatics, 13, 278-289.
Schwessinger, B., Sperschneider, J., Cuddy, W.S., Garnica, D.P., Miller, M.E., Taylor, J.M., Dodds, P.N., Figueroa, M., Park, R.F. and Rathjen, J.P. (2018) A near-complete haplotype-phased genome of the dikaryotic wheat stripe rust fungus Puccinia striiformis f. sp. tritici reveals high interhaplotype diversity. Am. Soc. Microbiol. 9, e02275-17.
Shirasawa, K., Esumi, T., Hirakawa, H., Tanaka, H., Itai, A., Ghelfi, A., Nagasaki, H. and Isobe, S. (2019) Phased genome sequence of an interspecific hybrid flowering cherry, Somei-Yoshino (Cerasus × yedoensis). DNA Res. 26, 379-389.
Simão, F.A., Waterhouse, R.M., Ioannidis, P., Kriventseva, E.V. and Zdobnov, E.M. (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics, 31, 3210-3212.
Smit, A.F.A., Hubley, R. and Green, P. (2013-2015) RepeatMasker Open-4.0. Available from: http://www.repeatmasker.org.
Smit, A.F.A. and Hubley, R. (2008) RepeatModeler Open-1.0. Available from: https://www.repeatmasker.org/RepeatModeler/.
Solomon, A., Golubowicz, S., Yablowicz, Z., Grossman, S., Bergman, M., Gottlieb, H.E., Altman, A., Kerem, Z. and Flaishman, M.A. (2006) Antioxidant activities and anthocyanin content of fresh fruits of common fig (Ficus carica L.). J. Agric. Food Chem. 54, 7717-7723.
Stanke, M., Keller, O., Gunduz, I., Hayes, A., Waack, S. and Morgenstern, B. (2006) AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435-W439.
Steinbiss, S., Willhoeft, U., Gremme, G. and Kurtz, S. (2009) Fine-grained annotation and classification of de novo predicted LTR retrotransposons. Nucleic Acids Res. 37, 7002-7013.
Teh, B.T., Lim, K., Yong, C.H. et al. (2017) The draft genome of tropical fruit durian (Durio zibethinus). Nat. Genet. 49, 1633.
Ter-Hovhannisyan, V., Lomsadze, A., Chernoff, Y.O. and Borodovsky, M. (2008) Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Res. 18, 1979-1990.
Tuskan, G.A., Difazio, S., Jansson, S. et al. (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science, 313, 1596-1604.
Vangelisti, A., Zambrano, L.S., Caruso, G. et al. (2019) How an ancient, salt-tolerant fruit crop, Ficus carica L., copes with salinity: a transcriptome analysis. Sci. Rep. 9, 2561.
Varshney, R.K., Chen, W., Li, Y. et al. (2012) Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat. Biotechnol. 30, 83.
Veberic, R., Colaric, M. and Stampar, F. (2008) Phenolic acids and flavonoids of fig fruit (Ficus carica L.) in the northern Mediterranean region. Food Chem. 106, 153-157.
Veeckman, E., Ruttink, T. and Vandepoele, K. (2016) Are we there yet? Reliably estimating the completeness of plant genome sequences. Plant Cell, 28, 1759-1768.
Vinson, J.A., Zubik, L., Bose, P., Samman, N. and Proch, J. (2005) Dried fruits: excellent in vitro and in vivo antioxidants. J. Am. Coll. Nutr. 24, 44-50.
Vogel, A., Schwacke, R., Denton, A.K. et al. (2018) Footprints of parasitism in the genome of the parasitic flowering plant Cuscuta campestris. Nat. Commun. 9, 2515.
Walker, B.J., Abeel, T., Shea, T. et al. (2014) Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One, 9, e112963.
Wicker, T., Sabot, F., Hua-Van, A. et al. (2007) A unified classification system for eukaryotic transposable elements. Nature. Rev. Genet. 8, 973.
Wu, J., Wang, Z., Shi, Z. et al. (2013) The genome of the pear (Pyrus bretschneideri Rehd.). Genome Res. 23, 396-408.
Wu, T.P., Wang, T., Seetin, M.G. et al. (2016) DNA methylation on N 6-adenine in mammalian embryonic stem cells. Nature, 532, 329.
Xiong, W., He, L., Lai, J., Dooner, H.K. and Du, C. (2014) HelitronScanner uncovers a large overlooked cache of Helitron transposons in many plant genomes. Proc. Natl Acad. Sci. USA, 111, 10263-10268.
Ye, C., Ma, Z.S., Cannon, C.H., Pop, M. and Douglas, W.Y. (2012) Exploiting sparseness in de novo genome assembly. BMC Bioinformatics, 13, S1.
Ye, C., Hill, C.M., Wu, S., Ruan, J. and Ma, Z.S. (2016) DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Sci. Rep. 6, 31900.
Ye, G., Zhang, H., Chen, B., Nie, S., Liu, H., Gao, W., Wang, H., Gao, Y. and Gu, L. (2019) De novo genome assembly of the stress tolerant forest species Casuarina equisetifolia provides insight into secondary growth. Plant J. 97, 779-794.
Zambrano, L.S., Usai, G., Vangelisti, A. et al. (2017) Cultivar-specific transcriptome prediction and annotation in Ficus carica L. Genom. Data, 13, 64-66.
Zhang, Q., Chen, W., Sun, L. et al. (2012) The genome of Prunus mume. Nat. Commun. 3, 1318.
Zhang, W., Spector, T.D., Deloukas, P., Bell, J.T. and Engelhardt, B.E. (2015) Predicting genome-wide DNA methylation using methylation marks, genomic position, and DNA regulatory elements. Genome Biol. 16, 14.
Zhao, Q., Yang, J., Liu, J. et al. (2018) A draft reference genome sequence for Scutellaria baicalensis Georgi. bioRxiv [Preprint]. https://doi.org/10.1101/398032
Zhou, C., Wang, C., Liu, H. et al. (2018) Identification and analysis of adenine N 6-methylation sites in the rice genome. Nat. Plants, 4, 554.