Genome-Wide SNP discovery and genomic characterization in avocado (Persea americana Mill.).
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
27 12 2019
27 12 2019
Historique:
received:
14
05
2019
accepted:
13
12
2019
entrez:
29
12
2019
pubmed:
29
12
2019
medline:
18
11
2020
Statut:
epublish
Résumé
Modern crop breeding is based on the use of genetically and phenotypically diverse plant material and, consequently, a proper understanding of population structure and genetic diversity is essential for the effective development of breeding programs. An example is avocado, a woody perennial fruit crop native to Mesoamerica with an increasing popularity worldwide. Despite its commercial success, there are important gaps in the molecular tools available to support on-going avocado breeding programs. In order to fill this gap, in this study, an avocado 'Hass' draft assembly was developed and used as reference to study 71 avocado accessions which represent the three traditionally recognized avocado horticultural races or subspecies (Mexican, Guatemalan and West Indian). An average of 5.72 M reads per individual and a total of 7,108 single nucleotide polymorphism (SNP) markers were produced for the 71 accessions analyzed. These molecular markers were used in a study of genetic diversity and population structure. The results broadly separate the accessions studied according to their botanical race in four main groups: Mexican, Guatemalan, West Indian and an additional group of Guatemalan × Mexican hybrids. The high number of SNP markers developed in this study will be a useful genomic resource for the avocado community.
Identifiants
pubmed: 31882769
doi: 10.1038/s41598-019-56526-4
pii: 10.1038/s41598-019-56526-4
pmc: PMC6934854
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
20137Commentaires et corrections
Type : ErratumIn
Références
Chase, M. W. et al. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181(1), 1–20 (2016).
doi: 10.1111/boj.12385
Schaffer, B., Wolstenholme, B. N. & Wiley, A. W. Introduction in The Avocado: Botany, Production, and Uses. (eds. Schaffer, B., Wolstenholme, B. N & Whiley, A. W.) 1–9 (CABI, Wallingford, UK, 2013).
FAO. Statistics Division of Food and Agriculture Organization of the United Nations (FAOSTAT) http://www.fao.org/faostat/es/#data/QC (Accessed September 13th 2019).
Crane, J. H. et al. Cultivars and rootstocks in The Avocado: Botany, Production, and Uses (eds. Schaffer, B., Wolstenholme, B. N & Whiley, A. W.) 1–9 (CABI, Wallingford, UK, 2013).
Lavi, U., Hillel, J. & Vainstein, A. Application of DNA fingerprints for identification and genetic analysis of avocado. J. Am. Soc. Hort. Sci. 116, 1078–1081 (1991).
doi: 10.21273/JASHS.116.6.1078
Mhameed, S. et al. Level of heterozygosity and mode of inheritance of variable number of tandem repeat loci in avocado. J. Am. Soc. Hort. Sci. 121, 778–782 (1996).
doi: 10.21273/JASHS.121.5.768
Fiedler, J., Bufler, G. & Bangerth, F. Genetic relationships of avocado (Persea americana Mill.) using RAPD markers. Euphytica 101, 249–255 (1998).
doi: 10.1023/A:1018321928400
Furnier, G. R., Cummings, M. P. & Clegg, M. T. Evolution of the avocados as revealed by DNA restriction site variation. J. Hered. 81, 183–188 (1990).
doi: 10.1093/oxfordjournals.jhered.a110963
Davis, J., Henderson, D., Kobayashi, M., Clegg, M. T. & Clegg, M. T. Genealogical relationships among cultivated avocado as revealed through RFLP analysis. J. Hered. 89, 319–323 (1998).
doi: 10.1093/jhered/89.4.319
Sharon, D. et al. An integrated genetic linkage map of avocado. Theor. Appl. Genet. 95, 911–921 (1997).
doi: 10.1007/s001220050642
Schnell, R. J. et al. Evaluation of avocado germplasm using microsatellite markers. J. Am. Soc. Hort. Sci. 128, 881–889 (2003).
doi: 10.21273/JASHS.128.6.0881
Ashworth, V. E. T. M. & Clegg, M. T. Microsatellite markers in avocado (Persea americana Mill.): genealogical relationships among cultivated avocado genotypes. J. Hered. 94, 407–415 (2003).
pubmed: 14557394
doi: 10.1093/jhered/esg076
Ashworth, V. E. T. M., Kobayashi, M. C., De La Cruz, M. & Clegg, M. T. Microsatellite markers in avocado (Persea americana Mill.): development of dinucleotide and trinucleotide markers. Sci. Hortic. 101, 255–267 (2004).
doi: 10.1016/j.scienta.2003.11.008
Borrone, W. J., Schnell, R. J., Viola, H. A. & Ploetz, R. C. Seventy microsatellite markers from Persea americana Miller (avocado) expressed sequences tags. Mol. Ecol. Notes 7, 439–444 (2007).
doi: 10.1111/j.1471-8286.2006.01611.x
Alcaraz, M. L. & Hormaza, J. I. Molecular characterization and genetic diversity in an avocado collection of cultivars and local Spanish genotypes using SSRs. Hereditas 144, 244–253 (2007).
pubmed: 18215247
doi: 10.1111/j.2007.0018-0661.02019x
Gross-German, E. & Viruel, M. A. Molecular characterization of avocado germplasm with a new set of SSR and EST-SSR markers: genetic diversity, population structure, and identification of race-specific markers in a group of cultivated genotypes. Tree Genet. Genomes 9, 539–555 (2013).
doi: 10.1007/s11295-012-0577-5
Guzmán, L. F. et al. Genetic structure and selection of a core collection for long term conservation of avocado in Mexico. Front. Plant. Sci. 8, 243, https://doi.org/10.3389/fpls.2017.00243 (2017).
doi: 10.3389/fpls.2017.00243
pubmed: 28286510
pmcid: 5323459
Boza, J. E. et al. Genetic differentiation, races and interracial admixture in avocado (Persea americana Mill.), and Persea spp. evaluated using SSR markers. Genet. Resour. Crop. Ev. 65, 1195–1215 (2018).
doi: 10.1007/s10722-018-0608-7
Ge, Y. et al. Transcriptome sequencing of different avocado ecotypes: de novo transcriptome assembly, annotation, identification and validation of EST-SSR Markers. Forests 10, 411, https://doi.org/10.3390/f10050411 (2019).
doi: 10.3390/f10050411
Ching, A. et al. SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genetics 3, 19, https://doi.org/10.1186/1471-2156-3-19 (2002).
doi: 10.1186/1471-2156-3-19
pubmed: 12366868
pmcid: 130040
Rasheed, A. et al. Crop breeding chips and genotyping plataforms: progress, challenge, and perspectives. Mol. Plant 10, 1047–1064 (2017).
pubmed: 28669791
doi: 10.1016/j.molp.2017.06.008
Scheben, A., Batley, J. & Edwards, D. Genotyping-by-sequencing approaches to characterize crop genomes: choosing the right tool for the right application. Plant Biotecnol. J. 15, 149–161 (2017).
doi: 10.1111/pbi.12645
Studer, B. & Kölliker, R. SNP Genotyping Technologies. In Diagnostics in Plant Breeding (eds. Lübberstedt, T. & Varshney, R. K.) (Springer Science + Business Media Dordrecht, 2013).
Chagné, D. et al. Development of a set of SNP markers present in expressed genes of the apple. Genomics 92, 353–358 (2008).
pubmed: 18721872
doi: 10.1016/j.ygeno.2008.07.008
Wang, B., Tan, H. W. & Fang, W. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm. Hortic. Res. 2, 14065, https://doi.org/10.1038/hortres.2014.65 (2015).
doi: 10.1038/hortres.2014.65
pubmed: 26504559
pmcid: 4595986
Ibarra-Laclette, E. et al. Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids. BMC Genomics 16, 599, https://doi.org/10.1186/s12864-015-1775-y (2015).
doi: 10.1186/s12864-015-1775-y
pubmed: 26268848
pmcid: 4533766
Vergara-Pulgar, C. et al. De novo assembly of Persea americana cv. “Hass“ transcriptome during fruit development. BCM Genomics 20, 108, https://doi.org/10.1186/s12864-019-5486-7 (2019).
doi: 10.1186/s12864-019-5486-7
Kuhn, D. N. et al. Application of genomic tools to avocado (Persea americana) breeding: SNP discovery for genotyping and germplasm characterization. Sci. Hortic. 246, 1–11 (2019).
doi: 10.1016/j.scienta.2018.10.011
Ge, Y. et al. Genome-wide assessment of avocado germplasm determined from Specific Length Amplified Fragment sequencing and transcriptomes: population structure, genetic diversity, identification, and application of race-specific markers. Genes 10, 215, https://doi.org/10.3390/genes10030215 (2019).
doi: 10.3390/genes10030215
pubmed: 30871275
pmcid: 6471495
Rubinstein, M. et al. Genetic diversity of avocado (Persea americana Mill.) germplasm using pooled sequencing. BMC Genomics 20, 379, https://doi.org/10.1186/s12864-019-5672-7 (2019).
doi: 10.1186/s12864-019-5672-7
pubmed: 31092188
pmcid: 6521498
Rendón-Anaya, M. et al. The avocado genome informs deep angiosperm phylogeny, highlights introgressive hybridization, and reveals pathogen-influenced gene space adaptation. PNAS 116, 17081–17089 (2019).
pubmed: 31387975
pmcid: 6708331
doi: 10.1073/pnas.1822129116
Wortman, J. R. et al. Annotation of the Arabidopsis genome. Plant Physiol. 132, 461–468 (2003).
pubmed: 12805579
pmcid: 166989
doi: 10.1104/pp.103.022251
Soorni, A., Fatahi, R., Salami, S. A., Haak, D. C. & Bombarely, A. Assessment of genetic diversity and population structure in Iranian cannabis germplasm. Sci Rep. 7, 15668, https://doi.org/10.1038/s41598-017-15816-5 (2017).
doi: 10.1038/s41598-017-15816-5
pubmed: 29142201
pmcid: 5688169
Shearman, J. R. et al. SNP identification from RNA sequencing and linkage map construction of rubber tree for anchoring the draft genome. PLoS. One 10, e0121961, https://doi.org/10.1371/journal.pone.0121961 (2015).
doi: 10.1371/journal.pone.0121961
pubmed: 25831195
pmcid: 4382108
Pootakham, W. et al. Genome-wide SNP discovery and identification of QTL associated with agronomic traits in oil palm using genotyping-by-sequencing (GBS). Genomics 105, 288–295 (2015).
pubmed: 25702931
doi: 10.1016/j.ygeno.2015.02.002
Prevosti, A., Ocaña, J. & Alonso, G. Distance between populations of Drosophila subobscura based on chromosome arrangement frequencies. Theor. Appl. Genet. 45, 231–241 (1975).
pubmed: 24419466
doi: 10.1007/BF00831894
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
pubmed: 19648217
pmcid: 2752134
doi: 10.1101/gr.094052.109
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
pubmed: 10835412
pmcid: 1461096
doi: 10.1093/genetics/155.2.945
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
doi: 10.1007/s12686-011-9548-7
Chen, H., Morrell, P. L., Ashworth, V. E. T. M. & Clegg, M. T. Tracing the geographic origins of major avocado cultivars. J. Hered. 100, 56–65 (2009).
pubmed: 18779226
doi: 10.1093/jhered/esn068
Variety Database of the Univ. of California at Riverside, http://ucavo.ucr.edu/ (Accessed September 13th 2019) (2019).
Lavi, U., Cregan, P. B. & Hillel, J. Application of DNA markers for identification and breeding of fruit trees. Plant Breed. Rev. 12, 195–226 (1994).
Chen, H., Morrell, P. L. & de la Cruz, M. Nucleotide diversity and linkage disequilibrium in wild avocado (Persea americana Mill.). J Hered. 99, 382–389 (2008).
pubmed: 18343895
doi: 10.1093/jhered/esn016
Catchen, J. M., Amores, A., Hohenlohe, P., Cresko, W. & Postlethwait, J. H. Stacks: Building and genotyping loci de novo from short-read sequences. G3-Genes Genom. Genet. 1, 171–182 (2011).
Lu, F. et al. Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol. PLoS. Genet. 9, e1003215, https://doi.org/10.1371/journal.pgen.1003215 (2013).
doi: 10.1371/journal.pgen.1003215
pubmed: 23349638
pmcid: 3547862
Melo, A. T. O., Bartaula, R. & Hale, L. GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data. BMC Bioinformatics 17, 29, https://doi.org/10.1186/s12859-016-0879-y (2016).
doi: 10.1186/s12859-016-0879-y
pubmed: 26754002
pmcid: 4709900
Leggett, R. M. & MacLean, D. Reference-free SNP detection: dealing with the data deluge. BMC Genomics 15, S10, https://doi.org/10.1186/1471-2164-15-S4-S10 (2014).
doi: 10.1186/1471-2164-15-S4-S10
pubmed: 25056481
pmcid: 4083407
Berthouly-Salazar, C. et al. Genotyping-by-Sequencing SNP identification for crops without a reference genome: using transcriptome based mapping as an alternative strategy. Front. Plant. Sci. 7, 777, https://doi.org/10.3389/fpls.2016.00777 (2016).
doi: 10.3389/fpls.2016.00777
pubmed: 27379109
pmcid: 4908121
Taranto, F., D´Agostino, N., Greco, B., Cardi, T. & Tripoli, P. Genome-wide SNP discovery and population structure analysis in pepper (Capsicum annum) using genotyping by sequencing. BMC Genomics 17, 943, https://doi.org/10.1186/s12864-016-3297-7 (2016).
doi: 10.1186/s12864-016-3297-7
pubmed: 27871227
pmcid: 5117568
Pootakham, W. et al. Construction of high-density integrated genetic linkage map of rubber tree (Hevea brasiliensis) using genotyping-by-sequencing (GBS). Genomics 6, 367, https://doi.org/10.3389/fpls.2015.00367 (2015).
doi: 10.3389/fpls.2015.00367
Kujur, A. et al. Employing genome-wide SNP discovery and genotyping strategy to extrapolate the natural allelic diversity and domestication patterns in chickpea. Front. Plant. Sci. 6, 162, https://doi.org/10.3389/fpls.2015.00162 (2015).
doi: 10.3389/fpls.2015.00162
pubmed: 25873920
pmcid: 4379880
Micheletti, D. et al. Whole-Genome Analysis of diversity and SNP-major gene association in peach germplasm. Plant. Genome 5, 92–102 (2015).
Helyar, S. J. et al. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges. Mol. Ecol. Resour. 1, 123–36 (2011).
doi: 10.1111/j.1755-0998.2010.02943.x
Aranzana, M. J., Illa, E., Howad, W. & Arús, P. A first insight into peach [Prunus persica (L.) Batsch] SNP variability. Tree Genet. Genomes 8, 1359–1369 (2012).
doi: 10.1007/s11295-012-0523-6
Biton, I. et al. Development of a large set of SNP markers for assessing phylogenetic relationships between the olive cultivars composing the Israel olive germplasm collection. Mol. Breed. 35, 107 (2015).
doi: 10.1007/s11032-015-0304-7
Liu, W. et al. Identifying litchi (Litchi chinensis Sonn.) cultivars and their genetic relationships using single nucleotide polymorphism (SNP) markers. PLoS. One 10, e0135390, https://doi.org/10.1371/journal.pone.0135390 (2015).
doi: 10.1371/journal.pone.0135390
pubmed: 26261993
pmcid: 4532366
Chanderbali, A. S., Soltis, D. E.,Soltis, P. S. & Wolstenholme, B. N. Taxonomy and botany in The Avocado: Botany, Production, and Uses. (eds. Schaffer, B., Wolstenholme, B. N & Whiley, A. W.) 32–50 (CABI, Wallingford, UK, 2013).
Söderquist, P. et al. Admixture between released and wild game birds: a changing genetic landscape in European mallards (Anas platyrhynchos). Eur. J. Wildl. Res. 63, 98, https://doi.org/10.1007/s10344-017-1156-8 (2017).
doi: 10.1007/s10344-017-1156-8
Frosch, C. et al. The genetic legacy of multiple beaver reintroductions in Central Europe. PLoS. One 9, e97619, https://doi.org/10.1371/journal.pone.0097619 (2014).
doi: 10.1371/journal.pone.0097619
pubmed: 24827835
pmcid: 4020922
Sonah, H. et al. An improved genotyping by sequencing (GBS) approach offering increased versatility and efficiency of SNP discovery and genotyping. PLoS. One 8, e54603, https://doi.org/10.1371/journal.pone.0054603 (2013).
doi: 10.1371/journal.pone.0054603
pubmed: 23372741
pmcid: 3553054
Herten, K., Hestand, M. S., Vermeesch, J. R. & Van Houdt, J. K. J. GBSX: a toolkit for experimental design and demultiplexing genotyping by sequencing experiments. BMC Bioinformatics 16, 73, https://doi.org/10.1186/s12859-015-0514-3 (2015).
doi: 10.1186/s12859-015-0514-3
pubmed: 25887893
pmcid: 4359581
Aronesty, E. Comparison of sequencing utility programs. Open Bioinforma. J. 7, 1–8, https://doi.org/10.2174/1875036201307010001 (2013).
doi: 10.2174/1875036201307010001
Liu, B. et al. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. Preprint at, https://arxiv.org/abs/1308.2012 (2013).
Vurture, G. W. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics 33, 2202–2204, https://doi.org/10.1093/bioinformatics/btx153 (2017).
doi: 10.1093/bioinformatics/btx153
pubmed: 28369201
pmcid: 5870704
Chikhi, R. & Rizk, G. Space-efficient and exact de Bruijn graph representation based on a Bloom filter. Algorithm. Mol. Biol. 8, 22, https://doi.org/10.1186/1748-7188-8-22 (2013).
doi: 10.1186/1748-7188-8-22
Luo, R. B. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18, https://doi.org/10.1186/2047-217x-1-18 (2012).
doi: 10.1186/2047-217x-1-18
pubmed: 23587118
pmcid: 3626529
Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–9 (2011).
pubmed: 21149342
doi: 10.1093/bioinformatics/btq683
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transformation. Bioinformatics 26, 589–595 (2010).
pubmed: 20080505
pmcid: 2828108
doi: 10.1093/bioinformatics/btp698
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943
pmcid: 2723002
doi: 10.1093/bioinformatics/btp352
Garrison E. & Marth G. Haplotype-based variant detection from short-read sequencing. Preprint at, http://arxiv.org/abs/1207.3907 (2012).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
pubmed: 21653522
pmcid: 3137218
doi: 10.1093/bioinformatics/btr330
Jombart, T. Adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).
pubmed: 18397895
doi: 10.1093/bioinformatics/btn129
Paradis, E. Pegas: an R package for population genetics with an integrated–modular approach. Bioinformatics 26, 419–420 (2010).
pubmed: 20080509
doi: 10.1093/bioinformatics/btp696
Wickham, H. Ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag New York, 2009).
R core Team. R: a language and environment for statistical computing. R foundation for statistical computing, Vienna; https://www.R-project.org (Accessed September 13th 2019) (2018).
Kamvar, Z. N., Tabina, J. F. & Grünwald, N. J. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ Prepr. 2, e281, https://doi.org/10.7717/peerj.281 (2014).
doi: 10.7717/peerj.281
Kamvar, Z. N., Brooks, J. C. & Grünwald, N. J. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality. Front. Genet. 6, 208, https://doi.org/10.3389/fgene.2015.00208 (2015).
doi: 10.3389/fgene.2015.00208
pubmed: 26113860
pmcid: 4462096
Rambaut, A. FigTree version 1.4.4, http://tree.bio.ed.ac.uk/software/figtree/ (Accessed September 13th 2019).
Larrañaga, N. et al. A Mesoamerican origin of cherimoya (Annona cherimola Mill.): Implications for conservation of plant genetic resources. Mol. Ecol. 26, 4116–4130 (2017).
pubmed: 28437594
doi: 10.1111/mec.14157
Martin, C., Herrero, M. & Hormaza, J. I. Molecular characterization of apricot germplasm from an old stone collection. PLoS. One 6, e23979, https://doi.org/10.1371/journal.pone.0023979 (2011).
doi: 10.1371/journal.pone.0023979
pubmed: 21901149
pmcid: 3162011
Pritchard, J. K., Wen, X. & Falush, D. Documentation for structure software: version 2.3. Preprint at, http://burfordreiskind.com/wp-content/uploads/Structure_Manual_doc.pdf (Accessed September 13th 2019) (2010).
Evanno, G., Regnaut, S. & GOUDET, J. Detecting the number of clusters of individuals using the software: STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
pubmed: 15969739
doi: 10.1111/j.1365-294X.2005.02553.x
Hahn, M. W. Population structure in Molecular Population Genetics. (eds Sinauer Associates) 81–83 (Oxford University Press. U.S.A., 2018).
Pfeifer, B., Wittelsbürger, U., Ramos-Onsins, S. E. & Lercher, M. J. PopGenome: an efficient Swiss army knife for population genomic analyses in R. Mol. Biol. Evol. 31, 1929–36, https://doi.org/10.1093/molbev/msu136 (2014).
doi: 10.1093/molbev/msu136
pubmed: 24739305
pmcid: 4069620
Hofshi, R. Avocado database, http://www.avocadosource.com/AvocadoVarieties/QueryDB.asp (Accessed September 13th 2019).
U.S. National Plant Germplasm System, https://npgsweb.ars-grin.gov/gringlobal/search.aspx? (Accessed September 13th 2019).
Avocado information database, https://www.myavocadotrees.com/beta-avocado.html (Accessed September 13th 2019).
Wolfe, H. S., Toy, L. R. & Stahl, A. L. Avocado production in Florida. Fl. Agr. Ext. Serv. Bull. 141 (1949).
Ben-Ya’cov, A., Zilberstaine, M., Goren, M. & Tomer, E. The Israeli avocado germplasm bank: where and why the items had been collected. In Proc. V World Avocado Congress. Spain. October 19–24 (2003).