High-quality, genome-wide SNP genotypic data for pedigreed germplasm of the diploid outbreeding species apple, peach, and sweet cherry through a common workflow.
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2019
2019
Historique:
received:
04
01
2019
accepted:
19
04
2019
entrez:
28
6
2019
pubmed:
28
6
2019
medline:
6
2
2020
Statut:
epublish
Résumé
High-quality genotypic data is a requirement for many genetic analyses. For any crop, errors in genotype calls, phasing of markers, linkage maps, pedigree records, and unnoticed variation in ploidy levels can lead to spurious marker-locus-trait associations and incorrect origin assignment of alleles to individuals. High-throughput genotyping requires automated scoring, as manual inspection of thousands of scored loci is too time-consuming. However, automated SNP scoring can result in errors that should be corrected to ensure recorded genotypic data are accurate and thereby ensure confidence in downstream genetic analyses. To enable quick identification of errors in a large genotypic data set, we have developed a comprehensive workflow. This multiple-step workflow is based on inheritance principles and on removal of markers and individuals that do not follow these principles, as demonstrated here for apple, peach, and sweet cherry. Genotypic data was obtained on pedigreed germplasm using 6-9K SNP arrays for each crop and a subset of well-performing SNPs was created using ASSIsT. Use of correct (and corrected) pedigree records readily identified violations of simple inheritance principles in the genotypic data, streamlined with FlexQTL software. Retained SNPs were grouped into haploblocks to increase the information content of single alleles and reduce computational power needed in downstream genetic analyses. Haploblock borders were defined by recombination locations detected in ancestral generations of cultivars and selections. Another round of inheritance-checking was conducted, for haploblock alleles (i.e., haplotypes). High-quality genotypic data sets were created using this workflow for pedigreed collections representing the U.S. breeding germplasm of apple, peach, and sweet cherry evaluated within the RosBREED project. These data sets contain 3855, 4005, and 1617 SNPs spread over 932, 103, and 196 haploblocks in apple, peach, and sweet cherry, respectively. The highly curated phased SNP and haplotype data sets, as well as the raw iScan data, of germplasm in the apple, peach, and sweet cherry Crop Reference Sets is available through the Genome Database for Rosaceae.
Identifiants
pubmed: 31246947
doi: 10.1371/journal.pone.0210928
pii: PONE-D-18-37133
pmc: PMC6597046
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0210928Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
J Exp Bot. 2016 Apr;67(9):2875-88
pubmed: 27034326
Nat Rev Genet. 2005 Nov;6(11):847-59
pubmed: 16304600
Theor Appl Genet. 2013 Feb;126(2):401-14
pubmed: 23015217
Mol Breed. 2016;36:119
pubmed: 27547106
Hortic Res. 2016 Nov 23;3:16057
pubmed: 27917289
Eur J Hum Genet. 2001 Feb;9(2):130-4
pubmed: 11313746
Theor Appl Genet. 1989 Jan;77(1):95-101
pubmed: 24232480
Nat Genet. 2010 Oct;42(10):833-9
pubmed: 20802477
J Exp Bot. 2017 Mar 1;68(7):1451-1466
pubmed: 28338805
PLoS One. 2012;7(2):e31745
pubmed: 22363718
Nat Genet. 2017 Jul;49(7):1099-1106
pubmed: 28581499
PLoS One. 2014 Oct 10;9(10):e110377
pubmed: 25303088
Hortic Res. 2017 Feb 22;4:17003
pubmed: 28243452
PLoS One. 2018 Nov 21;13(11):e0207724
pubmed: 30462743
PLoS One. 2012;7(12):e48305
pubmed: 23284615
Hum Hered. 1997 Mar-Apr;47(2):86-100
pubmed: 9097090
Mol Plant. 2017 Aug 7;10(8):1047-1064
pubmed: 28669791
BMC Genomics. 2017 Jun 6;18(1):404
pubmed: 28583082
PLoS One. 2012;7(5):e36674
pubmed: 22574211
Plant J. 2016 Apr;86(1):62-74
pubmed: 26919684
Front Plant Sci. 2017 Jun 07;8:858
pubmed: 28638387
Proc Natl Acad Sci U S A. 2009 Apr 14;106(15):6256-61
pubmed: 19329491
Am J Hum Genet. 1991 Nov;49(5):985-94
pubmed: 1928104
PLoS One. 2013;8(1):e54743
pubmed: 23382953
Bioinformatics. 2015 Dec 1;31(23):3873-4
pubmed: 26249809
Heredity (Edinb). 2003 Jan;90(1):33-8
pubmed: 12522423
BMC Genomics. 2015 Mar 07;16:155
pubmed: 25886969
BMC Genomics. 2017 Mar 11;18(1):225
pubmed: 28284188
Theor Appl Genet. 2016 Jun;129(6):1191-201
pubmed: 26910360
Nat Protoc. 2014 Nov;9(11):2643-62
pubmed: 25321409
Theor Appl Genet. 2018 Oct;131(10):2167-2177
pubmed: 30032317
Eur J Hum Genet. 2002 Oct;10(10):616-22
pubmed: 12357332
Hortic Res. 2018 Mar 1;5:11
pubmed: 29507735
Genet Epidemiol. 2014 May;38(4):291-9
pubmed: 24718985
Am J Hum Genet. 2009 Dec;85(6):847-61
pubmed: 19931040
PLoS One. 2012;7(4):e35668
pubmed: 22536421
Hum Hered. 2002;54(1):22-33
pubmed: 12446984
Heredity (Edinb). 2006 Aug;97(2):102-10
pubmed: 16721391
Mol Ecol. 2005 Feb;14(2):599-612
pubmed: 15660949
Mol Ecol. 2007 Mar;16(5):1099-106
pubmed: 17305863
Nat Genet. 2013 May;45(5):487-94
pubmed: 23525075
Hortic Res. 2019 Apr 5;6:59
pubmed: 30962944
G3 (Bethesda). 2017 Jun 7;7(6):1707-1719
pubmed: 28592652
Arthritis Rheum. 2009 Apr;60(4):1085-95
pubmed: 19333953
PLoS One. 2013 Jun 27;8(6):e67407
pubmed: 23826289
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901