Genomic selection for agronomic traits in a winter wheat breeding program.
Journal
TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik
ISSN: 1432-2242
Titre abrégé: Theor Appl Genet
Pays: Germany
ID NLM: 0145600
Informations de publication
Date de publication:
10 Mar 2023
10 Mar 2023
Historique:
received:
29
09
2022
accepted:
19
12
2022
entrez:
10
3
2023
pubmed:
11
3
2023
medline:
15
3
2023
Statut:
epublish
Résumé
rAMP-seq based genomic selection for agronomic traits has been shown to be a useful tool for winter wheat breeding programs by increasing the rate of genetic gain. Genomic selection (GS) is an effective strategy to employ in a breeding program that focuses on optimizing quantitative traits, which results in the ability for breeders to select the best genotypes. GS was incorporated into a breeding program to determine the potential for implementation on an annual basis, with emphasis on selecting optimal parents and decreasing the time and costs associated with phenotyping large numbers of genotypes. The design options for applying repeat amplification sequencing (rAMP-seq) in bread wheat were explored, and a low-cost single primer pair strategy was implemented. A total of 1870 winter wheat genotypes were phenotyped and genotyped using rAMP-seq. The optimization of training to testing population size showed that the 70:30 ratio provided the most consistent prediction accuracy. Three GS models were tested, rrBLUP, RKHS and feed-forward neural networks using the University of Guelph Winter Wheat Breeding Program (UGWWBP) and Elite-UGWWBP populations. The models performed equally well for both populations and did not differ in prediction accuracy (r) for most agronomic traits, with the exception of yield, where RKHS performed the best with an r = 0.34 and 0.39 for each population, respectively. The ability to operate a breeding program where multiple selection strategies, including GS, are utilized will lead to higher efficiency in the program and ultimately lead to a higher rate of genetic gain.
Identifiants
pubmed: 36897431
doi: 10.1007/s00122-023-04294-1
pii: 10.1007/s00122-023-04294-1
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
38Subventions
Organisme : National Research Council of Canada
ID : ational Canadian Wheat Improvement Flagship Program
Informations de copyright
© 2023. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
Références
Akdemir D, Sanchez JI, Jannink JL (2015) Optimization of genomic selection training populations with a genetic algorithm. Genet Select Evol 47:38
doi: 10.1186/s12711-015-0116-6
Ali M, Zhang Y, Rasheed A et al (2020) Genomic prediction for grain yield and yield-related traits in Chinese winter wheat. Int J Mol Sci 21:1342
pubmed: 32079240
pmcid: 7073225
doi: 10.3390/ijms21041342
Andrews S, Krueger F, Segonds-Pichon A et al (2012) FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Arruda MP, Brown PJ, Lipka AE et al (2015) Genomic selection for predicting Fusarium head blight resistance in a wheat breeding program. Plant Genome. https://doi.org/10.3835/plantgenome2015.01.0003
doi: 10.3835/plantgenome2015.01.0003
pubmed: 33228272
Bates D, Machler M, Bolker B et al (2015) Fitting linear mixed-effects models using lme4. J Stat Softw 67:1–48
doi: 10.18637/jss.v067.i01
Battenfield SD, Guzmàn C, Gaynor RC et al (2016) Genomic selection for processing and end-use quality traits in the CIMMYT spring bread wheat breeding program. Plant Genome. https://doi.org/10.3835/plantgenome2016.01.0005
doi: 10.3835/plantgenome2016.01.0005
pubmed: 27898810
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30(15):2114–2120
pubmed: 24695404
pmcid: 4103590
doi: 10.1093/bioinformatics/btu170
Berro I, Lado B, Nalin RS et al (2019) Training population optimization for genomic selection. Plant Genome 12:190028
doi: 10.3835/plantgenome2019.04.0028
Picard Toolkit (2019) Broad Institute, GitHub Repository. https://broadinstitute.github.io/picard/
Bradbury BJ, Zhang Z, Kroon DE et al (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635
pubmed: 17586829
doi: 10.1093/bioinformatics/btm308
Browning BL, Zhou Y, Browning SR (2018) A one-penny imputed genome from next generation reference panels. Am J Hum Genet 103:338–348
pubmed: 30100085
pmcid: 6128308
doi: 10.1016/j.ajhg.2018.07.015
Buckler ES, Ilut DC, Wang X et al (2016) rAmpSeq: using repetitive sequences for robust genotyping. bioRxiv 096628
Canziani A, Paszke A, Culurciello E (2017) An analysis of deep neural network models for practical applications. arXiv e1605.07678
Cooper M, Hammer, GL (1996) Plant adaptation and crop improvement. CABI Publishing, Wallingford, UK, ICRISAT, Patancheru, India, and IRRI, Manila, The Philippines
Cooper M, DeLacy IH (1994) Relationships among analytical methods used to study genotypic variation and genotype-by-environment interaction in plant breeding multi-environment experiments. Theor Appl Genet 88:561–572
pubmed: 24186111
doi: 10.1007/BF01240919
Cornelius PL, Crossa J, Seyedsadr MS (1996) Statistical tests and estimators for multiplicative models for genotype-by-environment interaction. CRC Press, Boca Raton
Crossa J, de los Campos G, Pérez P et al (2010) Prediction of genetic values of quantitative traits in plant breeding using pedigree and molecular markers. Genetics 186:713–724
pubmed: 20813882
pmcid: 2954475
doi: 10.1534/genetics.110.118521
Crossa J, Pérez P, Hickey J et al (2014) Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity 112:48–60
pubmed: 23572121
doi: 10.1038/hdy.2013.16
Cullis BR, Smith AB, Coombes NE (2006) On the design of early generation variety trials with correlated data. J Agric Biol Environ Stat 11:381
doi: 10.1198/108571106X154443
Danecek P, Auton A, Abecasis G et al (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158
pubmed: 21653522
pmcid: 3137218
doi: 10.1093/bioinformatics/btr330
dd los Campos G, Gianola D, Rosa GJM (2009) Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation. J Anim Sci 87:1883–1887
pubmed: 19213705
doi: 10.2527/jas.2008-1259
de los Campos G, Gianola D, Allison DB (2010) Predicting genetic predisposition in humans: the promise of whole-genome markers. Nat Rev Genet 11:880–886c
pubmed: 21045869
doi: 10.1038/nrg2898
Endelman JB (2011) Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250–255
doi: 10.3835/plantgenome2011.08.0024
Fine TL (2006) Feedforward neural network methodology. Statistics for engineering and information science. Springer, New York, pp 53–81
Fischer RA, Edmeades GO (2010) Breeding and cereal yield progress. Crop Sci 50:85–98
doi: 10.2135/cropsci2009.10.0564
Fritsch S, Guenther F, Wright MN (2019) Neuralnet: training of neural networks. R package version 1.44.2
Fu YB, Peterson GW, Horbach C, Konkin DJ, Beiles A, Nevo E (2019) Elevated mutation and selection in wild emmer wheat in response to 28 years of global warming. Proc Natl Acad Sci 116:20002–20008
pubmed: 31527251
pmcid: 6778259
doi: 10.1073/pnas.1909564116
Gianola D, Van Kaam JBCHM (2008) Reproducing kernel Hilbert spaces regression methods for genomic assisted prediction of quantitative traits. Genetics 178:2289–2303
pubmed: 18430950
pmcid: 2323816
doi: 10.1534/genetics.107.084285
Gianola D, Perez-Enciso M, Toro MA (2003) On marker-assisted prediction of genetic value: beyond the ridge. Genetics 163:347–365
pubmed: 12586721
pmcid: 1462425
doi: 10.1093/genetics/163.1.347
Hayes BJ, Bowman PJ, Chamberlain AC et al (2009) Accuracy of genomic breeding values in multi-breed dairy cattle populations. Genet Sel Evol 41:51
pubmed: 19930712
pmcid: 2791750
doi: 10.1186/1297-9686-41-51
He S, Schulthess AW, Mirdita V et al (2016) Genomic selection in a commercial winter wheat population. Theor Appl Genet 129:641–651
pubmed: 26747048
doi: 10.1007/s00122-015-2655-1
Heffner EL, Sorrells ME, Jannink JL (2009) Genomic selection for crop improvement. Crop Sci 49:1–12
doi: 10.2135/cropsci2008.08.0512
Heffner EL, Jannink JL, Sorrells ME (2011) Genomic selection accuracy using multifamily prediction models in a wheat breeding program. Plant Genome 4:65–75
doi: 10.3835/plantgenome.2010.12.0029
Henderson CR (1984) Applications of linear models in animal breeding. University of Guelph, Guelph
Heslot N, Yang HP, Sorrells ME (2012) Genomic selection in plant breeding: a comparison of models. Crop Sci 52:146–160
doi: 10.2135/cropsci2011.06.0297
Heslot N, Jannink JL, Sorrells ME (2013) Using genomic prediction to characterize environments and optimize prediction accuracy in applied breeding data. Crop Sci 53:921–933
doi: 10.2135/cropsci2012.07.0420
International Wheat Genome Sequencing Consortium (IWGSC), Appels R, Eversole K, Stein N, Feuillet C, Keller B, Rogers J, Pozniak CJ, Choulet F, Distelfeld A, Poland J (2018) Shifting the limits in wheat research and breeding using a fully annotated reference genome. Sci 361(6403):eaar7191
Jannink JL, Lorenz AJ, Iwata H (2010) Genomic selection in plant breeding: from theory to practice. Brief Funct Genom 9:166–177
doi: 10.1093/bfgp/elq001
Jolliffe IT, Cadima J (2016) Principal component analysis: a review and recent developments. R Soc Open Sci 374:20150202
Jordan KW, Wang S, Lun Y et al (2015) A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes. Genome Biol 2015(16):1–8
Jordan KW, Bradbury PJ, Miller ZR et al (2022) Development of the wheat practical haplotype graph database as a resource for genotyping data storage and genotype imputation. G3 12:jkab390
pubmed: 34751373
doi: 10.1093/g3journal/jkab390
Katoh K, Misawa K, Kuma KI, Miyata T (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30:3059–3066
pubmed: 12136088
pmcid: 135756
doi: 10.1093/nar/gkf436
Lado B, Barrios PG, Quincke M et al (2016) Modeling genotype × environment interaction for genomic selection with unbalanced data from a wheat breeding program. Crop Sci 56:2165–2179
doi: 10.2135/cropsci2015.04.0207
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25:1754–1760
pubmed: 19451168
pmcid: 2705234
doi: 10.1093/bioinformatics/btp324
Marçais G, Kingsford C (2011) A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27:764–770
pubmed: 21217122
pmcid: 3051319
doi: 10.1093/bioinformatics/btr011
Mathews KL, Malosetti M, Chapman S et al (2008) Multi-environment QTL mixed models for drought stress adaptation in wheat. Theor Appl Genet 117:1077–1091
pubmed: 18696042
doi: 10.1007/s00122-008-0846-8
McKenna A, Hanna M, Banks E (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303
pubmed: 20644199
pmcid: 2928508
doi: 10.1101/gr.107524.110
Meuwissen THE, Hayes B, Goddard M (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819
pubmed: 11290733
pmcid: 1461589
doi: 10.1093/genetics/157.4.1819
Morota G, Gianola D (2014) Kernel-based whole-genome prediction of complex traits: a review. Front Genet 5:1–13
doi: 10.3389/fgene.2014.00363
Pérez P, de los Campos G (2014) Genome-wide regression and prediction with the BGLR statistical package. Genetics 198:483–495
pubmed: 25009151
pmcid: 4196607
doi: 10.1534/genetics.114.164442
Pérez-Rodríguez P, Gianola D, González-Camacho JM (2012) Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat. G3 (Bethesda) 2:1595–1605
pubmed: 23275882
doi: 10.1534/g3.112.003665
Piepho HP (2009) Ridge regression and extensions for genomewide selection in maize. Crop Sci 49:1165–1176
doi: 10.2135/cropsci2008.10.0595
Piepho HP, Büsche A, Truberg B (2006) On the use of multiple lattice designs and α-designs in plant breeding trials. Plant Breed 125:523–528
doi: 10.1111/j.1439-0523.2006.01267.x
Piepho HP, Möhring J, Melchinger AE et al (2008) BLUP for phenotypic selection in plant breeding and variety testing. Euphytica 161:209–228
doi: 10.1007/s10681-007-9449-8
Podlich DW, Cooper M, Basford KE et al (1999) Computer simulation of a selection strategy to accommodate genotype environment interactions in a wheat recurrent selection programme. Crop Sci 118:17–28
Schmidt P, Hartung J, Bennewitz J et al (2019) Heritability in plant breeding on a genotype-difference basis. Genetics 212:991–1008
pubmed: 31248886
pmcid: 6707473
doi: 10.1534/genetics.119.302134
Spindel JE, Begum H, Akdemir D et al (2016) Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity 116:395–408
pubmed: 26860200
pmcid: 4806696
doi: 10.1038/hdy.2015.113
Tweeten L, Thompson SR (2008) Long-term global agricultural output supply-demand balance and real farm and food prices. Ohio State University. Department of Agricultural, Environmental and Development Economics. pp 1–32
VanRaden PM, Van Tassell CP, Wiggans GR et al (2009) Invited review: reliability of genomic predictions for North American holstein bulls. J Dairy Sci 92:16–24
pubmed: 19109259
doi: 10.3168/jds.2008-1514
Wang B, Sverdlov S, Thompson E (2017) Efficient estimation of realized kinship from single nucleotide polymorphism genotypes. Genetics 205:1063–1078
pubmed: 28100587
pmcid: 5340323
doi: 10.1534/genetics.116.197004
Wickham H, François R, Henry L et al (2020) dplyr: a grammar of data manipulation. R package version 1.0.2. https://CRAN.R-project.org/package=dplyr
Wright S (1921) Systems of mating. The biometric relations between parents and offspring. Genetics 6:111–123
pubmed: 17245958
pmcid: 1200501
doi: 10.1093/genetics/6.2.111
Yan W, Kang MS, Ma B et al (2007) GGE biplot vs. AMMI analysis of genotype-by-environment data. Crop Sci 47:643–653
doi: 10.2135/cropsci2006.06.0374
Zhang C, Dong SS, Xu JY et al (2019) PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 35:1786–1788
pubmed: 30321304
doi: 10.1093/bioinformatics/bty875