Genomic prediction of preliminary yield trials in chickpea: Effect of functional annotation of SNPs and environment.
Journal
The plant genome
ISSN: 1940-3372
Titre abrégé: Plant Genome
Pays: United States
ID NLM: 101273919
Informations de publication
Date de publication:
03 2022
03 2022
Historique:
received:
26
05
2021
accepted:
14
09
2021
pubmed:
18
11
2021
medline:
21
4
2022
entrez:
17
11
2021
Statut:
ppublish
Résumé
Achieving yield potential in chickpea (Cicer arietinum L.) is limited by many constraints that include biotic and abiotic stresses. Combining next-generation sequencing technology with advanced statistical modeling has the potential to increase genetic gain efficiently. Whole genome resequencing data was obtained from 315 advanced chickpea breeding lines from the Australian chickpea breeding program resulting in more than 298,000 single nucleotide polymorphisms (SNPs) discovered. Analysis of population structure revealed a distinct group of breeding lines with many alleles that are absent from recently released Australian cultivars. Genome-wide association studies (GWAS) using these Australian breeding lines identified 20 SNPs significantly associated with grain yield in multiple field environments. A reduced level of nucleotide diversity and extended linkage disequilibrium suggested that some regions in these chickpea genomes may have been through selective breeding for yield or other traits. A large introgression segment that introduced from C. echinospermum for phytophthora root rot resistance was identified on chromosome 6, yet it also has unintended consequences of reducing yield due to linkage drag. We further investigated the effect of genotype by environment interaction on genomic prediction of yield. We found that the training set had better prediction accuracy when phenotyped under conditions relevant to the targeted environments. We also investigated the effect of SNP functional annotation on prediction accuracy using different subsets of SNPs based on their genomic locations: regulatory regions, exome, and alternative splice sites. Compared with the whole SNP dataset, a subset of SNPs did not significantly decrease prediction accuracy for grain yield despite consisting of a smaller number of SNPs.
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e20166Informations de copyright
© 2021 The Authors. The Plant Genome published by Wiley Periodicals LLC on behalf of Crop Science Society of America.
Références
Abraham, Z., & del Pozo, J. C. (2012). Ectopic expression of E2FB, a cell cycle transcription factor, accelerates flowering and increases fruit yield in tomato. Journal of Plant Growth Regulation, 31(1), 11-24. https://doi.org/10.1007/s00344-011-9215-y
Albrecht, T., Wimmer, V., Auinger, H. J., Erbe, M., Knaak, C., Ouzunova, M., Simianer, H., & Schon, C. C. (2011). Genome-based prediction of testcross values in maize. Theoretical and Applied Genetics, 123(2), 339-350. https://doi.org/10.1007/s00122-011-1587-7
Alexander, D. H., & Lange, K. (2011). Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics, 12, 246. https://doi.org/10.1186/1471-2105-12-246
Alexander, D. H., Novembre, J., & Lange, K. (2009). Fast model-based estimation of ancestry in unrelated individuals. Genome Research, 19(9), 1655-1664. https://doi.org/10.1101/gr.094052.109
Amalraj, A., Taylor, J., Bithell, S., Li, Y., Moore, K., Hobson, K., & Sutton, T. (2019). Mapping resistance to Phytophthora root rot identifies independent loci from cultivated (Cicer arietinum L.) and wild (Cicer echinospermum P.H. Davis) chickpea. Theoretical and Applied Genetics, 132(4), 1017-1033. https://doi.org/10.1007/s00122-018-3256-6
Asoro, F. G., Newell, M. A., Beavis, W. D., Scott, M. P., Tinker, N. A., & Jannink, J. L. (2013). Genomic, marker-assisted, and pedigree-BLUP selection methods for beta-glucan concentration in elite oat. Crop Science, 53(5), 1894-1906. https://doi.org/10.2135/cropsci2012.09.0526
Atieno, J., Li, Y., Langridge, P., Dowling, K., Brien, C., Berger, B., Varshney, R. K., & Sutton, T. (2017). Exploring genetic variation for salinity tolerance in chickpea using image-based phenotyping. Scientific Reports, 7(1), 1300. https://doi.org/10.1038/s41598-017-01211-7
Barton, N. H., Etheridge, A. M., & Veber, A. (2017). The infinitesimal model: Definition, derivation, and implications. Theoretical Population Biology, 118, 50-73. https://doi.org/10.1016/j.tpb.2017.06.001
Berro, I., Lado, B., Nalin, R. S., Quincke, M., & Gutierrez, L. (2019). Training population optimization for genomic selection. Plant Genome, 12(3). https://doi.org/10.3835/plantgenome2019.04.0028
Calderhead, B. (2014). A general construction for parallelizing Metropolis-Hastings algorithms. Proceedings of the National Academy of the Sciences of the United States of America, 111(49), 17408-17413. https://doi.org/10.1073/pnas.1408184111
Chu, J. T., Zhao, Y. S., Beier, S., Schulthess, A. W., Stein, N., Philipp, N., Roder, M. S., & Reif, J. C. (2020). Suitability of single-nucleotide polymorphism arrays versus genotyping-by-sequencing for genebank genomics in wheat. Frontiers in Plant Science, 11. https://doi.org/10.3389/fpls.2020.00042
Cingolani, P., Platts, A., le Wang, L., Coon, M., Nguyen, T., Wang, L., Land, S. J., Lu, X., & Ruden, D. M. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly, 6(2), 80-92. https://doi.org/10.4161/fly.19695
Clark, A. G., Hubisz, M. J., Bustamante, C. D., Williamson, S. H., & Nielsen, R. (2005). Ascertainment bias in studies of human genome-wide polymorphism. Genome Research, 15(11), 1496-1502. https://doi.org/10.1101/gr.4107905
Crossa, J., Perez, P., Hickey, J., Burgueno, J., Ornella, L., Ceron-Rojas, J., Zhang, X., Dreisigacker, S., Babu, R., Li, Y., Bonnett, D., & Mathews, K. (2014). Genomic prediction in CIMMYT maize and wheat breeding programs. Heredity, 112(1), 48-60. https://doi.org/10.1038/hdy.2013.16
Daetwyler, H. D., Hayden, M. J., Spangenberg, G. C., & Hayes, B. J. (2015). Selection on optimal haploid value increases genetic gain and preserves more genetic diversity relative to genomic selection. Genetics, 200(4), 1341. https://doi.org/10.1534/genetics.115.178038
de las Heras-Saldana, S., Lopez, B. I., Moghaddar, N., Park, W., Park, J. E., Chung, K. Y., Lim, D., Lee, S. H., Shin, D., & van der Werf, J. H. J. (2020). Use of gene expression and whole-genome sequence information to improve the accuracy of genomic prediction for carcass traits in Hanwoo cattle. Genetics Selection Evolution, 52(1). https://doi.org/10.1186/s12711-020-00574-2
de Los Campos, G., Hickey, J. M., Pong-Wong, R., Daetwyler, H. D., & Calus, M. P. (2013). Whole-genome regression and prediction methods applied to plant and animal breeding. Genetics, 193(2), 327-345. https://doi.org/10.1534/genetics.112.143313
Do, D. N., Janss, L. L. G., Jensen, J., & Kadarmideen, H. N. (2015). SNP annotation-based whole genomic prediction and selection: An application to feed efficiency and its component traits in pigs. Journal of Animal Science, 93(5), 2056-2063. https://doi.org/10.2527/jas.2014-8640
Doidy, J., Grace, E., Kuehn, C., Simon-Plas, F., Casieri, L., & Wipf, D. (2012). Sugar transporters in plants and in their interactions with fungi. Trends in Plant Science, 17(7), 413-422. https://doi.org/10.1016/j.tplants.2012.03.009
Fikere, M., Barbulescu, D. M., Malmberg, M. M., Shi, F., Koh, J. C. O., Slater, A. T., MacLeod, I. M., Bowman, P. J., Salisbury, P. A., Spangenberg, G. C., Cogan, N. O. I., & Daetwyler, H. D. (2018). Genomic prediction using prior quantitative trait loci information reveals a large reservoir of underutilised blackleg resistance in diverse canola (Brassica napus L.) lines. Plant Genome, 11(2). https://doi.org/10.3835/plantgenome2017.11.0100
Hayes, B. J., Bowman, P. J., Chamberlain, A. J., & Goddard, M. E. (2009). Invited review: Genomic selection in dairy cattle: Progress and challenges. Journal of Dairy Science, 92(2), 433-443. https://doi.org/10.3168/jds.2008-1646
Heslot, N., Akdemir, D., Sorrells, M. E., & Jannink, J. L. (2014). Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions. Theoretical and Applied Genetics, 127(2), 463-480. https://doi.org/10.1007/s00122-013-2231-5
Heslot, N., Jannink, J. L., & Sorrells, M. E. (2013). Using genomic prediction to characterize environments and optimize prediction accuracy in applied breeding data. Crop Science, 53(3), 921-933. https://doi.org/10.2135/cropsci2012.07.0420
Heslot, N., Yang, H.-P., Sorrells, M. E., & Jannink, J.-L. (2012). Genomic selection in plant breeding: A comparison of models. Crop Science, 52, 146. https://doi.org/10.2135/cropsci2011.06.0297
Hickey, J. M., Dreisigacker, S., Crossa, J., Hearne, S., Babu, R., Prasanna, B. M., Grondona, M., Zambelli, A., Windhausen, V. S., Mathews, K., & Gorjanc, G. (2014). Evaluation of genomic selection training population designs and genotyping strategies in plant breeding programs using simulation. Crop Science, 54(4), 1476-1488. https://doi.org/10.2135/cropsci2013.03.0195
Hill, W. G. (2014). Applications of population genetics to animal breeding, from wright, fisher and lush to genomic prediction. Genetics, 196(1), 1-16. https://doi.org/10.1534/genetics.112.147850
Hoffstetter, A., Cabrera, A., Huang, M., & Sneller, C. (2016). Optimizing training population data and validation of genomic selection for economic traits in soft winter wheat. G3 Genes|Genomes|Genetics, 6(9), 2919-2928. https://doi.org/10.1534/g3.116.032532
Jarquin, D., Crossa, J., Lacaze, X., Du Cheyron, P., Daucourt, J., Lorgeou, J., Piraux, F., Guerreiro, L., Perez, P., Calus, M., Burgueno, J., & de los Campos, G. (2014). A reaction norm model for genomic selection using high-dimensional genomic and environmental data. Theoretical and Applied Genetics, 127(3), 595-607. https://doi.org/10.1007/s00122-013-2243-1
Khan, H. A., Siddique, K. H. M., & Colmer, T. D. (2016). Salt sensitivity in chickpea is determined by sodium toxicity. Planta, 244(3), 623-637. https://doi.org/10.1007/s00425-016-2533-3
Kiran, A., Kumar, S., Nayyar, H., & Sharma, K. D. (2019). Low temperature-induced aberrations in male and female reproductive organ development cause flower abortion in chickpea. Plant, Cell and Environment, 42(7), 2075-2089. https://doi.org/10.1111/pce.13536
Koufariotis, L. T., Chen, Y. P. P., Stothard, P., & Hayes, B. (2018). Variance explained by whole genome sequence variants in coding and regulatory genome annotations for six dairy traits. BMC Genomics, 19. https://doi.org/10.1186/s12864-018-4617-x
Lachance, J., & Tishkoff, S. A. (2013). SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it. Bioessays, 35(9), 780-786. https://doi.org/10.1002/bies.201300014
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R., & Proc, G. P. D. (2009). The sequence alignment/map format and SAMtools. Bioinformatics, 25(16), 2078-2079. https://doi.org/10.1093/bioinformatics/btp352
Li, X., Guo, T. T., Mu, Q., Li, X. R., & Yu, J. M. (2018). Genomic and environmental determinants and their interplay underlying phenotypic plasticity. Proceedings of the National Academy of Sciences of the United States of America, 115(26), 6679-6684. https://doi.org/10.1073/pnas.1718326115
Li, Y., Haseneyer, G., Schön, C. C., Ankerst, D., Korzun, V., Wilde, P., & Bauer, E. (2011). High levels of nucleotide diversity and fast decline of linkage disequilibrium in rye (Secale cereale L.) genes involved in frost response. BMC Plant Biology, 11, 6. https://doi.org/10.1186/1471-2229-11-6
Li, Y., Ruperao, P., Batley, J., Edwards, D., Davidson, J., Hobson, K., & Sutton, T. (2017). Genome analysis identified novel candidate genes for ascochyta blight resistance in chickpea using whole genome re-sequencing data. Frontiers in Plant Science, 8. https://doi.org/10.3389/fpls.2017.00359
Li, Y., Ruperao, P., Batley, J., Edwards, D., Khan, T., Colmer, T. D., Pang, J. Y., Siddique, K. H. M., & Sutton, T. (2018). Investigating drought tolerance in chickpea using genome-wide association mapping and genomic selection based on whole-genome resequencing data. Frontiers in Plant Science, 9. https://doi.org/10.3389/fpls.2018.00190
Lipka, A. E., Tian, F., Wang, Q., Peiffer, J., Li, M., Bradbury, P. J., Gore, M. A., Buckler, E. S., & Zhang, Z. (2012). GAPIT: Genome association and prediction integrated tool. Bioinformatics, 28(18), 2397-2399. https://doi.org/10.1093/bioinformatics/bts444
Lorenz, A. J. (2013). Resource allocation for maximizing prediction accuracy and genetic gain of genomic selection in plant breeding: A simulation experiment. G3 Genes|Genomes|Genetics, 3(3), 481-491. https://doi.org/10.1534/g3.112.004911
Magyar, Z., De Veylder, L., Atanassova, A., Bako, L., Inze, D., & Bogre, L. (2005). The role of the Arabidopsis E2FB transcription factor in regulating auxin-dependent cell division. Plant Cell, 17(9), 2527-2541. https://doi.org/10.1105/tpc.105.033761
Malomane, D. K., Reimer, C., Weigend, S., Weigend, A., Sharifi, A. R., & Simianer, H. (2018). Efficiency of different strategies to mitigate ascertainment bias when using SNP panels in diversity studies. BMC Genomics, 19. https://doi.org/10.1186/s12864-017-4416-9
Meuwissen, T., Hayes, B., & Goddard, M. (2016). Genomic selection: A paradigm shift in animal breeding. Animal Frontiers, 6(1), 6-14. https://doi.org/10.2527/af.2016-0002
Meuwissen, T., Hayes, B. J., & Goddard, M. E. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics, 157(4), 1819-1829.
Muller, D., Schopp, P., & Melchinger, A. E. (2018). Selection on expected maximum haploid breeding values can increase genetic gain in recurrent genomic selection. G3-Genes Genomes Genetics, 8(4), 1173-1181. https://doi.org/10.1534/g3.118.200091
Norman, A., Taylor, J., Edwards, J., & Kuchel, H. (2018). Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy. G3 (Bethesda), 8(9), 2889-2899. https://doi.org/10.1534/g3.118.200311
Norman, A., Taylor, J., Tanaka, E., Telfer, P., Edwards, J., Martinant, J. P., & Kuchel, H. (2017). Increased genomic prediction accuracy in wheat breeding using a large Australian panel. Theoretical and Applied Genetics, 130(12), 2543-2555. https://doi.org/10.1007/s00122-017-2975-4
Pang, J., Turner, N. C., Khan, T., Du, Y. L., Xiong, J. L., Colmer, T. D., Devilla, R., Stefanova, K., & Siddique, K. H. (2016). Response of chickpea (Cicer arietinum L.) to terminal drought: Leaf stomatal conductance, pod abscisic acid concentration, and seed set. Journal of Experimental Botany. https://doi.org/10.1093/jxb/erw153
Quiroz, M., Kohn, R., Villani, M., & Tran, M. N. (2019). Speeding up MCMC by efficient data subsampling. Journal of the American Statistical Association, 114(526), 831-843. https://doi.org/10.1080/01621459.2018.1448827
Riedelsheimer, C., & Melchinger, A. E. (2013). Optimizing the allocation of resources for genomic selection in one breeding cycle. Theoretical and Applied Genetics, 126(11), 2835-2848. https://doi.org/10.1007/s00122-013-2175-9
Rincent, R., Malosetti, M., Ababaei, B., Touzy, G., Mini, A., Bogard, M., Martre, P., Le Gouis, J., & van Eeuwijk, F. (2019). Using crop growth model stress covariates and AMMI decomposition to better predict genotype-by-environment interactions. Theoretical and Applied Genetics, 132(12), 3399-3411. https://doi.org/10.1007/s00122-019-03432-y
Rodgers-Melnick, E., Vera, D. L., Bass, H. W., & Buckler, E. S. (2016). Open chromatin reveals the functional maize genome. Proceedings of the National Academy of Sciences of the United States of America, 113(22), E3177-3184. https://doi.org/10.1073/pnas.1525244113
Roorkiwal, M., Rathore, A., Das, R. R., Singh, M. K., Jain, A., Srinivasan, S., Gaur, P. M., Chellapilla, B., Tripathi, S., Li, Y., Hickey, J. M., Lorenz, A., Sutton, T., Crossa, J., Jannink, J. L., & Varshney, R. K. (2016). Genome-enabled prediction models for yield related traits in chickpea. Frontiers in Plant Science, 7, 1666. https://doi.org/10.3389/fpls.2016.01666
Sadras, V. O., Lake, L., Li, Y., Farquharson, E. A., & Sutton, T. (2016). Phenotypic plasticity and its genetic regulation for yield, nitrogen fixation and delta13C in chickpea crops under varying water regimes. Journal of Experimental Botany, 67(14), 4339-4351. https://doi.org/10.1093/jxb/erw221
Spindel, J. E., Begum, H., Akdemir, D., Collard, B., Redona, E., Jannink, J. L., & McCouch, S. (2016). Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity, 116(4), 395-408. https://doi.org/10.1038/hdy.2015.113
Tang, Y., Liu, X., Wang, J., Li, M., Wang, Q., Tian, F., Su, Z., Pan, Y., Liu, D., Lipka, A. E., Buckler, E. S., & Zhang, Z. (2016). GAPIT Version 2: An enhanced integrated tool for genomic association and prediction. Plant Genome, 9(2). doi:https://doi.org/10.3835/plantgenome2015.11.0120
Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society Series B-Methodological, 58(1), 267-288.
Toosi, A., Fernando, R. L., & Dekkers, J. C. M. (2010). Genomic selection in admixed and crossbred populations. Journal of Animal Science, 88(1), 32-46. https://doi.org/10.2527/jas.2009-1975
Van Belle, G., Fisher, L., Heagerty, P., & Lumley, T. (2004). Biostatistics: A methodology for the health sciences (2nd ed.). John Wiley & Sons, Inc.
Wald, A. (1943). Tests of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society, 54(1-3), 426-482.
Waldmann, P., Hallander, J., Hoti, F., & Sillanpaa, M. J. (2008). Efficient Markov chain Monte Carlo implementation of Bayesian analysis of additive and dominance genetic variances in noninbred pedigrees. Genetics, 179(2), 1101-1112. https://doi.org/10.1534/genetics.107.084160
Wallace, T. C., Murray, R., & Zelman, K. M. (2016). The nutritional value and health benefits of chickpeas and hummus. Nutrients, 8(12). https://doi.org/10.3390/nu8120766
Wang, Q., Tian, F., Pan, Y., Buckler, E. S., & Zhang, Z. (2014). A SUPER powerful method for genome wide association study. Plos One, 9(9), e107684. https://doi.org/10.1371/journal.pone.0107684
Weigel, K. A., de los Campos, G., Vazquez, A. I., Rosa, G. J. M., Gianola, D., & Van Tassell, C. P. (2010). Accuracy of direct genomic values derived from imputed single nucleotide polymorphism genotypes in Jersey cattle. Journal of Dairy Science, 93(11), 5423-5435. https://doi.org/10.3168/jds.2010-3149
Wimmer, V., Lehermeier, C., Albrecht, T., Auinger, H. J., Wang, Y., & Schon, C. C. (2013). Genome-wide prediction of traits with different genetic architecture through efficient variable selection. Genetics, 195(2), 573-587. https://doi.org/10.1534/genetics.113.150078
Wobus, U., & Weber, H. (1999). Sugars as signal molecules in plant seed development. Biological Chemistry, 380(7-8), 937-944. https://doi.org/10.1515/bc.1999.116
Zhang, W., Zhang, T., Wu, Y. F., & Jiang, J. M. (2012). Genome-wide identification of regulatory DNA elements and protein-binding footprints using signatures of open chromatin in Arabidopsis. Plant Cell, 24(7), 2719-2731. https://doi.org/10.1105/tpc.112.098061
Zhang, Z., Ersoz, E., Lai, C. Q., Todhunter, R. J., Tiwari, H. K., Gore, M. A., Bradbury, P. J., Yu, J. M., Arnett, D. K., Ordovas, J. M., & Buckler, E. S. (2010). Mixed linear model approach adapted for genome-wide association studies. Nature Genetics, 42(4), 355-360. https://doi.org/10.1038/ng.546