Genome-based prediction of Bayesian linear and non-linear regression models for ordinal data.


Journal

The plant genome
ISSN: 1940-3372
Titre abrégé: Plant Genome
Pays: United States
ID NLM: 101273919

Informations de publication

Date de publication:
07 2020
Historique:
received: 10 01 2020
revised: 21 03 2020
accepted: 28 03 2020
entrez: 5 10 2020
pubmed: 6 10 2020
medline: 26 11 2020
Statut: ppublish

Résumé

Linear and non-linear models used in applications of genomic selection (GS) can fit different types of responses (e.g., continuous, ordinal, binary). In recent years, several genomic-enabled prediction models have been developed for predicting complex traits in genomic-assisted animal and plant breeding. These models include linear, non-linear and non-parametric models, mostly for continuous responses and less frequently for categorical responses. Several linear and non-linear models are special cases of a more general family of statistical models known as artificial neural networks, which provide better prediction ability than other models. In this paper, we propose a Bayesian Regularized Neural Network (BRNNO) for modelling ordinal data. The proposed model was fitted using a Bayesian framework; we used the data augmentation algorithm to facilitate computations. The proposed model was fitted using the Gibbs Maximum a Posteriori and Generalized EM algorithm implemented by combining code written in C and R programming languages. The new model was tested with two real maize datasets evaluated for Septoria and GLS diseases and was compared with the Bayesian Ordered Probit Model (BOPM). Results indicated that the BRNNO model performed better in terms of genomic-based prediction than the BOPM model.

Identifiants

pubmed: 33016610
doi: 10.1002/tpg2.20021
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

e20021

Informations de copyright

© 2020 The Authors. The Plant Genome published by Wiley Periodicals, LLC on behalf of Crop Science Society of America.

Références

Albert, J. H., & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88(422), 669-679. https://doi.org/10.1080/01621459.1993.10476321
Crossa, J., Pérez, P., de los Campos, G., Mahuku, G., Dreisigacker, S., & Magorokosho, C. (2011). Genomic selection and prediction in plant breeding. Journal of Crop Improvement, 25(3), 239-261. https://doi.org/10.1080/15427528.2011.558767
Crossa, J., Pérez-Rodríguez, P., Cuevas, J., Montesinos-López, O., Jarquín, D., de los Campos, G., … Varshney, R. K. (2017). Genomic selection in plant breeding: Methods, models, and perspectives. Trends in Plant Science, 22(11), 961-975. https://doi.org/10.1016/j.tplants.2017.08.011
da Costa, J. P., & Cardoso, J. S. (2005). Classification of ordinal data using neural networks. In J. Gama, R. Camacho, P. B. Brazdil, A. M. Jorge, & L. Torgo (Eds.), Machine Learning: ECML 2005: 16th European Conference on Machine Learning, Porto, Portugal, October 3-7, 2005. Proceedings (pp. 690-697). Berlin, Heidelberg: Springer-Verlag.
Foresee, D. F., & Hagan, M. T. (1997). Gauss-Newton approximation to Bayesian learning. Proceedings of International Conference on Neural Networks (ICNN’97), 3, 1930-1935. https://doi.org/10.1109/ICNN.1997.614194
Geman, S., & Geman, D. (1984). Stochastic relaxation, gibbs distributions, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI, 6(6), 721-741. https://doi.org/10.1109/TPAMI.1984.4767596
Gianola, D. (1982). Theory and analysis of threshold characters. Journal of Animal Science, 54(5), 1079-1096. https://doi.org/10.2527/jas1982.5451079x
Gianola, D., Okut, H., Weigel, K. A., & Rosa, G. J. (2011). Predicting complex quantitative traits with Bayesian neural networks: A case study with Jersey cows and wheat. BMC Genetics, 12(1), 87. https://doi.org/10.1186/1471-2156-12-87
Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1-3.
González-Camacho, J. M., de los Campos, G., Pérez, P., Gianola, D., Cairns, J. E., Mahuku, G., … Crossa, J. (2012). Genome-enabled prediction of genetic values using radial basis function neural networks. Theoretical and Applied Genetics, 125(4), 759-771. https://doi.org/10.1007/s00122-012-1868-9
González-Camacho, J. M., Ornella, L., Pérez-Rodríguez, P., Gianola, D., Dreisigacker, S., & Crossa, J. (2018). Applications of machine learning methods to genomic selection in breeding wheat for rust resistance. The Plant Genome, 11(2), 0. https://doi.org/10.3835/plantgenome2017.11.0104
Kärkkäinen, H. P., & Sillanpää, M. J. (2013). Fast Genomic Predictions via Bayesian G-BLUP and Multilocus models of threshold traits including censored Gaussian data. G3: Genes|Genomes|Genetics, 3(9), 1511-1523. https://doi.org/10.1534/g3.113.007096
Kernighan, B. W., & Ritchie, D. M. (1988). The C programming language (2nd ed). Englewood Cliffs, NJ: Prentice Hall.
Kim, S., Hall, S. D., & Li, L. (2009). A Novel Gibbs Maximum a Posteriori (GMAP) Approach on Bayesian Nonlinear Mixed-Effects Population Pharmacokinetics (PK) Models. Journal of Biopharmaceutical Statistics, 19(4), 700-720. https://doi.org/10.1080/10543400902964159
Levenberg, K. (1944). A method for the solution of certain non-linear problems in least squares. Quarterly of Applied Mathematics, 2(2), 164-168. https://doi.org/10.1090/qam/10666
Lopez-Cruz, M., Crossa, J., Bonnett, D., Dreisigacker, S., Poland, J., Jannink, J.-L., … de los Campos, G. (2015). Increased prediction accuracy in wheat breeding trials using a marker × environment interaction genomic selection model. G3: Genes|Genomes|Genetics, 5(4), 569-582. https://doi.org/10.1534/g3.114.016097
Marquardt, D. W. (1963). An algorithm for least-squares estimation of nonlinear parameters. Journal of the Society for Industrial and Applied Mathematics, 11(2), 431-441. https://doi.org/10.1137/0111030
Mathieson, M. J. (1995). Ordinal models for neural networks. Proc. 3rd Int. Conf. Neural Netw. Capital Markets, 1995, 523-536.
Meuwissen, T. H. E., Hayes, B. J. B., & Goddard, M. E. M. (2001). Prediction of total genetic value using genome-wide dense marker maps. Genetics, 157, 1819-1829.
Montesinos-López, O. A., Martín-Vallejo, J., Crossa, J., Gianola, D., Hernández-Suárez, C. M., Montesinos-López, A., … Singh, R. (2019). New deep learning genomic-based prediction model for multiple traits with binary, ordinal, and continuous phenotypes.G3:Genes|Genomes|Genetics. 5(9), 1545-1556, https://doi.org/10.1534/g3.119.300585
Montesinos-López, O. A., Montesinos-López, A., Pérez-Rodríguez, P., de los Campos, G., Eskridge, K., & Crossa, J. (2015a). Threshold models for genome-enabled prediction of ordinal categorical traits in plant breeding. G3: Genes|Genomes|Genetics, 5(2), 291-300. https://doi.org/10.1534/g3.114.016188
Montesinos-López, O. A., Montesinos-López, A., Crossa, J., Burgueño, J., & Eskridge, K. (2015b). Genomic-enabled prediction of ordinal data with bayesian logistic ordinal regression. G3: Genes|Genomes|Genetics, 5(10), 2113-2126. https://doi.org/10.1534/g3.115.021154
Neal, R. M., & Hinton, G. E. (1998). A view of the EM algorithm that justifies incremental, sparse, and other variants. In M. I. Jordan (Ed.), Learning in graphical models (pp. 355-368). Springer. https://doi.org/10.1007/978-94-011-5014-9_12
Pérez, P., & de los Campos, G. (2014). Genome-wide regression and prediction with the BGLR statistical package. Genetics, 198(2), 483-495. https://doi.org/10.1534/genetics.114.164442
Pérez-Rodríguez, P., Gianola, D., Weigel, K. A., Rosa, G. J. M., & Crossa, J. (2013). Technical note: An R package for fitting Bayesian regularized neural networks with applications in animal breeding. Journal of Animal Science, 91(8), 3522-3531. https://doi.org/10.2527/jas.2012-6162
Pérez-Rodríguez, P., Acosta-Pech, R., Pérez-Elizalde, S., Cruz, C. V., Suárez-Espinosa, J., & Crossa, J. (2018). A Bayesian genomic regression model with skew normal random errors. G3: Genes|Genomes|Genetics, 8(5), 1771-1785. https://doi.org/10.1534/g3.117.300406
Pérez-Rodríguez, P., & Gianola, D. (2020). brnn: Bayesian Regularization for Feed-Forward Neural Networks. R package version 0.8. Retrieved from https://CRAN.R-project.org/package=brnn
Pérez-Rodríguez, P., Gianola, D., González-Camacho, J. M., Crossa, J., Manès, Y., & Dreisigacker, S. (2012). Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat. G3: Genes|Genomes|Genetics, 2(12), 1595-1605. https://doi.org/10.1534/g3.112.003665
Okut, H., Gianola, D., Rosa, G. J. M., & Weigel, K. A. (2011). Prediction of body mass index in mice using dense molecular markers and a regularized neural network. Genetics Research, 93(3), 189-201. https://doi.org/10.1017/S0016672310000662
Poland, J., Endelman, J., Dawson, J., Rutkoski, J., Wu, S., Manes, Y., … Jannink, J.-L. (2012). Genomic selection in wheat breeding using genotyping-by-sequencing. The Plant Genome Journal, 5(3), 103. https://doi.org/10.3835/plantgenome2012.06.0006
R Core Team. (2019). R: A Language and Environment for Statistical Computing. R Core Team, Vienna. Retrieved from https://www.R-project.org/
Samuel, A. L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3(3), 210-229. https://doi.org/10.1147/rd.33.0210
Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103(2684), 677-680. https://doi.org/10.1126/science.103.2684.677
Stroup, W. W. (2012). Generalized linear mixed models: Modern concepts, methods and applications. Boca Raton, FL: CRC Press, Taylor & Francis Group.
Tanner, M. A. (1993). Tools for statistical inference: Methods for the exploration of posterior distributions and likelihood functions (2nd ed). New York: Springer-Verlag.
Tharwat, A. (2018). Classification assessment methods. Applied Computing and Informatics. https://doi.org/10.1016/j.aci.2018.08.003
Wang, C. L., Ding, X. D., Wang, J. Y., Liu, J. F., Fu, W. X., Zhang, Z., … Zhang, Q. (2013). Bayesian methods for estimating GEBVs of threshold traits. Heredity, 110(3), 213-219. https://doi.org/10.1038/hdy.2012.65

Auteurs

Paulino Pérez-Rodríguez (P)

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de, México.

Samuel Flores-Galarza (S)

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de, México.

Humberto Vaquera-Huerta (H)

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de, México.

David Hebert Del Valle-Paniagua (DH)

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de, México.

Osval A Montesinos-López (OA)

Facultad de Telemática, Universidad de Colima, Colima, 28040, México.

José Crossa (J)

Colegio de Postgraduados, CP 56230, Montecillos, Edo. de, México.
Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Cd. de, México.

Articles similaires

Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH