A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data.

GenPred Genomic selection and genomic prediction Poisson regression models Shared data resources count data of wheat lines multivariate Poisson deep neural network univariate Poisson deep neural network

Journal

G3 (Bethesda, Md.)
ISSN: 2160-1836
Titre abrégé: G3 (Bethesda)
Pays: England
ID NLM: 101566598

Informations de publication

Date de publication:
05 11 2020
Historique:
pubmed: 17 9 2020
medline: 22 6 2021
entrez: 16 9 2020
Statut: epublish

Résumé

The paradigm called genomic selection (GS) is a revolutionary way of developing new plants and animals. This is a predictive methodology, since it uses learning methods to perform its task. Unfortunately, there is no universal model that can be used for all types of predictions; for this reason, specific methodologies are required for each type of output (response variables). Since there is a lack of efficient methodologies for multivariate count data outcomes, in this paper, a multivariate Poisson deep neural network (MPDN) model is proposed for the genomic prediction of various count outcomes simultaneously. The MPDN model uses the minus log-likelihood of a Poisson distribution as a loss function, in hidden layers for capturing nonlinear patterns using the rectified linear unit (RELU) activation function and, in the output layer, the exponential activation function was used for producing outputs on the same scale of counts. The proposed MPDN model was compared to conventional generalized Poisson regression models and univariate Poisson deep learning models in two experimental data sets of count data. We found that the proposed MPDL outperformed univariate Poisson deep neural network models, but did not outperform, in terms of prediction, the univariate generalized Poisson regression models. All deep learning models were implemented in Tensorflow as back-end and Keras as front-end, which allows implementing these models on moderate and large data sets, which is a significant advantage over previous GS models for multivariate count data.

Identifiants

pubmed: 32934019
pii: g3.120.401631
doi: 10.1534/g3.120.401631
pmc: PMC7642922
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

4177-4190

Informations de copyright

Copyright © 2020 Montesinos-Lopez et al.

Références

J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
G3 (Bethesda). 2019 Feb 7;9(2):601-618
pubmed: 30593512
Theor Appl Genet. 2019 Jul;132(7):1943-1952
pubmed: 30888431
G3 (Bethesda). 2018 Dec 10;8(12):3813-3828
pubmed: 30291107
Front Plant Sci. 2016 Nov 22;7:1666
pubmed: 27920780
Trends Plant Sci. 2017 Nov;22(11):961-975
pubmed: 28965742
Genetics. 2006 Jul;173(3):1761-76
pubmed: 16648593
Genetics. 2001 Apr;157(4):1819-29
pubmed: 11290733
G3 (Bethesda). 2019 May 7;9(5):1545-1556
pubmed: 30858235
G3 (Bethesda). 2017 May 5;7(5):1595-1606
pubmed: 28364037
Plant Genome. 2017 Nov;10(3):
pubmed: 29293806
G3 (Bethesda). 2016 May 03;6(5):1165-77
pubmed: 26921298
Gigascience. 2017 Oct 1;6(10):1-10
pubmed: 29020747
Front Artif Intell. 2020 Feb 28;3:4
pubmed: 33733124
Mol Plant. 2019 Mar 4;12(3):390-401
pubmed: 30625380
Annu Rev Anim Biosci. 2013 Jan;1:221-37
pubmed: 25387018
G3 (Bethesda). 2018 Dec 10;8(12):3829-3840
pubmed: 30291108
Front Genet. 2018 Mar 06;9:78
pubmed: 29559995
Plant Genome. 2017 Mar;10(1):
pubmed: 28464061
Front Genet. 2020 Feb 06;11:25
pubmed: 32117441
Proc Natl Acad Sci U S A. 2013 May 14;110(20):8057-62
pubmed: 23630259
Heredity (Edinb). 2014 Jan;112(1):48-60
pubmed: 23572121
Anim Genet. 2016 Feb;47(1):3-11
pubmed: 26490440
Planta. 2018 Nov;248(5):1307-1318
pubmed: 30101399
J Am Med Inform Assoc. 2012 Jan-Feb;19(1):39-44
pubmed: 21715429

Auteurs

Osval Antonio Montesinos-López (OA)

Facultad de Telemática, Universidad de Colima, Colima, 28040, México.

José Cricelio Montesinos-López (JC)

Departamento de Estadística, Centro de Investigación en Matemáticas (CIMAT), Guanajuato, 36023, México.

Pawan Singh (P)

Biometrics and Statistics Unit, Genetic Resources Program, International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera Mexico-Veracruz, CP 52640, Mexico.

Nerida Lozano-Ramirez (N)

Biometrics and Statistics Unit, Genetic Resources Program, International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera Mexico-Veracruz, CP 52640, Mexico.

Alberto Barrón-López (A)

Department of Animal Production (DPA), Universidad Nacional Agraria La Molina, Av. La Molina s/n La Molina, 15024, Lima, Perú.

Abelardo Montesinos-López (A)

Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, 44430, Jalisco, México aml_uach2004@hotmail.com j.crossa@cgiar.org.

José Crossa (J)

Biometrics and Statistics Unit, Genetic Resources Program, International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera Mexico-Veracruz, CP 52640, Mexico aml_uach2004@hotmail.com j.crossa@cgiar.org.
Colegio de Post-Graduados, Montecillos Texcoco. Edo. de Mexico.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Coal Metagenome Phylogeny Bacteria Genome, Bacterial

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking
Humans Shoulder Fractures Tomography, X-Ray Computed Neural Networks, Computer Female

Classifications MeSH