A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data.
GenPred
Genomic selection and genomic prediction
Poisson regression models
Shared data resources
count data of wheat lines
multivariate Poisson deep neural network
univariate Poisson deep neural network
Journal
G3 (Bethesda, Md.)
ISSN: 2160-1836
Titre abrégé: G3 (Bethesda)
Pays: England
ID NLM: 101566598
Informations de publication
Date de publication:
05 11 2020
05 11 2020
Historique:
pubmed:
17
9
2020
medline:
22
6
2021
entrez:
16
9
2020
Statut:
epublish
Résumé
The paradigm called genomic selection (GS) is a revolutionary way of developing new plants and animals. This is a predictive methodology, since it uses learning methods to perform its task. Unfortunately, there is no universal model that can be used for all types of predictions; for this reason, specific methodologies are required for each type of output (response variables). Since there is a lack of efficient methodologies for multivariate count data outcomes, in this paper, a multivariate Poisson deep neural network (MPDN) model is proposed for the genomic prediction of various count outcomes simultaneously. The MPDN model uses the minus log-likelihood of a Poisson distribution as a loss function, in hidden layers for capturing nonlinear patterns using the rectified linear unit (RELU) activation function and, in the output layer, the exponential activation function was used for producing outputs on the same scale of counts. The proposed MPDN model was compared to conventional generalized Poisson regression models and univariate Poisson deep learning models in two experimental data sets of count data. We found that the proposed MPDL outperformed univariate Poisson deep neural network models, but did not outperform, in terms of prediction, the univariate generalized Poisson regression models. All deep learning models were implemented in Tensorflow as back-end and Keras as front-end, which allows implementing these models on moderate and large data sets, which is a significant advantage over previous GS models for multivariate count data.
Identifiants
pubmed: 32934019
pii: g3.120.401631
doi: 10.1534/g3.120.401631
pmc: PMC7642922
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
4177-4190Informations de copyright
Copyright © 2020 Montesinos-Lopez et al.
Références
J Stat Softw. 2010;33(1):1-22
pubmed: 20808728
G3 (Bethesda). 2019 Feb 7;9(2):601-618
pubmed: 30593512
Theor Appl Genet. 2019 Jul;132(7):1943-1952
pubmed: 30888431
G3 (Bethesda). 2018 Dec 10;8(12):3813-3828
pubmed: 30291107
Front Plant Sci. 2016 Nov 22;7:1666
pubmed: 27920780
Trends Plant Sci. 2017 Nov;22(11):961-975
pubmed: 28965742
Genetics. 2006 Jul;173(3):1761-76
pubmed: 16648593
Genetics. 2001 Apr;157(4):1819-29
pubmed: 11290733
G3 (Bethesda). 2019 May 7;9(5):1545-1556
pubmed: 30858235
G3 (Bethesda). 2017 May 5;7(5):1595-1606
pubmed: 28364037
Plant Genome. 2017 Nov;10(3):
pubmed: 29293806
G3 (Bethesda). 2016 May 03;6(5):1165-77
pubmed: 26921298
Gigascience. 2017 Oct 1;6(10):1-10
pubmed: 29020747
Front Artif Intell. 2020 Feb 28;3:4
pubmed: 33733124
Mol Plant. 2019 Mar 4;12(3):390-401
pubmed: 30625380
Annu Rev Anim Biosci. 2013 Jan;1:221-37
pubmed: 25387018
G3 (Bethesda). 2018 Dec 10;8(12):3829-3840
pubmed: 30291108
Front Genet. 2018 Mar 06;9:78
pubmed: 29559995
Plant Genome. 2017 Mar;10(1):
pubmed: 28464061
Front Genet. 2020 Feb 06;11:25
pubmed: 32117441
Proc Natl Acad Sci U S A. 2013 May 14;110(20):8057-62
pubmed: 23630259
Heredity (Edinb). 2014 Jan;112(1):48-60
pubmed: 23572121
Anim Genet. 2016 Feb;47(1):3-11
pubmed: 26490440
Planta. 2018 Nov;248(5):1307-1318
pubmed: 30101399
J Am Med Inform Assoc. 2012 Jan-Feb;19(1):39-44
pubmed: 21715429