New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes.


Journal

G3 (Bethesda, Md.)
ISSN: 2160-1836
Titre abrégé: G3 (Bethesda)
Pays: England
ID NLM: 101566598

Informations de publication

Date de publication:
07 05 2019
Historique:
pubmed: 13 3 2019
medline: 30 11 2019
entrez: 13 3 2019
Statut: epublish

Résumé

Multiple-trait experiments with mixed phenotypes (binary, ordinal and continuous) are not rare in animal and plant breeding programs. However, there is a lack of statistical models that can exploit the correlation between traits with mixed phenotypes in order to improve prediction accuracy in the context of genomic selection (GS). For this reason, when breeders have mixed phenotypes, they usually analyze them using univariate models, and thus are not able to exploit the correlation between traits, which many times helps improve prediction accuracy. In this paper we propose applying deep learning for analyzing multiple traits with mixed phenotype data in terms of prediction accuracy. The prediction performance of multiple-trait deep learning with mixed phenotypes (MTDLMP) models was compared to the performance of univariate deep learning (UDL) models. Both models were evaluated using predictors with and without the genotype × environment (G×E) interaction term (I and WI, respectively). The metric used for evaluating prediction accuracy was Pearson's correlation for continuous traits and the percentage of cases correctly classified (PCCC) for binary and ordinal traits. We found that a modest gain in prediction accuracy was obtained only in the continuous trait under the MTDLMP model compared to the UDL model, whereas for the other traits (1 binary and 2 ordinal) we did not find any difference between the two models. In both models we observed that the prediction performance was better for WI than for I. The MTDLMP model is a good alternative for performing simultaneous predictions of mixed phenotypes (binary, ordinal and continuous) in the context of GS.

Identifiants

pubmed: 30858235
pii: g3.119.300585
doi: 10.1534/g3.119.300585
pmc: PMC6505163
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1545-1556

Informations de copyright

Copyright © 2019 Montesinos-López et al.

Références

G3 (Bethesda). 2015 Sep 15;5(11):2383-90
pubmed: 26377960
PLoS One. 2013 Apr 30;8(4):e61318
pubmed: 23646105
Comput Biol Med. 2021 Jun;133:104323
pubmed: 33934067
Nat Biotechnol. 2015 Aug;33(8):831-8
pubmed: 26213851
PLoS One. 2012;7(2):e32253
pubmed: 22389690
G3 (Bethesda). 2018 Dec 10;8(12):3813-3828
pubmed: 30291107
Nature. 2015 May 28;521(7553):436-44
pubmed: 26017442
Heredity (Edinb). 2015 Jul;115(1):29-36
pubmed: 25873147
PLoS One. 2011 May 04;6(5):e19379
pubmed: 21573248
Bioinformatics. 2007 Oct 1;23(19):2633-5
pubmed: 17586829
Neuroimage. 2017 Dec;163:115-124
pubmed: 28765056
Bull Math Biol. 1990;52(1-2):99-115; discussion 73-97
pubmed: 2185863
PLoS One. 2014 Feb 28;9(2):e90346
pubmed: 24587335
Genetics. 2012 Dec;192(4):1513-22
pubmed: 23086217
Plant Genome. 2017 Jul;10(2):
pubmed: 28724062
Theor Appl Genet. 2018 Mar;131(3):685-701
pubmed: 29198016
Genetics. 2001 Apr;157(4):1819-29
pubmed: 11290733
Genetics. 2013 Jul;194(3):573-96
pubmed: 23636739
Bioinformatics. 2016 Jun 15;32(12):i37-i43
pubmed: 27307640
G3 (Bethesda). 2016 Sep 08;6(9):2725-44
pubmed: 27342738
Mol Syst Biol. 2016 Jul 29;12(7):878
pubmed: 27474269
Plant Genome. 2018 Nov;11(3):
pubmed: 30512048
G3 (Bethesda). 2018 Dec 10;8(12):3829-3840
pubmed: 30291108

Auteurs

Osval A Montesinos-López (OA)

Facultad de Telemática.

Javier Martín-Vallejo (J)

Departamento de Estadística, Universidad de Salamanca, c/Espejo 2, Salamanca, 37007, España.

José Crossa (J)

International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Ciudad de México, México j.crossa@cgiar.org aml_uach2004@hotmail.com.

Daniel Gianola (D)

Departments of Animal Sciences, Dairy Science, and Biostatistics and Medical Informatics, University of Wisconsin-Madison, Wisconsin 53706.

Carlos M Hernández-Suárez (CM)

Facultad de Ciencias, Universidad de Colima, Colima, 28040, México.

Abelardo Montesinos-López (A)

Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, 44430, Jalisco, México j.crossa@cgiar.org aml_uach2004@hotmail.com.

Philomin Juliana (P)

International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Ciudad de México, México.

Ravi Singh (R)

International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Ciudad de México, México.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Populus Soil Microbiology Soil Microbiota Fungi

Classifications MeSH