New Deep Learning Genomic-Based Prediction Model for Multiple Traits with Binary, Ordinal, and Continuous Phenotypes
Author(s) -
Osval A. Montesinos-López,
Javier Martín-Vallejo,
José Crossa,
Daniel Gianola,
Carlos Hernández-Suárez,
Abelardo Montesinos-López,
Philomin Juliana,
Ravi P. Singh
Publication year - 2019
Publication title -
G3: Genes, Genomes, Genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.468
H-Index - 66
ISSN - 2160-1836
DOI - 10.1534/g3.119.300585
Subject(s) - univariate , trait , correlation , context (archaeology) , ordinal data , binary number , predictive modelling , artificial intelligence , metric (unit) , computer science , statistics , machine learning , multivariate statistics , mathematics , biology , paleontology , operations management , geometry , arithmetic , economics , programming language
Multiple-trait experiments with mixed phenotypes (binary, ordinal and continuous) are not rare in animal and plant breeding programs. However, there is a lack of statistical models that can exploit the correlation between traits with mixed phenotypes to improve prediction accuracy in the context of genomic selection (GS). For this reason, breeders who have mixed phenotypes usually analyze them with univariate models and therefore cannot exploit the correlation between traits, which often helps improve prediction accuracy. In this paper we propose applying deep learning to the analysis of multiple traits with mixed phenotype data and assess it in terms of prediction accuracy. The prediction performance of multiple-trait deep learning with mixed phenotypes (MTDLMP) models was compared to the performance of univariate deep learning (UDL) models. Both models were evaluated using predictors with and without the genotype × environment (G×E) interaction term (I and WI, respectively). The metric used for evaluating prediction accuracy was Pearson's correlation for continuous traits and the percentage of cases correctly classified (PCCC) for binary and ordinal traits. We found that a modest gain in prediction accuracy was obtained only in the continuous trait under the MTDLMP model compared to the UDL model, whereas for the other traits (one binary and two ordinal) we did not find any difference between the two models. In both models, prediction performance was better without the interaction term (WI) than with it (I). The MTDLMP model is a good alternative for performing simultaneous predictions of mixed phenotypes (binary, ordinal and continuous) in the context of GS.
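The two accuracy metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' code. The function names `pearson_correlation` and `pccc` are hypothetical, and reporting PCCC on a 0 to 100 scale is an assumption, since the abstract does not state the scale used.

```python
import math


def pearson_correlation(observed, predicted):
    """Pearson's correlation between observed and predicted continuous values."""
    n = len(observed)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Sum of cross-products and sums of squares around the means.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    ss_o = sum((o - mean_o) ** 2 for o in observed)
    ss_p = sum((p - mean_p) ** 2 for p in predicted)
    if ss_o == 0 or ss_p == 0:
        raise ValueError("correlation is undefined when a sequence is constant")
    return cov / math.sqrt(ss_o * ss_p)


def pccc(observed, predicted):
    """Percentage of cases correctly classified (binary or ordinal labels).

    Assumption: the result is scaled to 0-100; divide by 100 for a proportion.
    """
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("need two non-empty, equal-length sequences")
    correct = sum(1 for o, p in zip(observed, predicted) if o == p)
    return 100.0 * correct / len(observed)
```

In a comparison like the one described, `pearson_correlation` would be applied to the continuous trait and `pccc` to the binary and ordinal traits, separately for each model and each predictor (I and WI).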