| Literature DB >> 28391241 |
Osval A Montesinos-López1, Abelardo Montesinos-López2, José Crossa3, José Cricelio Montesinos-López4, Francisco Javier Luna-Vázquez1, Josafhat Salinas-Ruiz5, José R Herrera-Morales1, Raymundo Buenrostro-Mariscal1.
Abstract
There are Bayesian and non-Bayesian genomic models that take into account G×E interactions. However, the computational cost of implementing Bayesian models is high, and becomes almost impossible when the number of genotypes, environments, and traits is very large, while, in non-Bayesian models, there are often important and unsolved convergence problems. The variational Bayes method is popular in machine learning, and, by approximating the probability distributions through optimization, it tends to be faster than Markov Chain Monte Carlo methods. For this reason, in this paper, we propose a new genomic variational Bayes version of the Bayesian genomic model with G×E using half-t priors on each standard deviation (SD) term to guarantee highly noninformative and posterior inferences that are not sensitive to the choice of hyper-parameters. We show the complete theoretical derivation of the full conditional and the variational posterior distributions, and their implementations. We used eight experimental genomic maize and wheat data sets to illustrate the new proposed variational Bayes approximation, and compared its predictions and implementation time with a standard Bayesian genomic model with G×E. Results indicated that prediction accuracies are slightly higher in the standard Bayesian model with G×E than in its variational counterpart, but, in terms of computation time, the variational Bayes genomic model with G×E is, in general, 10 times faster than the conventional Bayesian genomic model with G×E. For this reason, the proposed model may be a useful tool for researchers who need to predict and select genotypes in several environments.Entities:
Keywords: GenPred; Genomic Selection; Shared Data Resources; genome-enabled prediction; multi-environment; variational Bayes
Mesh:
Year: 2017 PMID: 28391241 PMCID: PMC5473762 DOI: 10.1534/g3.117.041202
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Parameter estimates, implementation time and Pearson correlation between the observed and predicted values for the complete data sets under the BME and VBME models
| Model | Data Set | Parameter | Correlation | Time | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| BME | Maize_GY | Mean | 6.589 | 4.910 | 6.167 | — | 2.127 | 1.970 | 0.428 | 0.879 | 48.9 |
| SD | 0.317 | 0.280 | 0.294 | — | 0.467 | 0.409 | 0.026 | — | — | ||
| Maize_ASI | Mean | 1.837 | 1.168 | 2.222 | — | 2.909 | 1.290 | 0.436 | 0.855 | 50.88 | |
| SD | 0.361 | 0.340 | 0.352 | — | 0.535 | 0.326 | 0.026 | — | — | ||
| Maize_PH | Mean | 2.330 | 2.021 | 2.343 | — | 0.007 | 0.002 | 0.020 | 0.745 | 49.68 | |
| SD | 0.044 | 0.043 | 0.043 | — | 0.009 | 0.003 | 0.002 | — | — | ||
| Wheat_DTHD | Mean | −3.155 | −4.018 | −0.298 | — | 26.381 | 8.015 | 0.403 | 0.999 | 64.62 | |
| SD | 0.270 | 0.268 | 0.276 | — | 2.656 | 0.545 | 0.029 | — | — | ||
| Wheat_PTHT | Mean | −4.652 | −7.402 | −0.776 | — | 8.864 | 25.702 | 0.403 | 0.999 | 31.14 | |
| SD | 0.269 | 0.262 | 0.267 | — | 1.725 | 1.687 | 0.029 | — | — | ||
| Wheat_Yield1 | Mean | 0.000 | 0.000 | 0.000 | 0.000 | 0.217 | 0.338 | 0.555 | 0.801 | 26.34 | |
| SD | 0.536 | 0.536 | 0.538 | 0.536 | 0.049 | 0.042 | 0.023 | — | — | ||
| Wheat_Yield2 | Mean | 0.170 | 0.202 | 0.073 | 0.082 | 0.295 | 0.230 | 0.486 | 0.816 | 33.36 | |
| SD | 0.476 | 0.475 | 0.476 | 0.477 | 0.039 | 0.027 | 0.018 | — | — | ||
| Wheat_Yield3 | Mean | 0.053 | −0.072 | −0.107 | −0.079 | 0.402 | 0.244 | 0.520 | 0.810 | 31.98 | |
| SD | 0.587 | 0.585 | 0.584 | 0.586 | 0.055 | 0.032 | 0.021 | — | — | ||
| VBME | Maize_GY | Mean | 6.391 | 5.017 | 6.113 | — | 1.485 | 0.301 | 0.576 | 0.772 | 5.64 |
| SD | 0.043 | 0.043 | 0.043 | — | 0.121 | 0.013 | 0.027 | — | — | ||
| Maize_ASI | Mean | 1.858 | 1.143 | 2.364 | — | 1.825 | 0.282 | 0.503 | 0.781 | 5.64 | |
| SD | 0.040 | 0.040 | 0.040 | — | 0.148 | 0.013 | 0.023 | — | — | ||
| Maize_PH | Mean | 2.337 | 2.051 | 2.333 | — | 0.011 | 0.011 | 0.014 | 0.859 | 3.18 | |
| SD | 0.007 | 0.007 | 0.007 | — | 0.0009 | 0.0005 | 0.0006 | — | |||
| Wheat_DTHD | Mean | −3.243 | −4.080 | −0.321 | — | 25.063 | 0.451 | 6.161 | 0.943 | 1.2 | |
| SD | 0.161 | 0.161 | 0.161 | — | 2.274 | 0.003 | 0.337 | — | — | ||
| Wheat_PTHT | Mean | −4.594 | −7.459 | −0.611 | — | 6.796 | 2.937 | 17.009 | 0.846 | 1.2 | |
| SD | 0.260 | 0.260 | 0.260 | — | 0.627 | 0.132 | 0.893 | — | — | ||
| Wheat_Yield1 | Mean | 0.000 | 0.000 | 0.000 | 0.000 | 0.268 | 0.263 | 0.604 | 0.784 | 15.12 | |
| SD | 0.044 | 0.043 | 0.044 | 0.067 | 0.013 | 0.002 | 0.029 | — | — | ||
| Wheat_Yield2 | Mean | 0.172 | 0.193 | 0.091 | 0.091 | 0.250 | 0.118 | 0.542 | 0.785 | 38.94 | |
| SD | 0.027 | 0.027 | 0.027 | 0.027 | 0.014 | 0.003 | 0.015 | — | — | ||
| Wheat_Yield3 | Mean | 0.045 | −0.062 | −0.092 | −0.075 | 0.417 | 0.190 | 0.561 | 0.797 | 25.86 | |
| SD | 0.028 | 0.028 | 0.028 | 0.028 | 0.023 | 0.005 | 0.015 | — | — |
The implementation time is given in minutes. Mean and SD under the BME were obtained as the Mean and SD posteriors, and, under the VBME, the Mean and SD were obtained as the mean and SD of the variational posterior.
Relative comparison using the full data sets between the BME and VBME calculated as parameter estimate under BME minus parameter estimate under VBME divided by parameter estimate under BME
| Data Set | Corr | Time | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Maize_GY | 0.030 | −0.022 | 0.009 | — | 0.302 | 0.847 | −0.346 | 0.122 | 0.885 |
| Maize_ASI | −0.012 | 0.022 | −0.064 | — | 0.372 | 0.781 | −0.152 | 0.087 | 0.890 |
| Maize_PH | −0.003 | −0.015 | 0.004 | — | −5.289 | 0.330 | −0.153 | 0.936 | |
| Wheat_DTHD | −0.028 | −0.015 | −0.075 | — | 0.050 | 0.944 | −14.272 | 0.056 | |
| Wheat_PTHT | 0.012 | −0.008 | 0.213 | — | 0.233 | 0.886 | −41.243 | 0.961 | |
| Wheat_Yield1 | −0.009 | −0.068 | −0.029 | −0.235 | 0.222 | −0.087 | 0.022 | 0.426 | |
| Wheat_Yield2 | −0.010 | 0.047 | −0.110 | 0.153 | 0.488 | −0.115 | 0.038 | ||
| Wheat_Yield3 | 0.160 | 0.148 | 0.144 | 0.055 | 0.221 | −0.079 | 0.190 |
The smallest and largest differences in β coefficients, variance components, Pearson correlation and implementation time are in boldface.
Prediction accuracy of the BME and VBME models resulting from the 20 trn-tst random partitions implemented
| Env1 | Env2 | Env3 | Env4 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Model | Data set | APC | SE | APC | SE | APC | SE | APC | SE |
| BME | Maize_GY | 0.319 | 0.041 | 0.378 | 0.032 | 0.362 | 0.040 | — | — |
| Maize_ASI | 0.466 | 0.027 | 0.380 | 0.045 | 0.285 | 0.048 | — | — | |
| Maize_PH | 0.391 | 0.024 | 0.352 | 0.029 | 0.585 | 0.024 | — | — | |
| Wheat_DTHD | 0.905 | 0.011 | 0.835 | 0.012 | 0.897 | 0.008 | — | — | |
| Wheat_PTHT | 0.523 | 0.020 | 0.347 | 0.027 | 0.499 | 0.026 | — | — | |
| Wheat_Yield1 | 0.191 | 0.013 | 0.644 | 0.013 | 0.593 | 0.007 | 0.556 | 0.000 | |
| Wheat_Yield2 | 0.690 | 0.015 | 0.785 | 0.010 | 0.617 | 0.021 | 0.637 | 0.020 | |
| Wheat_Yield3 | 0.627 | 0.030 | 0.711 | 0.019 | 0.669 | 0.024 | 0.719 | 0.022 | |
| VBME | Maize_GY | 0.341 | 0.025 | 0.383 | 0.027 | 0.283 | 0.021 | — | — |
| Maize_ASI | 0.473 | 0.022 | 0.427 | 0.027 | 0.369 | 0.025 | — | — | |
| Maize_PH | 0.313 | 0.032 | 0.380 | 0.029 | 0.389 | 0.027 | — | — | |
| Wheat_DTHD | 0.920 | 0.010 | 0.860 | 0.012 | 0.908 | 0.008 | — | — | |
| Wheat_PTHT | 0.425 | 0.034 | 0.439 | 0.040 | 0.398 | 0.035 | — | — | |
| Wheat_Yield1 | 0.455 | 0.015 | 0.527 | 0.012 | 0.434 | 0.010 | 0.262 | 0.006 | |
| Wheat_Yield2 | 0.661 | 0.007 | 0.706 | 0.007 | 0.557 | 0.008 | 0.598 | 0.003 | |
| Wheat_Yield3 | 0.432 | 0.012 | 0.490 | 0.011 | 0.480 | 0.011 | 0.514 | 0.010 | |
APC, average of Pearson’s correlation; SE, standard error.
Relative comparison of the prediction accuracies of the BME and VBME models calculated as APC under BME minus APC VBME divided by the APC under BME
| Data set | Env1 | Env2 | Env3 | Env4 |
|---|---|---|---|---|
| Maize_GY | −0.069 | −0.015 | 0.217 | — |
| Maize_ASI | −0.015 | −0.121 | −0.298 | — |
| Maize_PH | 0.198 | −0.079 | 0.334 | — |
| Wheat_DTHD | −0.017 | −0.030 | — | |
| Wheat_PTHT | 0.187 | −0.266 | 0.203 | — |
| Wheat_Yield1 | 0.182 | 0.268 | ||
| Wheat_Yield2 | 0.101 | 0.097 | 0.061 | |
| Wheat_Yield3 | 0.311 | 0.311 | 0.282 | 0.285 |
| Maize_GY | 0 | 0 | 1 | — |
| Maize_ASI | 0 | 0 | 0 | — |
| Maize_PH | 1 | 0 | 1 | — |
| Wheat_DTHD | 0 | 0 | 0 | — |
| Wheat_PTHT | 1 | 0 | 1 | — |
| Wheat_Yield1 | 0 | 1 | 1 | 1 |
| Wheat_Yield2 | 1 | 1 | 1 | 1 |
| Wheat_Yield3 | 1 | 1 | 1 | 1 |
The smallest and largest differences in terms of prediction accuracy for both models are in boldface. In the bottom half of this table, 1 means that the best model was BME and 0 means that the best model was VBME in terms of prediction accuracy.