| Literature DB >> 27793970 |
Jaime Cuevas1, José Crossa2, Osval A Montesinos-López3, Juan Burgueño4, Paulino Pérez-Rodríguez5, Gustavo de Los Campos6.
Abstract
The phenomenon of genotype × environment (G × E) interaction in plant breeding decreases selection accuracy, thereby negatively affecting genetic gains. Several genomic prediction models incorporating G × E have been recently developed and used in genomic selection of plant breeding programs. Genomic prediction models for assessing multi-environment G × E interaction are extensions of a single-environment model, and have advantages and limitations. In this study, we propose two multi-environment Bayesian genomic models: the first model considers genetic effects [Formula: see text] that can be assessed by the Kronecker product of variance-covariance matrices of genetic correlations between environments and genomic kernels through markers under two linear kernel methods, linear (genomic best linear unbiased predictors, GBLUP) and Gaussian (Gaussian kernel, GK). The other model has the same genetic component as the first model [Formula: see text] plus an extra component, F: , that captures random effects between environments that were not captured by the random effects [Formula: see text] We used five CIMMYT data sets (one maize and four wheat) that were previously used in different studies. Results show that models with G × E always have superior prediction ability than single-environment models, and the higher prediction ability of multi-environment models with [Formula: see text] over the multi-environment model with only u occurred 85% of the time with GBLUP and 45% of the time with GK across the five data sets. The latter result indicated that including the random effect f is still beneficial for increasing prediction ability after adjusting by the random effect [Formula: see text].Entities:
Keywords: Gaussian kernel; GenPred; Shared data resource; genomic selection; kernel GBLUP; marker × environment interaction; multi-environment
Mesh:
Year: 2017 PMID: 27793970 PMCID: PMC5217122 DOI: 10.1534/g3.116.035584
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Percent change in prediction accuracy of GK vs. GBLUP for each of the three models (1)–(3), prediction accuracy of model (2) vs. model (1) for GK and GBLUP, and prediction accuracy of model (3) vs. model (2) for GK and GBLUP for each environment in each data set
| Environment | GK | Model (2) | Model (3) | ||||
|---|---|---|---|---|---|---|---|
| Model (1) | Model (2) | Model (3) | GBLUP | GK | GBLUP | GK | |
| Wheat data set 1 | |||||||
| E1 | 15 | 12 | 12 | 2 | 0 | 6 | 5 |
| E2 | 1 | 8 | −1 | 34 | 44 | 13 | 4 |
| E3 | 14 | 16 | 1 | 60 | 62 | 17 | 2 |
| E4 | 14 | 11 | 9 | 12 | 9 | 5 | 3 |
| Maize data set 2 | |||||||
| E1 | 4 | 7 | 3 | 8 | 10 | 3 | 0 |
| E2 | 7 | 2 | 1 | 12 | 7 | 1 | 0 |
| E3 | 12 | 12 | 10 | 2 | 2 | 2 | 0 |
| Wheat data set 3 | |||||||
| E1 | 5 | 4 | 2 | 14 | 12 | 2 | 1 |
| E2 | 3 | −3 | −4 | 14 | 7 | 1 | 0 |
| E3 | 16 | 8 | 8 | 12 | 5 | 1 | 1 |
| E4 | 4 | 1 | 1 | 2 | −1 | 1 | 0 |
| Wheat data set 4 | |||||||
| E1 | 2 | 22 | 2 | 6 | 27 | 20 | 1 |
| E2 | −3 | 13 | 0 | 25 | 46 | 14 | 1 |
| E3 | 3 | 4 | 1 | 15 | 17 | 4 | 0 |
| E4 | −2 | 10 | −1 | 23 | 37 | 11 | 1 |
| Wheat data set 5 | |||||||
| E1 | 9 | 9 | 9 | 4 | 4 | 0 | 0 |
| E2 | 10 | 13 | 13 | 3 | 6 | 0 | 0 |
| E3 | 9 | 9 | 9 | 0 | 0 | 0 | 0 |
| E4 | 15 | 6 | 4 | 64 | 52 | 3 | 0 |
| E5 | 6 | 4 | 0 | 85 | 81 | 3 | 0 |
GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Mean prediction accuracies for the different environments of wheat data set 1, maize data set 2, and wheat data set 3 for GBLUP and GK methods, and three models including a single-environment (model (1)) and two multi-environment models (models (2) and (3))
| Environment | GBLUP | GK | ||||
|---|---|---|---|---|---|---|
| Model (1) | Model (2) | Model (3) | Model (1) | Model (2) | Model (3) | |
| Wheat data set 1 | ||||||
| E1 | 0.500 (0.056) | 0.512 (0.043) | 0.543 (0.044) | 0.577 (0.043) | 0.575 (0.036) | |
| E2 | 0.474 (0.048) | 0.635 (0.042) | 0.477 (0.056) | 0.685 (0.030) | 0.713 (0.029) | |
| E3 | 0.370 (0.056) | 0.592 (0.045) | 0.694 (0.031) | 0.422 (0.053) | 0.685 (0.030) | |
| E4 | 0.447 (0.047) | 0.501 (0.040) | 0.525 (0.034) | 0.511 (0.044) | 0.555 (0.044) | |
| Maize data set 2 | ||||||
| E1 | 0.558 (0.038) | 0.603 (0.043) | 0.624 (0.045) | 0.583 (0.042) | 0.644 (0.037) | |
| E2 | 0.507 (0.049) | 0.567 (0.055) | 0.575 (0.054) | 0.542 (0.056) | 0.581 (0.057) | |
| E3 | 0.508 (0.051) | 0.517 (0.045) | 0.525 (0.046) | 0.568 (0.044) | 0.577 (0.044) | |
| Wheat data set 3 | ||||||
| E1 | 0.529 (0.044) | 0.603 (0.033) | 0.617 (0.031) | 0.557 (0.040) | 0.625 (0.033) | |
| E2 | 0.622 (0.045) | 0.706 (0.031) | 0.642 (0.030) | 0.688 (0.033) | 0.689 (0.034) | |
| E3 | 0.452 (0.051) | 0.506 (0.045) | 0.512 (0.043) | 0.523 (0.048) | 0.547 (0.041) | |
| E4 | 0.493 (0.046) | 0.504 (0.041) | 0.507 (0.039) | 0.508 (0.053) | 0.510 (0.052) | |
SDs are given in parentheses. The highest prediction accuracies for each environment in each data set are shown in boldface. GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Empirical phenotypic correlation between environments: E1 vs. E2= −0.019; E1 vs. E3= −0.19; E1 vs. E4= −0.12; E2 vs. E3 = 0.661; E2 vs. E4 = 0.411; E3 vs. E4 = 0.388.
Empirical phenotypic correlation between environments: E1 vs. E2 = 0.388; E1 vs. E3 = 0.262; E 2 vs. E3 = 0.153.
Empirical phenotypic correlation between environments: E1 vs. E2 = 0.527; E1 vs. E3 = 0.253; E1 vs. E4 = 0.259; E2 vs. E3 = 0.340; E2 vs. E4 = 0.328; E3 vs. E4 = 0.22.
Wheat data set 1
| Env. | Covariance Matrix | Covariance Matrix | Variance–Covariance Matrix | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| E1 | E2 | E3 | E4 | E1 | E2 | E3 | E4 | E1 | E2 | E3 | E4 | |
| GBLUP | ||||||||||||
| E1 | 0.534 | −0.123 | −0.121 | −0.235 | 0.302 | 0.074 | −0.095 | 0.063 | 0.238 | – | – | – |
| E2 | −0.243 | 0.480 | 0.388 | 0.255 | 0.207 | 0.423 | 0.300 | 0.159 | – | 0.164 | – | – |
| E3 | −0.247 | 0.834 | 0.451 | 0.283 | −0.256 | 0.682 | 0.457 | 0.114 | – | – | 0.177 | – |
| E4 | −0.483 | 0.552 | 0.632 | 0.444 | 0.236 | 0.503 | 0.347 | 0.236 | – | – | – | 0.379 |
| GK | ||||||||||||
| E1 | 0.728 | −0.159 | −0.224 | −0.219 | 0.200 | 0.118 | −0.003 | 0.094 | 0.154 | – | – | – |
| E2 | −0.221 | 0.714 | 0.666 | 0.344 | 0.483 | 0.299 | 0.126 | 0.096 | – | 0.149 | – | – |
| E3 | −0.287 | 0.860 | 0.839 | 0.438 | −0.015 | 0.499 | 0.213 | 0.003 | – | – | 0.163 | – |
| E4 | −0.311 | 0.493 | 0.579 | 0.683 | 0.460 | 0.384 | 0.014 | 0.209 | – | – | – | 0.220 |
Empirical phenotypic correlation between environments: E1 vs. E2 = −0.019; E1 vs. E3 = −0.19; E1 vs. E4 = −0.12; E2 vs. E3 = 0.661; E2 vs. E4 = 0.411; E3 vs. E4 = 0.388. Variance–covariance matrix (upper triangular) and correlation matrix (lower triangular) for random effects u, f, and variance matrix for random errors of multi-environment model (3) including four environments (E1–E4) for linear kernel GBLUP and nonlinear Gaussian kernel (GK). Pair-wise sample phenotypic correlations between environments are given above. Env., environment; GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Maize data set 2
| Env. | Covariance Matrix | Covariance Matrix | Variance–Covariance Matrix | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 1 | 2 | 3 | 1 | 2 | 3 | |
| GBLUP | |||||||||
| E1 | 0.442 | 0.275 | 0.117 | 0.268 | 0.101 | 0.094 | 0.221 | – | – |
| E2 | 0.524 | 0.622 | −0.001 | 0.415 | 0.221 | 0.025 | – | 0.226 | 0.000 |
| E3 | 0.255 | −0.002 | 0.475 | 0.369 | 0.108 | 0.242 | – | – | 0.288 |
| GK | |||||||||
| E1 | 0.620 | 0.319 | 0.204 | 0.140 | 0.022 | 0.016 | 0.161 | – | – |
| E2 | 0.468 | 0.748 | 0.030 | 0.167 | 0.124 | 0.015 | – | 0.147 | 0.000 |
| E3 | 0.318 | 0.043 | 0.663 | 0.116 | 0.116 | 0.136 | — | — | 0.171 |
Empirical phenotypic correlation: Sample phenotypic correlations: E1vsE2 = 0.388; E1 vs. E3 = 0.262; E2 vs. E3 = 0.153. Variance–covariance matrix (upper triangular) and correlation matrix (lower triangular) for random effects u, f, and variance matrix for random errors of multi-environment model (3) including three environments (E1–E3) for linear kernel GBLUP and nonlinear Gaussian kernel (GK). Pair-wise sample phenotypic correlations between environments are given above. Env., environment; GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Wheat data set 3
| Env. | Covariance Matrix | Covariance Matrix | Variance–Covariance Matrix | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | |
| GBLUP | ||||||||||||
| E1 | 0.403 | 0.368 | 0.129 | 0.169 | 0.254 | 0.086 | 0.087 | 0.017 | 0.281 | – | – | – |
| E2 | 0.729 | 0.632 | 0.329 | 0.204 | 0.403 | 0.179 | 0.036 | 0.067 | – | 0.184 | – | – |
| E3 | 0.273 | 0.555 | 0.556 | 0.128 | 0.362 | 0.178 | 0.228 | 0.033 | – | – | 0.322 | – |
| E4 | 0.448 | 0.432 | 0.289 | 0.353 | 0.070 | 0.327 | 0.143 | 0.234 | – | – | – | 0.366 |
| GK | ||||||||||||
| E1 | 0.693 | 0.453 | 0.175 | 0.191 | 0.132 | −0.014 | 0.033 | −0.023 | 0.145 | – | – | – |
| E2 | 0.638 | 0.727 | 0.302 | 0.248 | −0.122 | 0.099 | 0.004 | 0.012 | – | 0.123 | – | – |
| E3 | 0.229 | 0.386 | 0.841 | 0.171 | 0.267 | 0.037 | 0.116 | −0.004 | – | – | 0.126 | – |
| E4 | 0.269 | 0.342 | 0.219 | 0.725 | −0.168 | 0.101 | −0.031 | 0.142 | – | – | – | 0.163 |
Empirical phenotypic correlation: E1 vs. E2 = 0.527; E1 vs. E3 = 0.253; E1 vs. E4 = 0.259; E2 vs. E3 = 0.340; E2 vs. E4 = 0.328; E3 vs. E4 = 0.220. Variance–covariance matrix (upper triangular) and correlation matrix (lower triangular) for random effects u, f, and variance matrix for random errors of multi-environment model (3) including four environments (E1–E4) for linear kernel GBLUP and nonlinear Gaussian kernel (GK). Pair-wise sample phenotypic correlations between environments are given above. Env., environment; GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Mean prediction accuracies for the different environments of wheat data sets 4 and 5 for GBLUP and GK methods, and three models including a single-environment (model (1)) and two multi-environment models (models (2) and (3))
| GBLUP | GK | |||||
|---|---|---|---|---|---|---|
| Environment | Model (1) | Model (2) | Model (3) | Model (1) | Model (2) | Model (3) |
| Wheat data set 4 | ||||||
| E1 | 0.473 (0.052) | 0.501 (0.041) | 0.601 (0.033) | 0.482 (0.040) | 0.612 (0.041) | |
| E2 | 0.414 (0.063) | 0.517 (0.049) | 0.401 (0.051) | 0.584 (0.047) | 0.587 (0.044) | |
| E3 | 0.510 (0.052) | 0.588 (0.044) | 0.524 (0.039) | |||
| E4 | 0.448 (0.054) | 0.550 (0.037) | 0.440 (0.045) | 0.603 (0.045) | 0.607 (0.044) | |
| Wheat data set 5 | ||||||
| E1 | 0.561 (0.035) | 0.585 (0.036) | 0.583 (0.036) | 0.614 (0.038) | ||
| E2 | 0.445 (0.051) | 0.457 (0.040) | 0.458 (0.040) | 0.488 (0.046) | 0.517 (0.037) | |
| E3 | 0.628 (0.037) | 0.630 (0.027) | 0.632 (0.026) | 0.687 (0.026) | ||
| E4 | 0.360 (0.046) | 0.592 (0.042) | 0.608 (0.040) | 0.415 (0.043) | ||
| E5 | 0.312 (0.055) | 0.576 (0.036) | 0.596 (0.035) | 0.330 (0.047) | ||
SDs are given in parentheses. The highest prediction accuracies for each environment in each data set are shown in boldface. GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Empirical phenotypic correlation between environments: E1 vs. E2 = 0.342; E1 vs. E3= –0.054; E1 vs. E4 = 0.311; E2 vs. E3 = 0.328; E2 vs. E4 = 0.414; E3 vs. E4 = 0.223.
Empirical phenotypic correlation between environments: E1 vs. E2 = 0.166; E1 vs. E3 = 0.30; E1 vs. E4= –0.10; E1 vs. E5= –0.010; E2 vs. E3= –0.033; E2 vs. E4 = 0.122; E2 vs. E5 = 0.035; E3 vs. E4= –0.091; E3 vs. E5 = 0.023; E4 vs. E5 = 0.546.
Comparison of prediction accuracy of multi-environment GBLUP and GK model (3) with various other models published in refereed journals for the five data sets utilized in this study
| GBLUP | GK | |||
|---|---|---|---|---|
| Wheat Data Set 1 | FA | Model (3) | EB-G × E | Model (3) |
| E1 | 0.553 | 0.543 | 0.458 | |
| E2 | 0.611 | 0.644 | 0.713 | |
| E3 | 0.585 | 0.586 | ||
| E4 | 0.51 | 0.525 | 0.543 | |
| GBLUP | GK | |||
| Maize Data Set 2 | EB-G × E | Model (2) | EB-G × E | Model (3) |
| E1 | 0.618 | 0.624 | 0.630 | |
| E2 | 0.547 | 0.575 | 0.566 | |
| E3 | 0.519 | 0.525 | 0.556 | |
| GBLUP | GK | |||
| Wheat Data Set 3 | GBLUP-ME | Model (3) | Model (3) | |
| E1 | 0.591 | 0.617 | ||
| E2 | 0.697 | 0.689 | ||
| E3 | 0.505 | 0.512 | ||
| E4 | 0.507 | 0.51 | ||
| GBLUP | GK | |||
| Wheat Data Set 4 | GBLUP-ME | Model (3) | Model (3) | |
| E1 | 0.513 | 0.601 | ||
| E2 | 0.536 | 0.587 | ||
| E3 | 0.531 | 0.609 | ||
| E4 | 0.561 | 0.607 | ||
| GBLUP | GK | |||
| Wheat Data Set 5 | GBLUP-ME | Model (3) | ||
| E1 | 0.575 | 0.583 | ||
| E2 | 0.466 | 0.458 | ||
| E3 | 0.629 | 0.632 | ||
| E4 | 0.402 | 0.608 | ||
| E5 | 0.376 | 0.596 | ||
FA (Factor Analytic) model, Burgueño ; EB (Empirical Bayes)-G × E, Cuevas ; GBLUP-ME, López-Cruz ; The highest correlations in each row are in boldface. GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Wheat data set 4
| Env. | Covariance Matrix | Covariance Matrix | Variance–Covariance Matrix | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 4 | |
| GBLUP | ||||||||||||
| E1 | 0.483 | 0.111 | 0.004 | 0.133 | 0.409 | 0.230 | −0.055 | 0.215 | 0.175 | – | – | – |
| E2 | 0.234 | 0.467 | 0.291 | 0.246 | 0.585 | 0.378 | 0.091 | 0.199 | – | 0.242 | – | – |
| E3 | 0.008 | 0.560 | 0.578 | 0.292 | −0.163 | 0.280 | 0.279 | 0.105 | – | – | 0.230 | – |
| E4 | 0.304 | 0.572 | 0.610 | 0.396 | 0.541 | 0.521 | 0.320 | 0.386 | – | – | – | 0.243 |
| GK | ||||||||||||
| E1 | 0.968 | 0.353 | −0.042 | 0.378 | 0.142 | 0.084 | −0.021 | 0.059 | 0.111 | – | – | – |
| E2 | 0.395 | 0.826 | 0.417 | 0.476 | 0.502 | 0.197 | −0.007 | 0.045 | – | 0.176 | – | – |
| E3 | −0.044 | 0.470 | 0.952 | 0.439 | −0.170 | −0.048 | 0.107 | 0.009 | – | – | 0.129 | – |
| E4 | 0.429 | 0.584 | 0.502 | 0.803 | 0.383 | 0.248 | 0.067 | 0.167 | – | – | – | 0.192 |
Empirical phenotypic correlation: E1 vs. E2 = 0.342; E1 vs. E3 = −0.054; E1 vs. E4 = 0.311; E2 vs. E3 = 0.328; E2 vs. E4 = 0.414; E3 vs. E4 = 0.223. Variance–covariance matrix (upper triangular) and correlation matrix (lower triangular) for random effects u, f, and variance matrix for random errors of multi-environment model (3) including four environments (E1–E4) for linear kernel GBLUP and nonlinear Gaussian kernel (GK). Pair-wise sample phenotypic correlations between environments are above. Env., environment; GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.
Wheat data set 5
| Env. | Covariance Matrix | Covariance Matrix | Variance–Covariance Matrix | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 1 | 2 | 3 | 4 | 5 | 1 | 2 | 3 | 4 | 5 | |
| GBLUP | |||||||||||||||
| E1 | 0.626 | 0.161 | 0.186 | −0.039 | 0.017 | 0.141 | 0.035 | 0.010 | −0.016 | 0.008 | 0.193 | – | – | – | – |
| E2 | 0.277 | 0.538 | −0.022 | 0.092 | 0.030 | 0.227 | 0.169 | 0.006 | −0.004 | 0.016 | – | 0.277 | – | – | – |
| E3 | 0.301 | −0.038 | 0.608 | −0.099 | −0.018 | 0.078 | 0.043 | 0.116 | 0.037 | 0.061 | – | – | 0.140 | – | – |
| E4 | −0.063 | 0.161 | −0.163 | 0.605 | 0.405 | −0.086 | −0.020 | 0.219 | 0.247 | 0.180 | – | – | – | 0.189 | – |
| E5 | 0.029 | 0.055 | −0.031 | 0.705 | 0.546 | 0.038 | 0.070 | 0.323 | 0.653 | 0.308 | – | – | – | – | 0.201 |
| GK | |||||||||||||||
| E1 | 0.849 | 0.201 | 0.261 | −0.094 | 0.010 | 0.083 | 0.014 | 0.003 | −0.009 | 0.005 | 0.100 | – | – | – | – |
| E2 | 0.232 | 0.887 | 0.014 | 0.095 | 0.032 | 0.150 | 0.105 | 0.006 | −0.006 | 0.013 | – | 0.136 | – | – | – |
| E3 | 0.325 | 0.017 | 0.759 | −0.031 | 0.060 | 0.040 | 0.071 | 0.068 | 0.012 | 0.014 | – | – | 0.082 | – | – |
| E4 | −0.105 | 0.104 | −0.037 | 0.949 | 0.677 | −0.095 | −0.056 | 0.139 | 0.109 | 0.031 | – | – | – | 0.135 | – |
| E5 | 0.011 | 0.035 | 0.072 | 0.722 | 0.927 | 0.047 | 0.108 | 0.145 | 0.254 | 0.137 | – | – | – | – | 0.174 |
Empirical phenotypic correlation: E1 vs. E2 = 0.166; E1 vs. E3 = 0.30; E1 vs. E4 = −0.10; E1 vs. E5 = −0.010; E2 vs. E3 = −0.033; E2 vs. E4 = 0.122; E2 vs. E5 = 0.035; E3 vs. E4 = −0.091; E3 vs. E5 = 0.023; E4 vs. E5 = 0.546. Variance–covariance matrix (upper triangular) and correlation matrix (lower triangular) for random effects u, f, and variance matrix for random errors of multi-environment model (3) including five environments (E1–E5) for linear kernel GBLUP and nonlinear Gaussian kernel (GK). Pair-wise sample phenotypic correlations between environments are given above. Env., environment; GBLUP, genomic best linear unbiased predictors: GK, Gaussian kernel.