| Literature DB >> 29097376 |
Osval A Montesinos-López1, Abelardo Montesinos-López2, José Crossa3, José C Montesinos-López4, David Mota-Sanchez5, Fermín Estrada-González6, Jussi Gillberg7, Ravi Singh8, Suchismita Mondal8, Philomin Juliana8.
Abstract
In genomic-enabled prediction, the task of improving the accuracy of the prediction of lines in environments is difficult because the available information is generally sparse and usually has low correlations between traits. In current genomic selection, although researchers have a large amount of information and appropriate statistical models to process it, there is still limited computing efficiency to do so. Although some statistical models are usually mathematically elegant, many of them are also computationally inefficient, and they are impractical for many traits, lines, environments, and years because they need to sample from huge normal multivariate distributions. For these reasons, this study explores two recommender systems: item-based collaborative filtering (IBCF) and the matrix factorization algorithm (MF) in the context of multiple traits and multiple environments. The IBCF and MF methods were compared with two conventional methods on simulated and real data. Results of the simulated and real data sets show that the IBCF technique was slightly better in terms of prediction accuracy than the two conventional methods and the MF method when the correlation was moderately high. The IBCF technique is very attractive because it produces good predictions when there is high correlation between items (environment-trait combinations) and its implementation is computationally feasible, which can be useful for plant breeders who deal with very large data sets.Entities:
Keywords: GenPred; Genomic Selection; Shared Data Resources; collaborative filtering; environment interaction; genomic information; genotype; item-based collaborative filtering; matrix factorization; multi-trait; prediction accuracy
Mesh:
Year: 2018 PMID: 29097376 PMCID: PMC5765342 DOI: 10.1534/g3.117.300309
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Rating matrix data set with four users (rows) and three items (columns), with the rating of each user given in an ordinal scale of three points
| User/Items | I1 | I2 | I3 |
|---|---|---|---|
| U1 | |||
| U2 | |||
| U3 | |||
| U4 |
? denotes missing values that need to be predicted.
Item-to-item similarity matrix constructed with information in Table 1
| I1 | I2 | I3 | |
|---|---|---|---|
| I1 | |||
| I2 | |||
| I3 |
Value represents the similarity between item j and item j´, obtained using cosine similarity.
Phenotypic information for building the rating matrix for multiple-trait and multiple-environment data for genotypes, environments, and traits
| Environment–Trait Combinations | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Genotypes | E1T1 | E1TL | E2T1 | E2TL | EIT1 | EITL | ||||
| G1 | ||||||||||
| G2 | ||||||||||
| GJ | ||||||||||
G denotes genotypes, E denotes environment, T represents trait, and is the phenotype from the th line in the th environment for the th trait.
Figure 1Simulated data. Average Pearson correlation (APC) for each environment–trait combination using the four methods under study [IBCF, matrix factorization (MF), ME, and MTME] for data simulated with a correlation of traits (genetic and residual) and correlation of environments of 0.85. S1 is the scenario under normality, S2 is the scenario under the error negative skew multiplied by 1.25, and S3 represents the scenario under the error positive skew multiplied by 1.25. The notation E1_T1 means environment 1, trait 1.
Figure 2Simulated data. Average Pearson correlation (APC) for each environment-trait combination using the four methods being studied [IBCF, matrix factorization (MF), ME, and MTME] for data simulated with a correlation of traits (genetic and residual) and correlation of environments of 0.5. S1 is the scenario under normality, S2 is the scenario under the error negative skew multiplied by 1.25, and S3 represents the scenario under the error positive skew multiplied by 1.25. The notation E1_T1 means environment 1, trait 1.
Figure 3Simulated data. Average Pearson correlation (APC) for each environment-trait combination using the four methods under study [IBCF, matrix factorization (MF), ME, and MTME] for data simulated with a correlation of traits (genetic and residual) and correlation of environments of 0.25. S1 is the scenario under normality, S2 denotes the scenario under the error negative skew multiplied by 1.25, and S3 is the scenario under the error positive skew multiplied by 1.25. The notation E1_T1 means environment 1, trait 1.
Wheat data set 1
| Env–Trait | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE |
|---|---|---|---|---|---|---|---|---|---|---|
| Bed2IR_HD | 0.773 | 0.022 | 0.817 | 0.012 | 0.835 | 0.009 | 0.858 | 0.010 | 0.858 | 0.010 |
| Bed2IR_GNDVI | 0.627 | 0.014 | 0.659 | 0.013 | 0.660 | 0.011 | 0.662 | 0.011 | 0.661 | 0.011 |
| Bed2IR_GY | 0.496 | 0.027 | 0.523 | 0.016 | 0.513 | 0.014 | 0.502 | 0.014 | 0.499 | 0.014 |
| Bed2IR_PH | 0.479 | 0.033 | 0.530 | 0.019 | 0.616 | 0.012 | 0.626 | 0.010 | 0.621 | 0.010 |
| Bed5IR_HD | 0.765 | 0.029 | 0.811 | 0.012 | 0.854 | 0.007 | 0.873 | 0.007 | 0.876 | 0.007 |
| Bed5IR_GNDVI | 0.453 | 0.042 | 0.529 | 0.016 | 0.548 | 0.015 | 0.560 | 0.013 | 0.563 | 0.013 |
| Bed5IR_GY | −0.018 | 0.027 | 0.078 | 0.027 | 0.077 | 0.030 | 0.078 | 0.030 | 0.077 | 0.031 |
| Bed5IR_PH | 0.058 | 0.053 | 0.264 | 0.031 | 0.367 | 0.020 | 0.382 | 0.022 | 0.383 | 0.023 |
| Drip_HD | 0.872 | 0.011 | 0.872 | 0.009 | 0.900 | 0.005 | 0.906 | 0.004 | 0.908 | 0.004 |
| Drip_GNDVI | 0.546 | 0.025 | 0.537 | 0.021 | 0.554 | 0.018 | 0.560 | 0.018 | 0.562 | 0.018 |
| Drip_GY | 0.209 | 0.076 | 0.466 | 0.024 | 0.472 | 0.020 | 0.462 | 0.020 | 0.461 | 0.020 |
| Drip_PH | 0.301 | 0.036 | 0.492 | 0.025 | 0.589 | 0.022 | 0.636 | 0.018 | 0.652 | 0.017 |
| Average | 0.464 | 0.033 | 0.548 | 0.019 | 0.582 | 0.015 | 0.592 | 0.015 | 0.593 | 0.015 |
| Bed2IR_HD | 0.011 | 0.864 | 0.011 | 0.859 | 0.012 | 0.856 | 0.012 | 0.856 | 0.011 | |
| Bed2IR_GNDVI | 0.012 | 0.657 | 0.012 | 0.646 | 0.012 | 0.642 | 0.013 | 0.641 | 0.013 | |
| Bed2IR_GY | 0.014 | 0.491 | 0.015 | 0.480 | 0.015 | 0.465 | 0.016 | 0.459 | 0.016 | |
| Bed2IR_PH | 0.011 | 0.612 | 0.011 | 0.604 | 0.012 | 0.601 | 0.013 | 0.597 | 0.014 | |
| Bed5IR_HD | 0.007 | 0.877 | 0.007 | 0.874 | 0.007 | 0.871 | 0.007 | 0.869 | 0.007 | |
| Bed5IR_GNDVI | 0.012 | 0.567 | 0.012 | 0.571 | 0.012 | 0.572 | 0.012 | 0.574 | 0.012 | |
| Bed5IR_GY | 0.031 | 0.076 | 0.031 | 0.075 | 0.031 | 0.076 | 0.030 | 0.078 | 0.030 | |
| Bed5IR_PH | 0.023 | 0.384 | 0.023 | 0.385 | 0.023 | 0.382 | 0.022 | 0.381 | 0.022 | |
| Drip_HD | 0.004 | 0.909 | 0.003 | 0.908 | 0.003 | 0.906 | 0.003 | 0.906 | 0.003 | |
| Drip_GNDVI | 0.017 | 0.564 | 0.017 | 0.563 | 0.017 | 0.563 | 0.017 | 0.565 | 0.017 | |
| Drip_GY | 0.020 | 0.462 | 0.020 | 0.463 | 0.021 | 0.460 | 0.021 | 0.453 | 0.022 | |
| Drip_PH | 0.016 | 0.664 | 0.016 | 0.667 | 0.016 | 0.670 | 0.015 | 0.670 | 0.015 | |
| Average | 0.015 | 0.594 | 0.015 | 0.591 | 0.015 | 0.589 | 0.015 | 0.587 | 0.015 | |
Prediction accuracies with Pearson correlation for each environment–trait (Env–Trait) combination of the matrix factorization model (MF) with different values of lambda under cross-validation scheme CV1. The best predictions of the four methods are in boldface, and the comparisons are made by row. Wheat data set obtained from Rutkoski .
Wheat data set 1
| IBCF | MF | ME | MTME | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No Markers | With Markers | No Markers | No Markers | With Markers | No Markers | With Markers | ||||||||
| Env–Trait | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE |
| Bed2IR_HD | 0.875 | 0.011 | 0.374 | 0.023 | 0.864 | 0.011 | −0.005 | 0.030 | −0.022 | 0.030 | 0.829 | 0.008 | 0.006 | |
| Bed2IR_GNDVI | 0.012 | 0.411 | 0.020 | 0.660 | 0.012 | −0.012 | 0.023 | −0.012 | 0.023 | 0.154 | 0.027 | 0.040 | 0.022 | |
| Bed2IR_GY | 0.012 | 0.353 | 0.018 | 0.495 | 0.014 | −0.014 | 0.024 | 0.043 | 0.022 | 0.181 | 0.024 | 0.098 | 0.025 | |
| Bed2IR_PH | 0.011 | 0.325 | 0.023 | 0.617 | 0.011 | −0.022 | 0.016 | 0.026 | 0.015 | 0.235 | 0.014 | 0.031 | 0.015 | |
| Bed5IR_HD | 0.873 | 0.007 | 0.390 | 0.015 | 0.877 | 0.007 | −0.006 | 0.028 | 0.024 | 0.028 | 0.866 | 0.01 | 0.008 | |
| Bed5IR_GNDVI | 0.011 | 0.249 | 0.022 | 0.565 | 0.012 | −0.008 | 0.019 | 0.005 | 0.019 | 0.001 | 0.033 | −0.003 | 0.033 | |
| Bed5IR_GY | 0.091 | 0.032 | 0.035 | 0.021 | 0.077 | 0.031 | 0.019 | 0.024 | 0.024 | 0.024 | 0.022 | 0.393 | 0.027 | |
| Bed5IR_PH | 0.410 | 0.022 | 0.345 | 0.018 | 0.383 | 0.023 | 0.063 | 0.021 | −0.013 | 0.025 | 0.012 | 0.505 | 0.015 | |
| Drip_HD | 0.920 | 0.003 | 0.464 | 0.023 | 0.909 | 0.004 | 0.018 | 0.023 | 0.006 | 0.023 | 0.917 | 0.004 | 0.003 | |
| Drip_GNDVI | 0.016 | 0.371 | 0.031 | 0.563 | 0.017 | 0.047 | 0.034 | −0.005 | 0.036 | −0.12 | 0.027 | −0.043 | 0.026 | |
| Drip_GY | 0.401 | 0.024 | 0.362 | 0.018 | 0.020 | −0.025 | 0.027 | −0.015 | 0.027 | 0.432 | 0.019 | 0.364 | 0.017 | |
| Drip_PH | 0.016 | 0.335 | 0.022 | 0.660 | 0.016 | 0.003 | 0.017 | −0.021 | 0.017 | 0.579 | 0.022 | 0.538 | 0.022 | |
| Average | 0.015 | 0.335 | 0.021 | 0.594 | 0.015 | 0.005 | 0.024 | 0.003 | 0.024 | 0.424 | 0.018 | 0.385 | 0.018 | |
Prediction accuracies with Pearson correlation for each environment–trait (Env–Trait) combination of the proposed methods for the wheat data set from Rutkoski , under cross-validation scheme CV1. The best predictions of the four methods are in boldface, and the comparisons are made by row. “No markers” means that genomic information was not used, while “with markers” means that genomic information was used. MF, matrix factorization.
Maize data set 2
| IBCF | MF | ME | MTME | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No Markers | With Markers | No Markers | No Markers | With Markers | No Markers | With Markers | ||||||||
| Env–Trait | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE | Mean | SE |
| EBU_GY | 0.232 | 0.017 | 0.214 | 0.023 | 0.209 | 0.013 | 0.236 | 0.019 | 0.345 | 0.018 | 0.233 | 0.020 | 0.019 | |
| EBU_ASI | 0.375 | 0.020 | 0.434 | 0.012 | 0.379 | 0.019 | 0.310 | 0.018 | 0.015 | 0.326 | 0.019 | 0.495 | 0.014 | |
| EBU_PH | 0.214 | 0.022 | 0.421 | 0.018 | 0.188 | 0.021 | 0.123 | 0.020 | 0.014 | 0.117 | 0.022 | 0.485 | 0.016 | |
| KAK_GY | 0.281 | 0.023 | 0.285 | 0.022 | 0.269 | 0.026 | 0.269 | 0.017 | 0.021 | 0.263 | 0.019 | 0.399 | 0.023 | |
| KAK_ASI | 0.332 | 0.016 | 0.272 | 0.018 | 0.336 | 0.018 | 0.298 | 0.020 | 0.018 | 0.316 | 0.019 | 0.412 | 0.017 | |
| KAK_PH | 0.317 | 0.023 | 0.272 | 0.023 | 0.234 | 0.024 | 0.260 | 0.019 | 0.409 | 0.020 | 0.278 | 0.019 | 0.022 | |
| KTI_GY | 0.206 | 0.016 | 0.197 | 0.018 | 0.190 | 0.018 | 0.236 | 0.015 | 0.299 | 0.018 | 0.234 | 0.018 | 0.020 | |
| KTI_ASI | 0.269 | 0.020 | 0.083 | 0.024 | 0.286 | 0.019 | 0.303 | 0.017 | 0.022 | 0.264 | 0.015 | 0.239 | 0.021 | |
| KTI_PH | 0.282 | 0.019 | 0.445 | 0.014 | 0.253 | 0.022 | 0.235 | 0.019 | 0.475 | 0.016 | 0.233 | 0.020 | 0.015 | |
| Average | 0.279 | 0.019 | 0.292 | 0.019 | 0.261 | 0.020 | 0.252 | 0.018 | 0.405 | 0.018 | 0.252 | 0.019 | 0.404 | 0.019 |
Prediction accuracies with Pearson correlation for each environment–trait (Env–Trait) combination of the proposed methods for the maize data set under cross-validation scheme CV1. The best predictions of the seven methods are in boldface, and the comparisons are made by row. “No markers” means that genomic information was not used, while “with markers” means that genomic information was used. MF, matrix factorization.
Large wheat data set 3
| IBCF | MF, | MF, | MF, | |||||
|---|---|---|---|---|---|---|---|---|
| Env–Trait | Mean | PCL | Mean | PCL | Mean | PCL | Mean | PCL |
| GY_Year_14_15 | 0.333 | 33.30 | 0.311 | 33.30 | 33.3 | 0.353 | 33.3 | |
| GY_Year_15_16_1yb | 33.80 | 0.254 | 33.80 | 0.241 | 33.8 | 0.240 | 33.8 | |
| GY_Year_15_16_2yb | 33.75 | 0.234 | 33.75 | 0.198 | 33.75 | 0.197 | 33.75 | |
| GY_Year_16_17_1yb | 28.30 | −0.178 | 28.30 | −0.178 | 28.30 | −0.179 | 28.30 | |
| GY_Year_16_17_2yb | 28.15 | 0.193 | 28.15 | 0.222 | 28.15 | 0.230 | 28.15 | |
| GY_Year_16_17_3yb | 28.25 | 0.148 | 28.25 | 0.193 | 28.25 | 0.196 | 28.25 | |
| Average | 30.93 | 0.160 | 30.93 | 0.175 | 30.93 | 0.17 | 30.93 | |
| HD_Year_14_15 | 53.65 | 0.483 | 53.65 | 0.556 | 53.65 | 0.565 | 53.65 | |
| HD_Year_15_16_1yb | 0.508 | 48.35 | 0.320 | 48.35 | 48.35 | 0.607 | 48.35 | |
| HD_Year_15_16_2yb | 0.537 | 49.45 | 0.440 | 49.45 | 49.45 | 0.563 | 49.45 | |
| HD_Year_16_17_1yb | 47.55 | 0.335 | 47.55 | 0.564 | 47.55 | 0.571 | 47.55 | |
| HD_Year_16_17_2yb | 47.70 | 0.554 | 47.70 | 0.602 | 47.7 | 0.611 | 47.7 | |
| HD_Year_16_17_3yb | 0.637 | 47.20 | 0.589 | 47.20 | 0.655 | 47.2 | 47.2 | |
| Average | 48.98 | 0.454 | 48.98 | 0.59 | 48.98 | 0.60 | 48.98 | |
| DMT_Year_14_15 | 49.50 | 0.427 | 49.50 | 0.546 | 49.50 | 0.566 | 49.50 | |
| DMT_Year_15_16_1yb | 0.491 | 39.55 | 0.510 | 39.55 | 39.55 | 0.694 | 39.55 | |
| DMT_Year_15_16_2yb | 48.85 | 0.460 | 48.85 | 0.596 | 48.85 | 0.625 | 48.85 | |
| DMT_Year_16_17_1yb | 0.527 | 37.30 | 0.270 | 37.30 | 37.30 | 0.580 | 37.30 | |
| DMT_Year_16_17_2yb | 0.566 | 40.35 | 0.494 | 40.35 | 40.35 | 40.35 | ||
| DMT_Year_16_17_3yb | 0.548 | 38.90 | 0.565 | 38.90 | 38.90 | 0.588 | 38.90 | |
| Average | 0.569 | 42.41 | 0.454 | 42.41 | 42.41 | 42.41 | ||
| PH_Year_14_15 | 0.015 | 20.30 | 20.00 | 0.044 | 20 | −0.014 | 20 | |
| PH_Year_15_16_1yb | 0.026 | 20.40 | 20.40 | −0.010 | 20.40 | 0.057 | 20.4 | |
| PH_Year_15_16_2yb | 0.044 | 21.35 | 21.35 | 0.051 | 21.35 | 0.058 | 21.35 | |
| PH_Year_16_17_1yb | 0.217 | 27.60 | 27.60 | 0.225 | 27.6 | 0.187 | 27.6 | |
| PH_Year_16_17_2yb | 29.25 | 0.229 | 29.25 | 0.228 | 29.25 | 0.197 | 29.25 | |
| PH_Year_16_17_3yb | 0.238 | 28.95 | 0.166 | 28.95 | 28.95 | 0.223 | 28.95 | |
| Average | 0.131 | 24.64 | 0.145 | 24.59 | 0.13 | 24.59 | 0.12 | 24.59 |
| Lodging_Year_14_15 | 40.70 | 0.104 | 40.70 | 0.120 | 40.70 | 0.124 | 40.70 | |
| Lodging_Year_15_16_1yb | −0.195 | 11.95 | 0.024 | 31.45 | 0.060 | 31.45 | 31.45 | |
| Lodging_Year_15_16_2yb | −0.274 | 9.65 | 0.099 | 37.25 | 0.115 | 37.25 | 37.25 | |
| Average | −0.041 | 20.77 | 0.076 | 36.467 | 0.098 | 36.47 | 36.47 | |
Prediction accuracies with Pearson correlation for all genotypes missing in the years: GY_Year_14_15, GY_Year_15_16, and GY_Year_16_17 for the wheat data set under cross-validation scheme CV2. Here, only the IBCF and MF methods were implemented. Method MF was implemented with three values of latent features (K = 2, 3, 4). Traits in 1 yr were predicted with data from 1 yr before (1yb), 2 yr before (2yb), and 3 yr before (3yb). PCL denotes the percentage of common lines in the top 2000 lines. MF, matrix factorization.
| Environments | Bed5IR_ HD | Bed5IR_ GNDVI | Bed5IR_ GY | Bed5IR_ PH | Bed2IR_ HD | Bed2IR_ GNDVI | Bed2IR_ GY | Bed2IR_ PH | Drip_ HD | Drip_G_ GNDVI | Drip_ GY | Drip_ PH |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bed5IR_HD | 1.000 | 0.698 | −0.271 | −0.120 | 0.564 | 0.461 | 0.207 | 0.341 | 0.486 | 0.192 | −0.036 | −0.223 |
| Bed5IR_GNDVI | 0.698 | 1.000 | −0.204 | −0.035 | 0.470 | 0.527 | 0.131 | 0.259 | 0.398 | 0.286 | −0.032 | −0.148 |
| Bed5IR_GY | −0.271 | −0.204 | 1.000 | 0.577 | −0.149 | −0.058 | 0.207 | 0.138 | −0.324 | −0.385 | 0.481 | 0.187 |
| Bed5IR_PH | −0.120 | −0.035 | 0.577 | 1.000 | −0.052 | 0.083 | 0.247 | 0.318 | −0.157 | −0.262 | 0.397 | 0.236 |
| Bed2IR_HD | 0.564 | 0.470 | −0.149 | −0.052 | 1.000 | 0.787 | 0.386 | 0.628 | 0.505 | 0.201 | −0.020 | −0.268 |
| Bed2IR_GNDVI | 0.461 | 0.527 | −0.058 | 0.083 | 0.787 | 1.000 | 0.493 | 0.640 | 0.401 | 0.166 | 0.100 | −0.144 |
| Bed2IR_GY | 0.207 | 0.131 | 0.207 | 0.247 | 0.386 | 0.493 | 1.000 | 0.534 | 0.147 | −0.034 | 0.192 | −0.085 |
| Bed2IR_PH | 0.341 | 0.259 | 0.138 | 0.318 | 0.628 | 0.640 | 0.534 | 1.000 | 0.213 | −0.089 | 0.277 | 0.013 |
| Drip_HD | 0.486 | 0.398 | −0.324 | −0.157 | 0.505 | 0.401 | 0.147 | 0.213 | 1.000 | 0.551 | −0.401 | −0.535 |
| Drip_G_NDVI | 0.192 | 0.286 | −0.385 | −0.262 | 0.201 | 0.166 | −0.034 | −0.089 | 0.551 | 1.000 | −0.575 | −0.298 |
| Drip_GY | −0.036 | −0.032 | 0.481 | 0.397 | −0.020 | 0.100 | 0.192 | 0.277 | −0.401 | −0.575 | 1.000 | 0.460 |
| Drip_PH | −0.223 | −0.148 | 0.187 | 0.236 | −0.268 | −0.144 | −0.085 | 0.013 | −0.535 | −0.298 | 0.460 | 1.000 |
| EBU_GY | EBU_ASI | EBU_PH | KAK_GY | KAK_ASI | KAK_PH | KTI_GY | KTI_ASI | KTI_PH | |
|---|---|---|---|---|---|---|---|---|---|
| EBU_GY | 1.000 | −0.146 | 0.197 | 0.055 | −0.029 | 0.236 | 0.120 | −0.049 | 0.107 |
| EBU_ASI | −0.146 | 1.000 | −0.134 | 0.097 | 0.133 | 0.049 | −0.013 | 0.045 | −0.057 |
| EBU_PH | 0.197 | −0.134 | 1.000 | −0.094 | 0.100 | 0.068 | 0.043 | 0.034 | 0.010 |
| KAK_GY | 0.055 | 0.097 | −0.094 | 1.000 | −0.078 | 0.351 | 0.110 | 0.047 | −0.013 |
| KAK_ASI | −0.029 | 0.133 | 0.100 | −0.078 | 1.000 | −0.138 | 0.020 | 0.188 | −0.036 |
| KAK_PH | 0.236 | 0.049 | 0.068 | 0.351 | −0.138 | 1.000 | 0.016 | 0.057 | 0.080 |
| KTI_GY | 0.120 | −0.013 | 0.043 | 0.110 | 0.020 | 0.016 | 1.000 | −0.263 | 0.429 |
| KTI_ASI | −0.049 | 0.045 | 0.034 | 0.047 | 0.188 | 0.057 | −0.263 | 1.000 | −0.318 |
| KTI_PH | 0.107 | −0.057 | 0.010 | −0.013 | −0.036 | 0.080 | 0.429 | −0.318 | 1.000 |
| HD | DMT | PH | Lodging | GY | |
|---|---|---|---|---|---|
| HD | 1.000 | 0.760 | 0.228 | −0.069 | 0.401 |
| DMT | 0.760 | 1.000 | 0.251 | 0.012 | 0.454 |
| PH | 0.228 | 0.251 | 1.000 | −0.165 | 0.386 |
| Lodging | −0.069 | 0.012 | −0.165 | 1.000 | −0.443 |
| GY | 0.401 | 0.454 | 0.386 | −0.443 | 1.000 |
| Cor = 0.85 | Cor = 0.5 | Cor = 0.25 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Scenario | Env–Trait | IBCF | MF | ME | MTME | IBCF | MF | ME | MTME | IBCF | MF | ME | MTME |
| E1_T1 | 0.004 | 0.005 | 0.034 | 0.007 | 0.008 | 0.008 | 0.039 | 0.007 | 0.010 | 0.010 | 0.042 | 0.008 | |
| E2_T1 | 0.004 | 0.006 | 0.024 | 0.010 | 0.008 | 0.009 | 0.041 | 0.011 | 0.009 | 0.011 | 0.045 | 0.010 | |
| E3_T1 | 0.005 | 0.007 | 0.021 | 0.008 | 0.008 | 0.010 | 0.041 | 0.009 | 0.010 | 0.012 | 0.049 | 0.009 | |
| S1 | E1_T2 | 0.005 | 0.006 | 0.014 | 0.008 | 0.010 | 0.010 | 0.026 | 0.008 | 0.012 | 0.012 | 0.031 | 0.008 |
| E2_T2 | 0.005 | 0.006 | 0.036 | 0.010 | 0.009 | 0.009 | 0.038 | 0.011 | 0.010 | 0.011 | 0.038 | 0.012 | |
| E3_T2 | 0.005 | 0.006 | 0.021 | 0.008 | 0.009 | 0.009 | 0.034 | 0.010 | 0.010 | 0.010 | 0.038 | 0.011 | |
| E1_T3 | 0.004 | 0.005 | 0.017 | 0.009 | 0.006 | 0.006 | 0.036 | 0.009 | 0.007 | 0.007 | 0.046 | 0.010 | |
| E2_T3 | 0.004 | 0.004 | 0.023 | 0.010 | 0.009 | 0.008 | 0.050 | 0.010 | 0.011 | 0.011 | 0.060 | 0.010 | |
| E3_T3 | 0.004 | 0.006 | 0.029 | 0.008 | 0.008 | 0.009 | 0.035 | 0.010 | 0.010 | 0.010 | 0.038 | 0.011 | |
| Average | 0.004 | 0.006 | 0.024 | 0.009 | 0.008 | 0.009 | 0.038 | 0.009 | 0.010 | 0.010 | 0.043 | 0.010 | |
| E1_T1 | 0.006 | 0.006 | 0.009 | 0.002 | 0.011 | 0.011 | 0.014 | 0.003 | 0.013 | 0.013 | 0.018 | 0.004 | |
| E2_T1 | 0.003 | 0.004 | 0.014 | 0.003 | 0.006 | 0.007 | 0.031 | 0.005 | 0.008 | 0.009 | 0.041 | 0.006 | |
| E3_T1 | 0.005 | 0.005 | 0.018 | 0.002 | 0.007 | 0.008 | 0.034 | 0.003 | 0.008 | 0.009 | 0.038 | 0.003 | |
| S2 | E1_T2 | 0.006 | 0.007 | 0.021 | 0.003 | 0.010 | 0.010 | 0.040 | 0.004 | 0.011 | 0.011 | 0.045 | 0.005 |
| E2_T2 | 0.004 | 0.004 | 0.014 | 0.003 | 0.007 | 0.007 | 0.025 | 0.005 | 0.008 | 0.009 | 0.031 | 0.007 | |
| E3_T2 | 0.006 | 0.006 | 0.023 | 0.002 | 0.010 | 0.010 | 0.047 | 0.005 | 0.011 | 0.011 | 0.052 | 0.006 | |
| E1_T3 | 0.004 | 0.005 | 0.019 | 0.003 | 0.008 | 0.008 | 0.040 | 0.004 | 0.008 | 0.009 | 0.045 | 0.005 | |
| E2_T3 | 0.005 | 0.005 | 0.014 | 0.005 | 0.009 | 0.009 | 0.031 | 0.007 | 0.010 | 0.010 | 0.034 | 0.008 | |
| E3_T3 | 0.005 | 0.005 | 0.013 | 0.003 | 0.010 | 0.010 | 0.021 | 0.005 | 0.011 | 0.011 | 0.025 | 0.006 | |
| Average | 0.005 | 0.005 | 0.016 | 0.003 | 0.009 | 0.009 | 0.031 | 0.005 | 0.010 | 0.010 | 0.037 | 0.006 | |
| E1_T1 | 0.003 | 0.004 | 0.040 | 0.009 | 0.008 | 0.008 | 0.046 | 0.009 | 0.010 | 0.010 | 0.049 | 0.010 | |
| E2_T1 | 0.003 | 0.003 | 0.051 | 0.008 | 0.006 | 0.006 | 0.068 | 0.008 | 0.007 | 0.007 | 0.067 | 0.009 | |
| E3_T1 | 0.003 | 0.004 | 0.040 | 0.012 | 0.007 | 0.007 | 0.050 | 0.014 | 0.009 | 0.009 | 0.051 | 0.014 | |
| E1_T2 | 0.004 | 0.004 | 0.043 | 0.011 | 0.007 | 0.007 | 0.060 | 0.010 | 0.009 | 0.009 | 0.056 | 0.010 | |
| S3 | E2_T2 | 0.004 | 0.004 | 0.059 | 0.014 | 0.007 | 0.007 | 0.057 | 0.014 | 0.008 | 0.008 | 0.062 | 0.012 |
| E3_T2 | 0.005 | 0.005 | 0.040 | 0.015 | 0.009 | 0.009 | 0.048 | 0.012 | 0.011 | 0.011 | 0.050 | 0.011 | |
| E1_T3 | 0.005 | 0.005 | 0.034 | 0.010 | 0.009 | 0.009 | 0.039 | 0.009 | 0.011 | 0.011 | 0.046 | 0.008 | |
| E2_T3 | 0.004 | 0.004 | 0.049 | 0.012 | 0.009 | 0.009 | 0.037 | 0.011 | 0.012 | 0.011 | 0.049 | 0.011 | |
| E3_T3 | 0.005 | 0.005 | 0.053 | 0.012 | 0.010 | 0.010 | 0.044 | 0.011 | 0.011 | 0.011 | 0.039 | 0.010 | |
| Average | 0.004 | 0.004 | 0.045 | 0.012 | 0.008 | 0.008 | 0.050 | 0.011 | 0.010 | 0.010 | 0.052 | 0.011 | |
SE of the prediction accuracies given in Figure 1, Figure 2, and Figure 3, with Pearson correlation for each environment–trait (Env–Trait) combination using the four methods under study [IBCF, matrix factorization (MF), ME, and MTME] for data simulated with a correlation (Cor) of traits (genetic and residual) and correlation of environments of 0.85, 0.5, and 0.25. S1 is the scenario under normality, S2 is the scenario under the error negative skew multiplied by 1.25, and S3 represents the scenario under the error positive skew multiplied by 1.25. The notation E1_T1 means environment 1, trait 1.