| Literature DB >> 32664601 |
Dennis N Lozada1,2, Arron H Carter1.
Abstract
Achieving optimal predictive ability is key to increasing the relevance of implementing genomic selection (GS) approaches in plant breeding programs. The potential of an item-based collaborative filtering (IBCF) recommender system in the context of multi-trait, multi-environment GS has been explored. Different GS scenarios for IBCF were evaluated for a diverse population of winter wheat lines adapted to the Pacific Northwest region of the US. Predictions across years through cross-validations resulted in improved predictive ability when there is a high correlation between environments. Using multiple spectral traits collected from high-throughput phenotyping resulted in better GS accuracies for grain yield (GY) compared to using only single traits for predictions. Trait adjustments through various Bayesian regression models using genomic information from SNP markers was the most effective in achieving improved accuracies for GY, heading date, and plant height among the GS scenarios evaluated. Bayesian LASSO had the highest predictive ability compared to other models for phenotypic trait adjustments. IBCF gave competitive accuracies compared to a genomic best linear unbiased predictor (GBLUP) model for predicting different traits. Overall, an IBCF approach could be used as an alternative to traditional prediction models for important target traits in wheat breeding programs.Entities:
Keywords: Bayesian models; genomic BLUP (GBLUP); grain yield; heading date; high-throughput phenotyping; item-based collaborative filtering (IBCF); plant height; recommender system; snow mold tolerance; spectral reflectance indices
Mesh:
Year: 2020 PMID: 32664601 PMCID: PMC7397162 DOI: 10.3390/genes11070779
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Sample phenotypic data for five lines and four trait–environment combinations (A, B, C, and D).
| Line | A | B | C | D |
|---|---|---|---|---|
| 1 | xA1 | 2.00 | 1.50 | 2.25 |
| 2 | 1.75 | xB2 | 2.25 | 3.00 |
| 3 | 1.50 | 2.25 | xC3 | 2.75 |
| 4 | 2.00 | 3.00 | 2.00 | xD4 |
| 5 | 1.25 | 2.75 | 2.00 | 3.00 |
| Mean | 1.63 | 2.50 | 1.94 | 2.75 |
| Std. dev | 0.28 | 0.40 | 0.27 | 0.31 |
Adjusted (standardized) phenotypic data from Table 1.
| Line | A | B | C | D |
|---|---|---|---|---|
| 1 | xA1 | −1.27 | −1.61 | −1.63 |
| 2 | 0.45 | xB2 | 1.15 | 0.82 |
| 3 | −0.45 | −0.63 | xC3 | 0.0 |
| 4 | 1.34 | 1.27 | 0.23 | xD4 |
| 5 | −1.34 | 0.63 | 0.23 | 0.82 |
Correlation between each trait–environment combination.
| A | B | C | D | |
|---|---|---|---|---|
|
| 1.00 | 0.14 | −0.31 | −0.55 |
|
| 0.14 | 1.00 | 0.44 | 0.52 |
|
| −0.31 | 0.435 | 1.00 | 0.94 |
|
| −0.55 | 0.516 | 0.94 | 1.00 |
Predictive ability for grain yield by training different years to predict performance in subsequent growing seasons across environments using an IBCF (item-based collaborative filtering) approach.
| Location | Training Year(s) | Test Year | Predictive Ability | MAAPE |
|---|---|---|---|---|
| LND | 2015 | 2017 | −0.03 | 0.22 |
| LND | 2015 | 2018 | −0.18 | 0.45 |
| LND | 2015 | 2019 | 0.43 | 0.28 |
| LND | 2015, 2017 | 2018 | −0.18 | 0.39 |
| LND | 2015, 2017 | 2019 | 0.43 | 0.21 |
| LND | 2015, 2017, 2018 | 2019 | 0.46 | 0.16 |
| PUL | 2015 | 2016 | −0.20 | 0.31 |
| PUL | 2015 | 2017 | −0.08 | 0.24 |
| PUL | 2015 | 2018 | −0.21 | 0.24 |
| PUL | 2015 | 2019 | 0.43 | 0.25 |
| PUL | 2015, 2016 | 2017 | −0.12 | 0.27 |
| PUL | 2015, 2016 | 2018 | −0.21 | 0.33 |
| PUL | 2015, 2016 | 2019 | 0.36 | 0.29 |
| PUL | 2015, 2016, 2017 | 2018 | −0.16 | 0.35 |
| PUL | 2015, 2016, 2017 | 2019 | 0.19 | 0.21 |
| PUL | 2015, 2016, 2017, 2018 | 2019 | 0.32 | 0.14 |
MAAPE—mean arctangent percentage error.
Figure 1Predictive ability under cross-validations for grain yield across nine environments using an IBCF recommender system. Bars indicate standard errors.
Figure 2Mean predictive ability for grain yield across different environments using spectral indices collected from high-throughput field phenotyping under an IBCF (item-based collaborative filtering) recommender system. NDVI—Normalized Difference Vegetative Index; NWI-1—Normalized Water Index-1; SR—Simple Ratio. Bars indicate standard errors.
Predictive ability from cross-validations for grain yield, heading date, and plant height for phenotypic values adjusted using genomic (marker) information from various Bayesian models.
| Model | Env | Grain Yield | Heading Date | Plant Height | |||
|---|---|---|---|---|---|---|---|
| Pred. Ability (PA) | SE_PA 2 | Pred. Ability (PA) | SE_PA | Pred. Ability (PA) | SE_PA | ||
| BRR 1 | LND15 | 0.20 | 0.03 | 0.62 | 6.10 × 10−3 | 0.61 | 0.02 |
| LND17 | 0.66 | 0.02 | 0.36 | 0.03 | 0.36 | 0.03 | |
| LND18 | 0.60 | 0.02 | −0.15 | 0.03 | 0.56 | 0.02 | |
| LND19 | 0.52 | 0.02 | 0.57 | 0.02 | 0.27 | 0.02 | |
| PUL15 | 0.49 | 0.03 | 0.18 | 0.04 | −0.18 | 0.03 | |
| PUL16 | 0.41 | 0.01 | 0.85 | 0.02 | 0.86 | 7.50 × 10−3 | |
| PUL17 | 0.56 | 0.02 | 0.93 | 5.90 × 10−3 | 0.92 | 7.10 × 10−3 | |
| PUL18 | 0.47 | 0.03 | 0.88 | 0.01 | 0.86 | 0.01 | |
| PUL19 | 0.58 | 0.02 | 0.90 | 6.00 × 10−3 | 0.89 | 6.20 × 10−3 | |
| Bayes A | LND15 | 0.19 | 0.04 | 0.58 | 0.01 | 0.61 | 0.03 |
| LND17 | 0.69 | 0.01 | 0.24 | 0.03 | 0.37 | 0.02 | |
| LND18 | 0.58 | 0.03 | −0.24 | 0.03 | 0.56 | 0.02 | |
| LND19 | 0.59 | 0.02 | 0.57 | 0.02 | 0.22 | 0.02 | |
| PUL15 | 0.55 | 0.02 | 0.17 | 0.02 | −0.12 | 0.03 | |
| PUL16 | 0.42 | 0.03 | 0.83 | 0.02 | 0.83 | 7.80 | |
| PUL17 | 0.60 | 0.03 | 0.94 | 3.80 × 10−3 | 0.91 | 4.30 | |
| PUL18 | 0.45 | 0.03 | 0.86 | 6.80 × 10−3 | 0.87 | 7.10 | |
| PUL19 | 0.61 | 0.01 | 0.90 | 5.10 × 10−3 | 0.88 | 5.90 | |
| Bayes B | LND15 | 0.25 | 0.03 | 0.65 | 0.01 | 0.63 | 0.02 |
| LND17 | 0.62 | 0.02 | 0.30 | 0.02 | 0.33 | 0.03 | |
| LND18 | 0.55 | 0.02 | −0.17 | 0.03 | 0.59 | 0.01 | |
| LND19 | 0.55 | 0.03 | 0.52 | 0.01 | 0.23 | 0.02 | |
| PUL15 | 0.48 | 0.03 | 0.15 | 0.04 | −0.14 | 0.02 | |
| PUL16 | 0.41 | 0.03 | 0.83 | 0.02 | 0.83 | 0.01 | |
| PUL17 | 0.57 | 0.02 | 0.93 | 5.00 × 10−3 | 0.92 | 4.50 × 10−3 | |
| PUL18 | 0.45 | 0.02 | 0.87 | 6.20 × 10−3 | 0.87 | 0.01 | |
| PUL19 | 0.58 | 0.02 | 0.90 | 5.00 × 10−3 | 0.87 | 0.01 | |
| Bayes C | LND15 | 0.28 | 0.03 | 0.64 | 0.02 | 0.56 | 0.02 |
| LND17 | 0.69 | 0.02 | 0.24 | 0.03 | 0.36 | 0.03 | |
| LND18 | 0.55 | 0.02 | −0.22 | 0.03 | 0.55 | 0.03 | |
| LND19 | 0.51 | 0.02 | 0.54 | 0.02 | 0.31 | 0.03 | |
| PUL15 | 0.52 | 0.013 | 0.09 | 0.03 | −0.19 | 0.02 | |
| PUL16 | 0.40 | 0.03 | 0.86 | 0.01 | 0.84 | 8.50 × 10−3 | |
| PUL17 | 0.60 | 0.02 | 0.93 | 5.00 × 10−3 | 0.91 | 4.70 × 10−3 | |
| PUL18 | 0.44 | 0.01 | 0.87 | 9.20 × 10−3 | 0.86 | 6.10 × 10−3 | |
| PUL19 | 0.56 | 0.01 | 0.90 | 4.90 × 10−3 | 0.88 | 6.30 × 10−3 | |
| BL 3 | LND15 | 0.34 | 0.03 | 0.64 | 0.02 | 0.66 | 0.02 |
| LND17 | 0.66 | 0.01 | 0.30 | 0.04 | 0.40 | 0.04 | |
| LND18 | 0.61 | 0.02 | −0.22 | 0.03 | 0.59 | 0.02 | |
| LND19 | 0.56 | 0.02 | 0.59 | 0.02 | 0.25 | 0.03 | |
| PUL15 | 0.57 | 0.02 | 0.24 | 0.03 | −0.20 | 0.03 | |
| PUL16 | 0.42 | 0.03 | 0.88 | 9.70 × 10−3 | 0.86 | 7.90 × 10−3 | |
| PUL17 | 0.61 | 0.02 | 0.93 | 5.20 × 10−3 | 0.92 | 4.40 × 10−3 | |
| PUL18 | 0.43 | 0.01 | 0.86 | 6.80 × 10−3 | 0.86 | 0.01 | |
| PUL19 | 0.65 | 0.03 | 0.91 | 6.60 × 10−3 | 0.90 | 6.70 × 10−3 | |
1 Bayesian Ridge Regression. 2 Standard error of predictive ability. 3 Bayesian LASSO.
Figure 3Mean predictive ability for snow mold tolerance across different environments and models. Trait values were adjusted using Bayesian models for IBCF. ALL—BLUP across all environments; BLUP17—BLUP across 2017 growing season; BLUP18—BLUP across 2018 growing season; MAN17—BLUE for Mansfield, WA, for 2017; MAN18—BLUE for Mansfield, WA, for 2018; WAT17—BLUE for Waterville, WA, for 2017; WAT18—BLUE for Waterville, WA, for 2018. BRR—Bayesian Ridge Regression; BL—Bayesian LASSO. GBLUP is a univariate, non-IBCF genomic BLUP model. Bars indicate standard errors.
Predictive ability for grain yield, heading date, and plant height using a genomic BLUP (GBLUP) model across different environments.
| Env | Grain Yield | Heading Date | Plant Height | |||
|---|---|---|---|---|---|---|
| Pred. Ability | RMSE | Pred. Ability | RMSE | Pred. Ability | RMSE | |
| LND15 | 0.47 | 0.47 | 0.59 | 2.13 | 0.63 | 5.91 |
| LND17 | 0.39 | 0.68 | 0.30 | 2.23 | 0.27 | 8.19 |
| LND18 | 0.44 | 0.86 | 0.25 | 2.30 | 0.29 | 7.08 |
| LND19 | 0.39 | 0.56 | 0.68 | 1.82 | 0.38 | 6.03 |
| PUL15 | 0.41 | 0.85 | 0.20 | 2.20 | 0.18 | 6.86 |
| PUL16 | 0.43 | 0.78 | 0.71 | 1.54 | 0.42 | 6.90 |
| PUL17 | 0.39 | 0.63 | 0.69 | 2.26 | 0.51 | 2.55 |
| PUL18 | 0.42 | 0.79 | 0.64 | 1.60 | 0.52 | 6.95 |
| PUL19 | 0.36 | 1.14 | 0.61 | 2.23 | 0.58 | 6.32 |
RMSE—root mean square error.