| Literature DB >> 26715095 |
Lian Lian1, Gustavo de Los Campos2.
Abstract
The Finlay-Wilkinson regression (FW) is a popular method among plant breeders to describe genotype by environment interaction. The standard implementation is a two-step procedure that uses environment (sample) means as covariates in a within-line ordinary least squares (OLS) regression. This procedure can be suboptimal for at least four reasons: (1) in the first step environmental means are typically estimated without considering genetic-by-environment interactions, (2) in the second step uncertainty about the environmental means is ignored, (3) estimation is performed regarding lines and environment as fixed effects, and (4) the procedure does not incorporate genetic (either pedigree-derived or marker-derived) relationships. Su et al. proposed to address these problems using a Bayesian method that allows simultaneous estimation of environmental and genotype parameters, and allows incorporation of pedigree information. In this article we: (1) extend the model presented by Su et al. to allow integration of genomic information [e.g., single nucleotide polymorphism (SNP)] and covariance between environments, (2) present an R package (FW) that implements these methods, and (3) illustrate the use of the package using examples based on real data. The FW R package implements both the two-step OLS method and a full Bayesian approach for Finlay-Wilkinson regression with a very simple interface. Using a real wheat data set we demonstrate that the prediction accuracy of the Bayesian approach is consistently higher than the one achieved by the two-step OLS method.Entities:
Keywords: Bayesian; Finlay–Wilkinson; GenPred; correlation; genomic selection; genomic/environment; genotype by environment interaction; reaction norm; shared data resource
Mesh:
Year: 2015 PMID: 26715095 PMCID: PMC4777122 DOI: 10.1534/g3.115.026328
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Estimated variance components (posterior 95% credibility intervals in parentheses) from different models
| Parameters | FW Output | OLS | GibbsI (A = I) | GibbsA (A = G) |
|---|---|---|---|---|
| 0.32 | 0.30 (0.28, 0.32) | 0.30 (0.28, 0.32) | ||
| NA | 0.09 (0.07, 0.11) | 0.11 (0.08, 0.14) | ||
| NA | 0.10 (0.07, 0.12) | 0.13 (0.10, 0.17) | ||
| NA | 0.90 (0.24, 1.90) | 0.88 (0.24, 1.88) |
Pearson’s product-moment correlation between parameter estimates derived by each of the three methods implemented in Box 3
| OLS–GibbsI | OLS–GibbsA | GibbsI–GibbsA | |
|---|---|---|---|
| 1.00 | 1.00 | 1.00 | |
| 0.94 | 0.81 | 0.83 | |
| 0.98 | 0.79 | 0.81 | |
| 0.96 | 0.94 | 0.97 |
Figure 1Plot of variety performance vs. estimated environment values. Each line represents a different variety. Lines are fitted values and points are the cell means of genotype and environment combination. The horizontal axis displays the estimated environmental effects. The labels of these environments are also displayed; these labels can be removed by setting ENVlabel = F.
Figure 2Plot of the performance of five varieties on estimated environment values. Each color represents a different variety. Lines are fitted values and circles are the cell means of genotype by environment combination. The horizontal axis displays the estimated environmental effects. The labels of these environments are also displayed; these labels can be removed by setting ENVlabel = F.
Estimated environment effects from GibbsI and GibbsH
| ENV | GibbsI | GibbsH | GibbsH_NA |
|---|---|---|---|
| 1 | 0.52 | 0.51 | 0.78 |
| 2 | −0.18 | −0.19 | 0.74 |
| 4 | −0.78 | −0.78 | −0.52 |
| 5 | −1.38 | −1.39 | −1.11 |
Figure 3Trace and density plot of variance components from GibbsA.
Figure 4Trace plot of the intercept (mu) and the first two levels of environment effects (h[1]) and h[2]) from GibbsA.
Figure 5Prediction accuracy for training and validation sets for the three methods implemented in Box 8.
Pearson’s product-moment correlation between parameter estimates derived by each of the three methods implemented in Box 8 (results from the first replicate only)
| OLS–GibbsI | OLS–GibbsA | GibbsI–GibbsA | |
|---|---|---|---|
| 1.00 | 1.00 | 1.00 | |
| 0.85 | 0.64 | 0.79 | |
| 0.96 | 0.73 | 0.77 | |
| 0.91 | 0.87 | 0.97 |
| 1 | library(FW); data(wheat); attach(wheat.Y) |
| 2 | OLS = FW(y = y,VAR = VAR,ENV = ENV, method=”OLS”) |
| 3 | GibbsI = FW(y = y,VAR = VAR,ENV = ENV, |
| 4 | method=”Gibbs”,seed = 12345,saveAt=”GibbsI”,nIter = 50000 |
| 5 | ,burnIn = 5000) |
| 6 | GibbsA = FW(y = y,VAR = VAR,ENV = ENV, |
| 7 | method=”Gibbs”,A = wheat.G,seed = 12345, |
| 8 | saveAt=”GibbsA”,nIter = 50000,burnIn = 5000) |
| 9 | load(“GibbsIsamps.rda”) |
| 10 | HPDinterval(samps[,c(“var_e”,”var_g”,”var_b”,”var_h”)]) |
| 11 | load(“GibbsAsamps.rda”) |
| 12 | HPDinterval(samps[,c(“var_e”,”var_g”,”var_b”,”var_h”)]) |