| Literature DB >> 23390602 |
Zhigang Guo1, Dominic M Tucker, Daolong Wang, Christopher J Basten, Elhan Ersoz, William H Briggs, Jianwei Lu, Min Li, Gilles Gay.
Abstract
Most of previous empirical studies with genome-wide prediction were focused on within-environment prediction based on a single-environment (SE) model. In this study, we evaluated accuracy improvements of across-environment prediction by using genetic and residual covariance across correlated environments. Predictions with a multienvironment (ME) model were evaluated for two corn polygenic leaf structure traits, leaf length and leaf width, based on within-population (WP) and across-population (AP) experiments using a large maize nested association mapping data set consisting of 25 populations of recombinant inbred-lines. To make our study more applicable to plant breeding, two cross-validation schemes were used by evaluating accuracies of (CV1) predicting unobserved phenotypes of untested lines and (CV2) predicting unobserved phenotypes of lines that have been evaluated in some environments but not others. We concluded that (1) genome-wide prediction provided greater prediction accuracies than traditional quantitative trait loci-based prediction in both WP and AP and provided more advantages over quantitative trait loci -based prediction for WP than for AP. (2) Prediction accuracy with ME was significantly greater than that attained by SE in CV1 and CV2, and gains with ME over SE were greater in CV2 than in CV1. These gains were also greater in WP than in AP in both CV1 and CV2. (3) Gains with ME over SE attributed to genetic correlation between environments, with little effect from residual correlation. Impacts of marker density on predictions also were investigated in this study.Entities:
Keywords: GenPred; best linear unbiased prediction; genetic correlation; maize; ridge regression; shared data resources
Mesh:
Year: 2013 PMID: 23390602 PMCID: PMC3564986 DOI: 10.1534/g3.112.005066
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Population information for the 25 NAM populations used for analysis in the current study
| PopId | Crosses | Sample Size | Marker Number | Genomic Coverage, cM |
|---|---|---|---|---|
| 1 | B73×B97 | 167 | 805 | 1386 |
| 2 | B73×CML103 | 173 | 813 | 1396 |
| 3 | B73×CML228 | 180 | 889 | 1396 |
| 4 | B73×CML247 | 165 | 828 | 1397 |
| 5 | B73×CML277 | 156 | 820 | 1385 |
| 6 | B73×CML322 | 173 | 838 | 1394 |
| 7 | B73×CML333 | 165 | 827 | 1388 |
| 8 | B73×CML52 | 163 | 834 | 1390 |
| 9 | B73×CML69 | 176 | 840 | 1389 |
| 10 | B73×Hp301 | 158 | 794 | 1387 |
| 11 | B73×IIL4H | 152 | 825 | 1389 |
| 12 | B73×Ki11 | 170 | 822 | 1386 |
| 13 | B73×Ki3 | 103 | 791 | 1397 |
| 14 | B73×Ky21 | 182 | 817 | 1395 |
| 15 | B73×M162W | 163 | 827 | 1378 |
| 16 | B73×M37W | 172 | 788 | 1389 |
| 17 | B73×Mo18W | 178 | 818 | 1386 |
| 18 | B73×MS71 | 171 | 771 | 1371 |
| 19 | B73×NC350 | 176 | 827 | 1388 |
| 20 | B73×NC358 | 156 | 809 | 1395 |
| 21 | B73×Oh43 | 164 | 811 | 1389 |
| 22 | B73×Oh7B | 156 | 789 | 1396 |
| 23 | B73×P39 | 161 | 828 | 1378 |
| 24 | B73×Tx303 | 168 | 807 | 1394 |
| 25 | B73×Tzi8 | 183 | 859 | 1381 |
| Mean | 165 | 819 | 1389 |
NAM, nested association mapping.
Figure 1 Example of one replicate of cross-validation in CV1 and CV2. White boxes represent observed phenotypic records, and black ones represent missing phenotypic records.
Prediction accuracy with QP and GWP using SE and ME models in CV1 and CV2 for traits LL and LW based on 25 NAM population
| LL | LW | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SE | ME | SE | ME | |||||||
| Scheme | Approach | Envi | QP | GWP | QP | GWP | QP | GWP | QP | GWP |
| CV1 | WP | E1 | 0.20 (2.2) | 0.38 (0.93) | 0.17 (2.4, −0.10) | 0.42 (1.40, 0.10) | 0.19 (2.2) | 0.40 (1.11) | 0.20 (2.6, 0.06) | 0.46 (1.29, 0.15) |
| E2 | 0.19 (2.4) | 0.41 (1.15) | 0.17 (2.4, −0.12) | 0.44 (1.60, 0.06) | 0.24 (2.4) | 0.46 (0.92) | 0.23 (2.6, −0.05) | 0.49 (1.18, 0.08) | ||
| E3 | 0.17 (2.1) | 0.38 (1.24) | 0.16 (2.4, −0.04) | 0.42 (1.57, 0.10) | 0.15 (2.0) | 0.37 (1.40) | 0.18 (2.6, 0.18) | 0.43 (1.37, 0.16) | ||
| E4 | 0.16 (2.0) | 0.37 (1.27) | 0.16 (2.4, 0.00) | 0.41 (1.64, 0.10) | 0.27 (2.7) | 0.48 (0.78) | 0.24 (2.6, −0.10) | 0.52 (1.15, 0.08) | ||
| Mean | 0.18 (2.2) | 0.39 (1.00) | 0.17 (2.4, −0.05) | 0.42 (1.47, 0.08) | 0.21 (2.3) | 0.43 (1.05) | 0.21 (2.6, 0.00) | 0.48 (1.29, 0.12) | ||
| AP | E1 | 0.27 (10.4) | 0.31 (0.15) | 0.29 (14.0, 0.09) | 0.31 (0.07, 0.00) | 0.32 (13.0) | 0.36 (0.13) | 0.34 (12.1, 0.08) | 0.37 (0.09, 0.04) | |
| E2 | 0.28 (13.1) | 0.32 (0.14) | 0.29 (14.0, 0.01) | 0.33 (0.14, 0.01) | 0.37 (12.8) | 0.42 (0.14) | 0.39 (12.1, 0.06) | 0.43 (0.10, 0.02) | ||
| E3 | 0.24 (12.8) | 0.29 (0.21) | 0.25 (14.0, 0.04) | 0.30 (0.20, 0.01) | 0.30 (10.3) | 0.34 (0.13) | 0.32 (12.1, 0.09) | 0.35 (0.09, 0.03) | ||
| E4 | 0.25 (14.0) | 0.30 (0.20) | 0.23 (14.0, −0.07) | 0.30 (0.30, 0.00) | 0.38 (12.7) | 0.42 (0.11) | 0.39 (12.1, 0.03) | 0.43 (0.10, 0.01) | ||
| Mean | 0.26 (12.6) | 0.31 (0.19) | 0.27 (14.0, 0.04) | 0.32 (0.19, 0.03) | 0.34 (12.2) | 0.39 (0.15) | 0.36 (12.1. 0.06) | 0.40 (0.11, 0.03) | ||
| CV2 | WP | E1 | 0.20 (2.1) | 0.39 (0.98) | 0.24 (1.8, 0.25) | 0.53 (1.19, 0.38) | 0.19 (2.2) | 0.41 (1.14) | 0.27 (2.0, 0.42) | 0.55 (1.05, 0.36) |
| E2 | 0.20 (2.3) | 0.41 (1.10) | 0.24 (1.8, 0.23) | 0.56 (1.30, 0.35) | 0.24 (2.5) | 0.46 (0.92) | 0.31 (2.0, 0.28) | 0.59 (0.91, 0.27) | ||
| E3 | 0.18 (2.0) | 0.38 (1.17) | 0.23 (1.8, 0.33) | 0.52 (1.24, 0.37) | 0.16 (2.0) | 0.37 (1.36) | 0.25 (2.0, 0.63) | 0.52 (1.03, 0.40) | ||
| E4 | 0.16 (2.0) | 0.38 (1.38) | 0.24 (1.8, 0.49) | 0.53 (1.23, 0.39) | 0.27 (2.7) | 0.48 (0.78) | 0.32 (2.0, 0.18) | 0.61 (0.93, 0.28) | ||
| Mean | 0.19 (2.1) | 0.39 (1.05) | 0.24 (1.8, 0.26) | 0.54 (1.30, 0.40) | 0.22 (2.4) | 0.43 (0.95) | 0.29 (2.0, 0.32) | 0.57 (0.96, 0.33) | ||
| AP | E1 | 0.27 (8.0) | 0.32 (0.19) | 0.29 (9.5, 0.09) | 0.36 (0.24, 0.12) | 0.31 (8.5) | 0.36 (0.16) | 0.35 (11.1, 0.13) | 0.40 (0.14, 0.10) | |
| E2 | 0.28 (9.8) | 0.34 (0.21) | 0.30 (9.5, 0.05) | 0.37 (0.23, 0.10) | 0.38 (10.6) | 0.43 (0.13) | 0.40 (11.1, 0.06) | 0.46 (0.15, 0.07) | ||
| E3 | 0.23 (7.8) | 0.31 (0.35) | 0.26 (9.5, 0.13) | 0.35 (0.35, 0.12) | 0.30 (7.3) | 0.35 (0.17) | 0.33 (11.1, 0.12) | 0.38 (0.15, 0.09) | ||
| E4 | 0.23 (7.4) | 0.31 (0.35) | 0.26 (9.5, 0.15) | 0.35 (0.35, 0.12) | 0.39 (11.0) | 0.43 (0.10) | 0.41 (11.1, 0.04) | 0.46 (0.12, 0.07) | ||
| Mean | 0.25 (8.3) | 0.32 (0.14) | 0.28 (9.5, 0.12) | 0.36 (0.29, 0.13) | 0.35 (9.4) | 0.39 (0.11) | 0.37 (11.1, 0.06) | 0.43 (0.16, 0.10) | ||
QP, quantitative trait loci-based prediction; GWP, genome-wide prediction; SE, single environment; ME, multienvironment; LL, leaf length; LW, leaf width; NAM, nested association mapping; Envi, environment; WP, within population; AP, across population.
In parentheses is the number of QTL identified by QP based on the SE model.
In parentheses is the gain in prediction accuracy with GWP over QP based on the SE model.
The first value in parentheses is the number of QTL identified by QP based on the ME model; and the second one the gain with ME over SE for QP.
The first value in parentheses is the gain in accuracy with GWP over QP based on the ME model; and the second one is the gain in accuracy with ME over SE using GWP.
Prediction accuracies with ME GWP based on SG-SR, SG-UR, UG-SR, and UG-UR for traits LL and LW based on 25 NAM populations
| LL | LW | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Scheme | Approach | Envi | SG-SR | SG-UR | UG-SR | UG-UR | SG-SR | SG-UR | UG-SR | UG-UR |
| CV1 | WP | E1 | 0.38 | 0.33 (–0.14) | 0.42 (0.10) | 0.42 (0.00) | 0.40 | 0.37 (–0.09) | 0.46 (0.14) | 0.46 (0.00) |
| E2 | 0.41 | 0.36 (–0.12) | 0.45 (0.08) | 0.44 (–0.02) | 0.46 | 0.42 (–0.08) | 0.50 (0.09) | 0.49 (–0.01) | ||
| E3 | 0.38 | 0.32 (–0.16) | 0.43 (0.12) | 0.42 (–0.02) | 0.37 | 0.33 (–0.11) | 0.43 (0.16) | 0.43 (0.00) | ||
| E4 | 0.37 | 0.31 (–0.16) | 0.42 (0.13) | 0.41 (–0.02) | 0.48 | 0.45 (–0.07) | 0.52 (0.08) | 0.52 (0.00) | ||
| Mean | 0.39 | 0.33 (–0.18) | 0.43 (0.10) | 0.42 (–0.02) | 0.43 | 0.39 (–0.09) | 0.48 (0.12) | 0.48 (0.00) | ||
| AP | E1 | 0.31 | 0.30 (–0.03) | 0.31 (0.02) | 0.31 (0.00) | 0.36 | 0.35 (–0.02) | 0.37 (0.04) | 0.37 (0.00) | |
| E2 | 0.32 | 0.32 (–0.02) | 0.33 (0.01) | 0.33 (0.00) | 0.42 | 0.41 (–0.01) | 0.42 (0.01) | 0.43 (0.01) | ||
| E3 | 0.29 | 0.29 (0.00) | 0.30 (0.01) | 0.30 (0.00) | 0.34 | 0.33 (–0.02) | 0.35 (0.03) | 0.35 (0.00) | ||
| E4 | 0.30 | 0.29 (–0.03) | 0.30 (0.00) | 0.30 (0.00) | 0.42 | 0.41 (–0.02) | 0.43 (0.01) | 0.43 (0.00) | ||
| Mean | 0.31 | 0.30 (–0.03) | 0.31 (0.00) | 0.31 (0.00) | 0.39 | 0.38 (–0.03) | 0.39 (0.00) | 0.40 (0.01) | ||
| CV2 | WP | E1 | 0.39 | 0.37 (–0.04) | 0.54 (0.39) | 0.53 (–0.01) | 0.41 | 0.39 (–0.02) | 0.55 (0.36) | 0.55 (0.00) |
| E2 | 0.41 | 0.40 (–0.03) | 0.56 (0.35) | 0.56 (0.00) | 0.46 | 0.45 (–0.02) | 0.59 (0.27) | 0.59 (0.00) | ||
| E3 | 0.38 | 0.36 (–0.06) | 0.53 (0.38) | 0.52 (–0.01) | 0.37 | 0.36 (–0.03) | 0.52 (0.40) | 0.52 (0.00) | ||
| E4 | 0.38 | 0.36 (–0.05) | 0.54 (0.43) | 0.53 (–0.01) | 0.48 | 0.47 (–0.02) | 0.61 (0.28) | 0.61 (0.00) | ||
| Mean | 0.39 | 0.37 (–0.05) | 0.54 (0.38) | 0.54 (0.00) | 0.43 | 0.42 (–0.02) | 0.57 (0.33) | 0.57 (0.00) | ||
| AP | E1 | 0.32 | 0.32 (0.00) | 0.36 (0.12) | 0.36 (0.00) | 0.36 | 0.36 (0.00) | 0.40 (0.10) | 0.40 (0.00) | |
| E2 | 0.34 | 0.34 (0.00) | 0.37 (0.10) | 0.37 (0.00) | 0.43 | 0.42 (–0.01) | 0.46 (0.07) | 0.46 (0.00) | ||
| E3 | 0.31 | 0.31 (0.00) | 0.35 (0.12) | 0.35 (0.00) | 0.35 | 0.35 (0.00) | 0.38 (0.09) | 0.38 (0.00) | ||
| E4 | 0.31 | 0.31 (0.00) | 0.35 (0.12) | 0.35 (0.00) | 0.43 | 0.43 (0.00) | 0.46 (0.07) | 0.46 (0.00) | ||
| Mean | 0.32 | 0.32 (0.00) | 0.36 (0.13) | 0.36 (0.00) | 0.39 | 0.39 (0.00) | 0.43 (0.10) | 0.43 (0.00) | ||
ME, multi-environment; GWP, genome-wide prediction; SG-SR, structured genetic and residual covariance; SG-UR, Structured genetic and unstructured residual covariance; UG-SR, Unstructured genetic and structured residual covariance; UG-UR, Unstructured genetic and residual covariance; LL, leaf length; LW, leaf width; NAM, nested association mapping; Envi, environment; WP, within population; AP, across population.
In parentheses is the gain in prediction accuracy with SG-UR over SG-SR.
In parentheses is the gain in accuracy with UG-SR over SG-SR.
In parentheses is the gain in accuracy with UG-UR over UG-SR.
Figure 2 Prediction accuracy in environment E1 for the trait LL using SE and ME models in GWP with different levels of marker densities. The training sample proportion is 0.6 in NAM population B73×CML322. (A) CV1; (B) CV2.