| Literature DB >> 29109156 |
Ani A Elias1, Ismail Rabbi2, Peter Kulakow2, Jean-Luc Jannink1,3.
Abstract
Cassava (Manihot esculenta Crantz) is an important staple food in sub-Saharan Africa. Breeding experiments were conducted at the International Institute of Tropical Agriculture in cassava to select elite parents. Taking into account the heterogeneity in the field while evaluating these trials can increase the accuracy in estimation of breeding values. We used an exploratory approach using the parametric spatial kernels Power, Spherical, and Gaussian to determine the best kernel for a given scenario. The spatial kernel was fit simultaneously with a genomic kernel in a genomic selection model. Predictability of these models was tested through a 10-fold cross-validation method repeated five times. The best model was chosen as the one with the lowest prediction root mean squared error compared to that of the base model having no spatial kernel. Results from our real and simulated data studies indicated that predictability can be increased by accounting for spatial variation irrespective of the heritability of the trait. In real data scenarios we observed that the accuracy can be increased by a median value of 3.4%. Through simulations, we showed that a 21% increase in accuracy can be achieved. We also found that Range (row) directional spatial kernels, mostly Gaussian, explained the spatial variance in 71% of the scenarios when spatial correlation was significant.Entities:
Keywords: GenPred; Shared data resource; cassava; genomic selection; predictability; spatial kernel
Mesh:
Year: 2018 PMID: 29109156 PMCID: PMC5765366 DOI: 10.1534/g3.117.300323
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Details of trials used in real data analysis
| Year | Cycle | Plot Dimension | Location | Field Dimension | #Plots |
|---|---|---|---|---|---|
| 2013 | C1 | 5 × 1 | Ibadan | 24 × 33 | 736 |
| Ikenne | 16 × 54 | 855 | |||
| Mokwa | 8 × 116 | 858 | |||
| 2014 | C1 | 5 × 4 | Ibadan | 19 × 18 | 293 |
| Ikenne | 19 × 18 | 330 | |||
| Mokwa | 19 × 18 | 329 | |||
| C2 | 5 × 1 | Ikenne | 10 × 46 | 444 | |
| Mokwa | 20 × 23 | 432 | |||
| PYT | 10 × 1 | Ibadan | 8 × 26 | 176 |
Plot dimension is expressed as length × width, where length is the number of plants in a row, and width is the number of rows in a plot. Field dimension is expressed as the number of Ranges × number of Columns in a field. Finally, #Plots gives the number of plots planted.
Comparison between linear models Base and Model 1 for DM, FYLD, SHTWT, and HI
| Data | Trait | Model | Variance | h2 | Spatial Structure | LLk | Increase in pCOR (%) | Decrease in pRMSE (%) | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ibadan_2013_C1 | DM | Base | 16.155 | NA | 6.127 | 0.28 | NA | −949.14 | 3.82 | 2.1 | 0.8 |
| (511 & 485) | Model1 | 15.604 | 0.802 | 5.913 | 0.29 | Gaus - Range | −947.23 | (0.05) | |||
| ( | |||||||||||
| SHTWT | Base | 10.185 | NA | 11.888 | 0.58 | NA | −1284.39 | 17.474 | 5.4 | 1.4 | |
| (660 & 631) | Model1 | 9.022 | 1.729 | 11.875 | 0.59 | Gaus - Range | −1275.65 | (2.90E−05) | |||
| ( | |||||||||||
| Ibadan_2014_PYT | DM | Base | 15.866 | NA | 15.418 | 0.39 | NA | −310.06 | 39.724 | 27.8 | 9 |
| (148 & 80) | Model1 | 18.692 | 6.404 | 8.512 | 0.28 | Gaus - Range | −290.2 | (2.9e−10) | |||
| ( | |||||||||||
| SHTWT | Base | 23.556 | NA | 101.2 | 0.73 | NA | −433.64 | 10.782 | 23.1 | 4 | |
| (151 & 81) | Model1 | 27.546 | 16.702 | 84.024 | 0.68 | Sph - Range | −428.25 | (0.001) | |||
| ( | |||||||||||
| Ibadan_2014_C1 | FYLD | Base | 879.063 | NA | 49.264 | 0.46 | NA | −1026.56 | 4.084 | 1.1 | 0.1 |
| (286 & 266) | Model1 | 883.751 | 40.685 | 38.905 | 0.46 | Gaus - Isotropic | −1024.52 | (0.043) | |||
| ( | |||||||||||
| HI | Base | 0.018 | NA | 0 | 0.4 | NA | 518.78 | 3.894 | 2.4 | 1 | |
| (282 & 265) | Model1 | 0.017 | 0.031 | 0 | 0.37 | Gaus - Range | 520.73 | (0.048) | |||
| ( | |||||||||||
| Mokwa_2013_C1 | DM | Base | 15.983 | NA | 8.862 | 0.38 | NA | −1106.55 | 8.964 | 1.6 | 0.7 |
| (571 & 537) | Model1 | 15.536 | 1.455 | 8.602 | 0.37 | Power - Range | −1102.07 | (0.003) | |||
| ( | |||||||||||
| FYLD | Base | 2.469 | NA | 18.865 | 0.84 | NA | −1477.95 | 4.286 | 4.4 | 0.4 | |
| (734 & 694) | Model1 | 1.649 | 1.113 | 19.218 | 0.86 | Gaus - Range | −1475.81 | (0.038) | |||
| ( | |||||||||||
| Mokwa_2014_C2 | DM | Base | 13.946 | NA | 9.891 | 0.35 | NA | −513.17 | 7.294 | 5.2 | 1.3 |
| (260 & 239) | Model1 | 13.171 | 1.13 | 9.215 | 0.37 | Gaus - Range | −509.52 | (0.007) | |||
| ( | |||||||||||
| HI | Base | 0.008 | NA | 0.012 | 0.63 | NA | 473.95 | 6.736 | 6.2 | 1.5 | |
| (310 & 286) | Model1 | 0.008 | 0.004 | 0.011 | 0.61 | Gaus - Range | 477.32 | (0.009) | |||
| ( | |||||||||||
| SHTWT | Base | 1.743 | NA | 37.669 | 0.91 | NA | −754.11 | 4.418 | 8 | 0.6 | |
| (324 & 300) | Model1 | 1.943 | 159.034 | 36.727 | 0.9 | Gaus - Range | −751.9 | (0.035) | |||
| ( | |||||||||||
| Ikenne_2013_C1 | DM | Base | 25.097 | NA | 4.387 | 0.31 | NA | −1205.86 | 7.804 | 1.3 | 0.3 |
| (627 & 611) | Model1 | 26.914 | 0.903 | 2.919 | 0.32 | Gaus - Isotropic | −1201.96 | (0.005) | |||
| ( | |||||||||||
| HI | Base | 0.007 | NA | 0.012 | 0.51 | NA | 1167.01 | 3.04 | 1.2 | 0.7 | |
| (757 & 736) | Model1 | 0.007 | 0 | 0.012 | 0.51 | Gaus - Isotropic | 1168.53 | (0.081) | |||
| ( | |||||||||||
| SHTWT | Base | 13.31 | NA | 17.192 | 0.65 | NA | −1650.46 | 3.932 | 1.4 | 0.2 | |
| (781 & 753) | Model1 | 13.581 | 0.672 | 16.467 | 0.64 | Power - Column | −1648.49 | (0.047) | |||
| ( | |||||||||||
Under the Trait, the number of observations and unique genotypes analyzed is given in brackets. Variance of zero indicates that variance was <1e−03. Narrow sense heritability (h2) is calculated from the BLUP values of genotypes and genotypic variance. Spatial structure is given with the direction and the parameter value in brackets. Chi-square statistic is calculated from the log likelihood values (LLk) of the Base and selected Model 1 is given with p-value in brackets. The table shows results from trial-trait analysis with significant improvement in fit of Model 1 over Base at α = 0.1. Percentage change in pCOR and pRMSE for Model 1 compared to Base after CV are given.
Figure 1Spatial correlation with distance (meters) using different structures and standardizing parameters—an illustration. (A) Power; (B) Spherical; (C) Gaussian.
Figure 2Original observation (column 1), spatial BLUP (column 2), residual from Model 1 (column 3), and residual from Base (column 4) for dry root weight (DM) for Ibadan_2014_PYT. Plots are rectangular, and white dots inside each plot indicate the plots whose observation was used. Missing values were interpolated linearly for visualization. These plots do not represent the actual dimension of those in the field, but used as placeholders in order to visualize the trends in effects.
Output on adding extraneous error component to the selected Model 1 or Base for DM, FYLD, SHTWT, and HI
| Data | Trait | Model | Variance | h2 | LLk | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Ibadan_2013_C1 | FYLD | Base | 7.425 | NA | 19.791 | NA | NA | 0.7 | −1359.31 | 21.34 |
| Model2 | 6.876 | NA | 18.171 | NA | 2.049 | 0.68 | −1348.64 | (3.80E−06) | ||
| SHTWT | Model1 | 9.022 | 1.729 | 11.875 | NA | NA | 0.59 | −1275.65 | 6.48 | |
| Model2 | 8.686 | 1.763 | 11.419 | NA | 0.645 | 0.6 | −1272.41 | (0.01) | ||
| HI | Base | 0.005 | NA | 0.009 | NA | NA | 0.44 | 1092.16 | 7.89 | |
| Model2 | 0.005 | NA | 0.009 | NA | 0.001 | 0.44 | 1096.1 | (0.005) | ||
| Mokwa_2013_C1 | FYLD | Model1 | 1.649 | 1.113 | 19.218 | NA | NA | 0.86 | −1475.81 | 2.88 |
| Model2 | 1.362 | 1.101 | 18.609 | NA | 0.824 | 0.87 | −1474.37 | (0.089) | ||
| HI | Base | 0.005 | NA | 0.009 | NA | NA | 0.62 | 1270.87 | 10.15 | |
| Model2 | 0.005 | NA | 0.009 | 0.001 | NA | 0.66 | 1275.94 | (0.001) | ||
| Mokwa_14_C1 | HI | Base | 0.007 | NA | 0.006 | NA | NA | 0.59 | 561.33 | 3.66 |
| Model2 | 0.006 | NA | 0.005 | 0 | NA | 0.54 | 563.15 | (0.055) | ||
| Ikenne_2014_C1 | DM | Base | 31.949 | NA | 0.719 | NA | NA | 0.44 | −579.46 | 21.87 |
| Model2 | 36.116 | NA | 0.033 | NA | 1.379 | 0.46 | −568.52 | (2.9e−06) | ||
| SHTWT | Base | 721.089 | NA | 29.191 | NA | NA | 0.5 | −1109.42 | 3.27 | |
| Model2 | 752.485 | NA | 15.792 | 10.999 | NA | 0.49 | −1107.79 | (0.071) | ||
Base is the model having only the genetic variance component. Model 1 has a spatial variance component in addition, and Model 2 has extraneous error component added to the best model selected between Model 1 and Base for a particular trial and trait. Narrow sense heritability (h2) is calculated from BLUP values and genotypic variance. Chi-square statistic is calculated from the log likelihood values (LLk) of the Base/Model 1, and selected Model 2 is given with p-value in brackets. The table shows results from trial-trait analysis with significant improvement in model fit of Model 2 over the best of Base/Model 1 at α = 0.1.
Figure 3Results from simulation studies using data simulated with Power (column 1) and Gaussian (column 2) spatial kernels. A.1 and A.2 represent data simulated using low genotypic ratio (0.3), while those in B.1 and B.2 represent data simulated with high genotypic ratio (0.9). Accuracy, the correlation of true to estimated genotypic value, is given on the y-axis. Note differences in y-axis scales across all plots. The x-axis represents the fraction of spatial to total error variance (fraSp). The interaction of Model with fraSp is shown by nonparallel lines. ANOVA results are given in Tables S.1–S.6 in File S1.