| Literature DB >> 27317786 |
Alencar Xavier1, William M Muir2, Katy Martin Rainey3.
Abstract
Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set.Entities:
Keywords: SoyNAM; bayesian methods; genomic selection; machine learning
Mesh:
Year: 2016 PMID: 27317786 PMCID: PMC4978914 DOI: 10.1534/g3.116.032268
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Effect of training population size on accuracy for six soybean traits. Plant height (HT), days to maturity (R8), number of reproductive nodes (No), pods per node (PN), number of pods (Po), and grain yield (GY).
Figure 2Boxplots of the accuracy of the genomic prediction models in soybeans tested in a variety of scenarios (i.e., combinations of trait, number of SNPs, environment, and training population size). Whiskers represent the upper and lower limit, and the box represents the quartiles Q1 (25%), Q2 (median), and Q3 (75%). Models include additive methods (BayesA, BayesB, BayesC, and BLASSO), kernel methods (GBLUP and RKHS), and each combination of both. BLASSO, Bayesian least absolute shrinkage and selection operator; GBLUP, genomic best linear unbiased predictor; RKHS, reproducing kernel Hilbert space; SNP, single nucleotide polymorphism.
Genomic heritability (h2), average predictive ability [Cor(y, ^y)], and accuracy in two environments (2013 and 2014) for six soybean traits
| Trait | h2 | Accuracy | ||||
|---|---|---|---|---|---|---|
| 2013 | 2014 | 2013 | 2014 | 2013 | 2014 | |
| HT | 0.522 | 0.478 | 0.459 | 0.418 | 0.635 | 0.604 |
| R8 | 0.374 | 0.317 | 0.398 | 0.355 | 0.650 | 0.630 |
| No | 0.307 | 0.259 | 0.334 | 0.309 | 0.603 | 0.607 |
| PN | 0.238 | 0.189 | 0.275 | 0.258 | 0.563 | 0.593 |
| Po | 0.264 | 0.253 | 0.283 | 0.296 | 0.552 | 0.589 |
| GY | 0.494 | 0.409 | 0.423 | 0.399 | 0.602 | 0.623 |
| Mean | 0.366 | 0.317 | 0.362 | 0.339 | 0.601 | 0.608 |
h2, genomic heritability; Cor(y,), average predictive ability; HT, plant height; R8, days to maturity; No, number of reproductive nodes; PN, pods per node; Po, number of pods; GY, grain yield.