| Literature DB >> 26206349 |
Saravanan Thavamanikumar1, Rudy Dolferus2, Bala R Thumma2.
Abstract
Genomic selection (GS) is becoming an important selection tool in crop breeding. In this study, we compared the ability of different GS models to predict time to young microspore (TYM), a flowering time-related trait, spike grain number under control conditions (SGNC) and spike grain number under osmotic stress conditions (SGNO) in two wheat biparental doubled haploid populations with unrelated parents. Prediction accuracies were compared using BayesB, Bayesian least absolute shrinkage and selection operator (Bayesian LASSO / BL), ridge regression best linear unbiased prediction (RR-BLUP), partial least square regression (PLS), and sparse partial least square regression (SPLS) models. Prediction accuracy was tested with 10-fold cross-validation within a population and with independent validation in which marker effects from one population were used to predict traits in the other population. High prediction accuracies were obtained for TYM (0.51-0.84), whereas moderate to low accuracies were observed for SGNC (0.10-0.42) and SGNO (0.27-0.46) using cross-validation. Prediction accuracies based on independent validation are generally lower than those based on cross-validation. BayesB and SPLS outperformed all other models in predicting TYM with both cross-validation and independent validation. Although the accuracies of all models are similar in predicting SGNC and SGNO with cross-validation, BayesB and SPLS had the highest accuracy in predicting SGNC with independent validation. In independent validation, accuracies of all the models increased by using only the QTL-linked markers. Results from this study indicate that BayesB and SPLS capture the linkage disequilibrium between markers and traits effectively leading to higher accuracies. Excluding markers from QTL studies reduces prediction accuracies.Entities:
Keywords: GenPred; SNP; biparental populations; genomic prediction; genomic selection; shared data resource; wheat
Mesh:
Year: 2015 PMID: 26206349 PMCID: PMC4592981 DOI: 10.1534/g3.115.019745
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Prediction accuracies obtained for TYM in C×H and S×A populations using 10-fold CV. Analyses were conducted with population-specific SNPs (1975 SNPs for C×H and 1483 SNPs for S×A) as well as 808 SNPs that are common to both the populations. Error bars are standard errors of mean from 10 repeats.
Prediction accuracies of TYM and SGN in the S×A population using the models developed in the C×H population
| Trait | BayesB | SPLS | BL | RR-BLUP | PLS |
|---|---|---|---|---|---|
| TYM (808 SNPs) | 0.70 | 0.66 | 0.59 | 0.36 | 0.32 |
| TYM (42 SNPs) | 0.71 | 0.70 | 0.70 | 0.68 | 0.69 |
| TYM (766 SNPs) | −0.03 | 0.06 | −0.07 | −0.06 | −0.03 |
| SGNC (808 SNPs) | 0.22 | 0.24 | 0.16 | 0.13 | 0.12 |
| SGNC (42 SNPs) | 0.21 | 0.21 | 0.21 | 0.20 | 0.19 |
| SGNC (766 SNPs) | 0.001 | 0.03 | 0.000 | 0.003 | 0.000 |
| SGNO (808 SNPs) | 0.06 | 0.10 | 0.05 | 0.07 | 0.11 |
| SGNO (42 SNPs) | 0.26 | 0.23 | 0.28 | 0.31 | 0.30 |
| SGNO (766 SNPs) | 0.00 | 0.06 | −0.04 | 0.01 | 0.00 |
TYM, time to young microspore; SGNC, spike grain number under control conditions; SGNO, spike grain number under osmotic conditions; SPLS, sparse partial least squares; BL, Bayesian LASSO; RR-BLUP, Ridge regression best linear unbiased prediction; PLS, partial least squares.
808 SNPs are common to both C×H and S×A biparental populations.
42 SNPs from chromosome 5A where QTL were identified for TYM and SGNO in a separate study.
42 QTL associated SNPs were excluded from 808 common SNPs.
Figure 2Prediction accuracies for SGNC in C×H and S×A populations using 10-fold CV. Analyses were conducted with both population-specific SNPs (1975 SNPs in C×H and 1483 SNPs in S×A) and 808 SNPs that are common to both the populations. Error bars are standard errors of mean from 10 repeats.
Figure 3Prediction accuracies for SGNO in C×H and S×A populations using 10-fold CV. Analyses were conducted with both population-specific SNPs (1975 SNPs in C×H and 1483 SNPs in S×A) and 808 SNPs that are common to both populations. Error bars are standard errors of mean from 10 repeats.