| Literature DB >> 27067826 |
Sebastian Michel1, Christian Ametz2, Huseyin Gungor3, Doru Epure4, Heinrich Grausgruber5, Franziska Löschenberger2, Hermann Buerstmayr6.
Abstract
KEY MESSAGE: We evaluated genomic selection across five breeding cycles of bread wheat breeding. Bias of within-cycle cross-validation and methods for improving the prediction accuracy were assessed. The prospect of genomic selection has been frequently shown by cross-validation studies using the same genetic material across multiple environments, but studies investigating genomic selection across multiple breeding cycles in applied bread wheat breeding are lacking. We estimated the prediction accuracy of grain yield, protein content and protein yield of 659 inbred lines across five independent breeding cycles and assessed the bias of within-cycle cross-validation. We investigated the influence of outliers on the prediction accuracy and predicted protein yield by its components traits. A high average heritability was estimated for protein content, followed by grain yield and protein yield. The bias of the prediction accuracy using populations from individual cycles using fivefold cross-validation was accordingly substantial for protein yield (17-712 %) and less pronounced for protein content (8-86 %). Cross-validation using the cycles as folds aimed to avoid this bias and reached a maximum prediction accuracy of [Formula: see text] = 0.51 for protein content, [Formula: see text] = 0.38 for grain yield and [Formula: see text] = 0.16 for protein yield. Dropping outlier cycles increased the prediction accuracy of grain yield to [Formula: see text] = 0.41 as estimated by cross-validation, while dropping outlier environments did not have a significant effect on the prediction accuracy. Independent validation suggests, on the other hand, that careful consideration is necessary before an outlier correction is undertaken, which removes lines from the training population. Predicting protein yield by multiplying genomic estimated breeding values of grain yield and protein content raised the prediction accuracy to [Formula: see text] = 0.19 for this derived trait.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27067826 PMCID: PMC4869760 DOI: 10.1007/s00122-016-2694-2
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Mean, variance components and heritability for grain yield (dt ha−1), protein content (%) and protein yield (dt ha−1) of genotyped lines across all trials in the respective breeding cycles 2010–2014
| Trait | Parameter | Breeding cycles | ||||
|---|---|---|---|---|---|---|
| 2010 | 2011 | 2012 | 2013 | 2014 | ||
| Grain yield | Trials | 5 | 6 | 4 | 5 | 8 |
|
| 2.28 ± 1.28 | 4.60 ± 1.60 | 5.03 ± 1.25 | 6.64 ± 1.76 | 37.00 ± 4.71 | |
|
| 23.70 ± 1.83 | 23.67 ± 1.99 | 17.80 ± 1.21 | 40.98 ± 2.36 | 54.48 ± 2.26 | |
|
| 0.32 | 0.54 | 0.53 | 0.45 | 0.84 | |
| Protein content | Trials | 4 | 2 | 3 | 4 | 2 |
|
| 0.23 ± 0.05 | 0.18 ± 0.05 | 0.35 ± 0.05 | 0.37 ± 0.06 | 0.33 ± 0.09 | |
|
| 0.36 ± 0.04 | 0.07 ± 0.03 | 0.27 ± 0.03 | 0.65 ± 0.05 | 0.65 ± 0.08 | |
|
| 0.72 | 0.84 | 0.80 | 0.69 | 0.50 | |
| Protein yield | Trials | 4 | 2 | 4 | 4 | 3 |
|
| 0.04 ± 0.03 | 0.03 ± 0.07 | 0.05 ± 0.02 | 0.26 ± 0.05 | 0.76 ± 0.14 | |
|
| 0.41 ± 0.04 | 0.38 ± 0.09 | 0.34 ± 0.03 | 0.69 ± 0.05 | 1.30 ± 0.11 | |
|
| 0.30 | 0.14 | 0.37 | 0.60 | 0.64 | |
| Lines | 94 | 64 | 165 | 160 | 176 | |
Genotypic variance (), genotype by trial interaction variance (), and heritability (h )
Fig. 1Bias of the within- cycle prediction accuracy in comparison with the between-cycle prediction accuracy for grain yield, protein content and protein yield and using lines from the years 2010–2014 as training populations
Fig. 2Heatmap of the pair-wise prediction accuracy between breeding cycles on the off-diagonal and the result of the fivefold within-cycle cross-validation on the diagonal
Fig. 3Influence of removing outlier years or environments from the training set on the prediction accuracy. Results were obtained using across-cycle cross-validation with years as folds. The horizontal red line indicates the maximum prediction accuracy in the complete dataset
Fig. 4Proportion of correctly selected lines when applying genomic selection for grain yield of either the best or worst lines in the independent validation population of the year 2015