| Literature DB >> 28983306 |
Brigitte Mangin1, Fanny Bonnafous1, Nicolas Blanchet1, Marie-Claude Boniface1, Emmanuelle Bret-Mestries2, Sébastien Carrère1, Ludovic Cottret1, Ludovic Legrand1, Gwenola Marage1, Prune Pegot-Espagnet1, Stéphane Munos1, Nicolas Pouilly1, Felicity Vear3, Patrick Vincourt1, Nicolas B Langlade1.
Abstract
Prediction of hybrid performance using incomplete factorial mating designs is widely used in breeding programs including different heterotic groups. Based on the general combining ability (GCA) of the parents, predictions are accurate only if the genetic variance resulting from the specific combining ability is small and both parents have phenotyped descendants. Genomic selection (GS) can predict performance using a model trained on both phenotyped and genotyped hybrids that do not necessarily include all hybrid parents. Therefore, GS could overcome the issue of unknown parent GCA. Here, we compared the accuracy of classical GCA-based and genomic predictions for oil content of sunflower seeds using several GS models. Our study involved 452 sunflower hybrids from an incomplete factorial design of 36 female and 36 male lines. Re-sequencing of parental lines allowed to identify 468,194 non-redundant SNPs and to infer the hybrid genotypes. Oil content was observed in a multi-environment trial (MET) over 3 years, leading to nine different environments. We compared GCA-based model to different GS models including female and male genomic kinships with the addition of the female-by-male interaction genomic kinship, the use of functional knowledge as SNPs in genes of oil metabolic pathways, and with epistasis modeling. When both parents have descendants in the training set, the predictive ability was high even for GCA-based prediction, with an average MET value of 0.782. GS performed slightly better (+0.2%). Neither the inclusion of the female-by-male interaction, nor functional knowledge of oil metabolism, nor epistasis modeling improved the GS accuracy. GS greatly improved predictive ability when one or both parents were untested in the training set, increasing GCA-based predictive ability by 10.4% from 0.575 to 0.635 in the MET. In this scenario, performing GS only considering SNPs in oil metabolic pathways did not improve whole genome GS prediction but increased GCA-based prediction ability by 6.4%. Our results show that GS is a major improvement to breeding efficiency compared to the classical GCA modeling when either one or both parents are not well-characterized. This finding could therefore accelerate breeding through reducing phenotyping efforts and more effectively targeting for the most promising crosses.Entities:
Keywords: GBS; factorial design; genomic selection; hybrid; oil content; sunflower
Year: 2017 PMID: 28983306 PMCID: PMC5613134 DOI: 10.3389/fpls.2017.01633
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Incomplete factorial design of SUNRISE from 36 maintainer lines (females, in column) and 36 restorer lines (males, in row). Black squares indicate the 486 hybrids with phenotypic observations on at least one environment and gray squares indicate hybrids that were planned but were never observed for oil content in the MET. Male and female dendrograms are based on their kinship matrices.
Figure 2Correlation of oil content adjusted phenotype of hybrids between environments of the MET.
Number of observed hybrids (n.obs), mean oil content (in %), variance components and parts of variance (in %) [female, male, interaction female × male (inter.) and residual (resi.)] estimated using REML in GCA, FM, and FMI models, per environment (Env.).
| Female | 2.74 | 1.68 | 1.44 | 2.00 | 4.19 | 3.70 | 2.60 | 1.58 | 6.12 | |
| Male | 1.44 | 0.67 | 0.75 | 0.61 | 1.66 | 1.74 | 0.88 | 0.73 | 2.91 | |
| Resi. | 2.09 | 1.03 | 1.03 | 1.40 | 1.79 | 1.70 | 1.40 | 1.40 | 4.00 | |
| Female | 2.13 | 1.21 | 1.07 | 1.55 | 3.22 | 2.90 | 1.84 | 1.22 | 4.88 | |
| Male | 1.50 | 0.77 | 0.78 | 0.69 | 1.79 | 1.83 | 1.09 | 0.83 | 3.23 | |
| Resi. | 2.08 | 1.03 | 1.03 | 1.39 | 1.79 | 1.70 | 1.40 | 1.40 | 4.00 | |
| Female | 2.13 | 1.21 | 1.07 | 1.51 | 3.22 | 2.91 | 1.84 | 1.22 | 4.88 | |
| Male | 1.50 | 0.78 | 0.78 | 0.72 | 1.80 | 1.84 | 1.09 | 0.83 | 3.23 | |
| Inter. | 0.00 | 0.10 | 0.00 | 0.72 | 0.14 | 0.47 | 0.00 | 0.00 | 0.00 | |
| Resi. | 2.08 | 0.93 | 1.03 | 0.65 | 1.64 | 1.21 | 1.40 | 1.40 | 4.00 | |
| Female | 0.44 | 0.50 | 0.45 | 0.50 | 0.55 | 0.52 | 0.53 | 0.43 | 0.47 | |
| Male | 0.23 | 0.20 | 0.23 | 0.15 | 0.22 | 0.24 | 0.18 | 0.20 | 0.22 | |
| Female | 0.37 | 0.40 | 0.37 | 0.43 | 0.47 | 0.45 | 0.42 | 0.35 | 0.40 | |
| Male | 0.26 | 0.26 | 0.27 | 0.19 | 0.26 | 0.28 | 0.25 | 0.24 | 0.27 | |
| Female | 0.37 | 0.40 | 0.37 | 0.42 | 0.47 | 0.45 | 0.42 | 0.35 | 0.40 | |
| Male | 0.26 | 0.26 | 0.27 | 0.20 | 0.26 | 0.29 | 0.25 | 0.24 | 0.27 | |
| Inter. | 0.00 | 0.03 | 0.00 | 0.20 | 0.02 | 0.07 | 0.00 | 0.00 | 0.00 | |
Predictive ability of hybrid performances per environment (Env.) and average on the MET with GCA, FM, and FMI model BLUPs as the mean over the same 100 test sets (TS) using two sampling processes.
| 13EX01 | 0.756 | 0.762 | 0.761 | 0.580 | 0.653 | 0.651 |
| 13EX03 | 0.780 | 0.780 | 0.776 | 0.588 | 0.652 | 0.648 |
| 13EX04 | 0.767 | 0.768 | 0.766 | 0.572 | 0.641 | 0.639 |
| 13EX05 | 0.739 | 0.739 | 0.744 | 0.537 | 0.599 | 0.604 |
| 14EX04 | 0.835 | 0.836 | 0.835 | 0.589 | 0.665 | 0.665 |
| 14RV01 | 0.824 | 0.825 | 0.827 | 0.587 | 0.658 | 0.659 |
| 15EX05 | 0.800 | 0.800 | 0.799 | 0.596 | 0.634 | 0.633 |
| 15EX06 | 0.738 | 0.738 | 0.736 | 0.533 | 0.580 | 0.578 |
| 15EX07 | 0.796 | 0.797 | 0.796 | 0.590 | 0.635 | 0.633 |
| Average | 0.782 | 0.783 | 0.782 | 0.575 | 0.635 | 0.634 |
T1 and T0 hybrids are hybrids for which one or both parents have no observed descendant in the training set.
Figure 3Boxplot of test set accuracy per environment for GCA, FM and FM_oil (FM modeling using knowledge of oil metabolic network) BLUPs. The model BLUPs were computed on the same 100 test sets. The test sets contained only T1 or T0 hybrids with untested parents.
Predictive ability of hybrid performances (mean over of 100 test sets and its variance) per environment (Env.) and in average on the MET with GCA, FM, FM_oil, mk_oil (multi-kernel FM model with two groups of SNPs) and mk_epi (multi-kernel model with female, male, female × female epistasis and male × male epistasis kernels) model BLUPs.
| 13EX01 | 0.580 | 0.653 | 0.646 | 0.641 | 0.650 | 8.58 10−3 | 6.46 10−3 | 6.53 10−3 | 6.69 10−3 | 6.34 10−3 |
| 13EX03 | 0.588 | 0.653 | 0.642 | 0.645 | 0.645 | 1.55 10−2 | 1.01 10−2 | 1.17 10−2 | 1.02 10−2 | 9.91 10−3 |
| 13EX04 | 0.572 | 0.641 | 0.601 | 0.640 | 0.628 | 1.42 10−2 | 6.87 10−3 | 9.75 10−3 | 6.97 10−3 | 8.07 10−3 |
| 13EX05 | 0.537 | 0.599 | 0.579 | 0.594 | 0.575 | 1.71 10−2 | 1.19 10−2 | 1.25 10−2 | 1.17 10−2 | 1.43 10−2 |
| 14EX04 | 0.589 | 0.666 | 0.619 | 0.666 | 0.662 | 1.01 10−2 | 6.53 10−3 | 9.81 10−3 | 6.52 10−3 | 6.41 10−3 |
| 14RV01 | 0.587 | 0.659 | 0.616 | 0.656 | 0.632 | 1.61 10−2 | 1.04 10−2 | 1.34 10−2 | 1.08 10−2 | 1.09 10−2 |
| 15EX05 | 0.596 | 0.634 | 0.622 | 0.623 | 0.637 | 1.77 10−2 | 7.13 10−3 | 7.09 10−3 | 7.51 10−3 | 7.22 10−3 |
| 15EX06 | 0.533 | 0.580 | 0.574 | 0.564 | 0.555 | 2.61 10−2 | 1.20 10−2 | 1.48 10−2 | 1.38 10−2 | 1.88 10−2 |
| 15EX07 | 0.590 | 0.635 | 0.610 | 0.625 | 0.625 | 1.35 10−2 | 8.34 10−3 | 9.70 10−3 | 9.05 10−3 | 9.11 10−3 |
| Average | 0.575 | 0.635 | 0.612 | 0.628 | 0.623 | 1.54 10−2 | 8.86 10−3 | 1.06 10−2 | 9.25 10−3 | 1.01 10−2 |
The model BLUPs were computed on the same 100 test sets. The test sets contained only T1 or T0 hybrids with parents never observed by their descendants.
Figure 4Histogram of hybrid mean predicted performance of oil content on the MET based on the mean of intra-environment ( + FM BLUP).
Figure 5Mean predicted performance of hybrid oil content on the MET based on the mean of intra-environment ( + FM BLUP). Parents are ranked according to the mean of their descendants. Stable hybrids (Wricke's ecovalence less than 5) are surrounded with a blank square. A single hybrid is predicted as highly productive and stable in the right top corner of the heat map.