| Literature DB >> 34893831 |
Xabi Cazenave1, Bernard Petit1, Marc Lateur2, Hilde Nybom3, Jiri Sedlak4, Stefano Tartarini5, François Laurens1, Charles-Eric Durel1, Hélène Muranty1.
Abstract
Genomic selection is an attractive strategy for apple breeding that could reduce the length of breeding cycles. A possible limitation to the practical implementation of this approach lies in the creation of a training set large and diverse enough to ensure accurate predictions. In this study, we investigated the potential of combining two available populations, i.e., genetic resources and elite material, in order to obtain a large training set with a high genetic diversity. We compared the predictive ability of genomic predictions within-population, across-population or when combining both populations, and tested a model accounting for population-specific marker effects in this last case. The obtained predictive abilities were moderate to high according to the studied trait and small increases in predictive ability could be obtained for some traits when the two populations were combined into a unique training set. We also investigated the potential of such a training set to predict hybrids resulting from crosses between the two populations, with a focus on the method to design the training set and the best proportion of each population to optimize predictions. The measured predictive abilities were very similar for all the proportions, except for the extreme cases where only one of the two populations was used in the training set, in which case predictive abilities could be lower than when using both populations. Using an optimization algorithm to choose the genotypes in the training set also led to higher predictive abilities than when the genotypes were chosen at random. Our results provide guidelines to initiate breeding programs that use genomic selection when the implementation of the training set is a limitation.Entities:
Keywords: zzm321990 Malus domesticazzm321990 ; GenPred; Genomic Prediction; Shared Data Resource; genomic selection; germplasm; population combination; training set design
Mesh:
Year: 2022 PMID: 34893831 PMCID: PMC9210277 DOI: 10.1093/g3journal/jkab420
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Composition of the training and validation set in the different scenarios
| Method | Training set | Validation set |
|---|---|---|
| WP | Genetic resources | Genetic resources |
| Elite | Elite | |
| AP | Elite | Genetic resources |
| Genetic resources | Elite | |
| Comb | Genetic resources + Elite | Genetic resources |
| Genetic resources + Elite | Elite | |
| Prop_hybrids | Genetic resources + Elite | Hybrids |
Figure 1Schematic representation of the tested scenarios in this study. (A) The VS is composed of 20% of one of the two populations (elite material or genetic resources) and is predicted either by the remaining 80% of the population (within-population prediction), by the other population (across-population prediction) or by combining both populations (combination prediction). (B) The panel of hybrids is used as VS and is predicted by combining elite material and genetic resources into a unique population. Several training sets with increasing sizes and different proportions of the two populations are tested.
Figure 2PCA performed using pruned marker data of the FBo-Hi and hybrids panel.
Genomic and environmental variances and genomic correlations estimated from the Gibbs sampler for each trait in the two population of the FBo-Hi and REFPOP datasets
| Elite | Genetic Resources | ||||||
|---|---|---|---|---|---|---|---|
| Trait | σ² | σ² |
| σ² | σ² |
|
|
| FBo-Hi dataset | |||||||
| Harvest date | 194.2 (20.58) | 85.81 (5.92) | 0.69 (0.03) | 314.5 (25.47) | 108.91 (10.37) | 0.74 (0.03) | 0.99 (0.02) |
| Fruit over-color | 0.6 (0.08) | 0.53 (0.03) | 0.52 (0.04) | 0.8 (0.09) | 0.43 (0.04) | 0.65 (0.04) | 0.65 (0.07) |
| Acidity | 1.5 (0.24) | 1.63 (0.1) | 0.48 (0.05) | 1.1 (0.15) | 1.01 (0.08) | 0.53 (0.05) | 0.74 (0.07) |
| Juiciness | 0.8 (0.15) | 1.09 (0.07) | 0.42 (0.05) | 0.5 (0.07) | 0.7 (0.05) | 0.43 (0.04) | 0.52 (0.12) |
| Crispness | 0.4 (0.1) | 1.16 (0.07) | 0.27 (0.05) | 0.5 (0.08) | 0.85 (0.06) | 0.37 (0.05) | 0.45 (0.13) |
| REFPOP dataset | |||||||
| Harvest date | 245.8 (46.47) | 52.34 (14.25) | 0.82 (0.06) | 341.2 (38.37) | 36.33 (13.84) | 0.9 (0.04) | 1 (0) |
| Fruit over-color | 1 (0.16) | 0.34 (0.06) | 0.75 (0.05) | 1.3 (0.15) | 0.25 (0.06) | 0.83 (0.05) | 0.72 (0.08) |
| Fruit number | 69.6 (23.01) | 112.77 (15.64) | 0.38 (0.1) | 70.6 (16.03) | 47.21 (10.4) | 0.59 (0.1) | 0.98 (0.03) |
| Fruit weight | 0.4 (0.09) | 0.21 (0.04) | 0.66 (0.08) | 0.6 (0.1) | 0.23 (0.05) | 0.74 (0.06) | 0.42 (0.15) |
The standard deviation of the genomic and environmental variances, of the heritability values, and of the genomic correlations is shown between brackets.
Predictive abilities of the measured traits in the within-population (WP), across-population (AP), combined populations (Comb), and MG-GBLUP method
| Elite material | Genetic resources | ||||
|---|---|---|---|---|---|
| Trait | Method | Medium density | High density | Medium density | High density |
| FBo-Hi dataset | |||||
| Harvest date | WPP |
| 0.79 (0.03) |
|
|
| APP | 0.6 (0.05) | 0.64 (0.04) | 0.47 (0.05) | 0.51 (0.05) | |
| Comb | 0.78 (0.03) | 0.79 (0.03) |
|
| |
| MG-GBLUP |
|
|
|
| |
| Fruit over-color | WPP |
| 0.71 (0.03) | 0.59 (0.04) | 0.57 (0.05) |
| APP | 0.47 (0.05) | 0.45 (0.05) | 0.39 (0.06) | 0.35 (0.06) | |
| Comb |
|
|
| 0.58 (0.05) | |
| MG-GBLUP |
|
|
|
| |
| Crispness | WPP | 0.34 (0.06) | 0.33 (0.06) | 0.34 (0.05) | 0.34 (0.05) |
| APP | 0.15 (0.07) | 0.17 (0.06) | 0.21 (0.05) | 0.22 (0.05) | |
| Comb |
|
|
|
| |
| MG-GBLUP | 0.35 (0.06) | 0.35 (0.06) |
| 0.35 (0.05) | |
| Juiciness | WPP |
| 0.48 (0.05) |
| 0.41 (0.05) |
| APP | 0.18 (0.07) | 0.16 (0.07) | 0.13 (0.07) | 0.14 (0.07) | |
| Comb |
|
| 0.38 (0.05) | 0.4 (0.05) | |
| MG-GBLUP |
|
|
|
| |
| Acidity | WPP | 0.48 (0.05) | 0.47 (0.05) | 0.45 (0.05) | 0.43 (0.05) |
| APP | 0.39 (0.05) | 0.39 (0.05) | 0.28 (0.06) | 0.23 (0.06) | |
| Comb |
|
|
|
| |
| MG-GBLUP |
| 0.49 (0.04) |
| 0.45 (0.05) | |
| REFPOP dataset | |||||
| Harvest date | WPP |
| 0.6 (0.09) | 0.79 (0.04) | 0.78 (0.04) |
| APP | 0.44 (0.12) | 0.45 (0.12) | 0.41 (0.08) | 0.47 (0.08) | |
| Comb |
|
|
|
| |
| MG-GBLUP |
| 0.62 (0.10) | 0.79 (0.04) | 0.79 (0.04) | |
| Fruit over-color | WPP | 0.79 (0.05) | 0.78 (0.05) | 0.69 (0.06) | 0.64 (0.06) |
| APP | 0.73 (0.05) | 0.69 (0.06) | 0.56 (0.09) | 0.52 (0.09) | |
| Comb | 0.81 (0.05) | 0.80 (0.05) |
|
| |
| MG-GBLUP |
|
|
|
| |
| Fruit weight | WPP | 0.52 (0.09) | 0.52 (0.09) | 0.57 (0.08) | 0.58 (0.08) |
| APP | 0.29 (0.11) | 0.3 (0.11) | 0.19 (0.10) | 0.2 (0.10) | |
| Comb |
|
|
|
| |
| MG-GBLUP |
| 0.54 (0.09) |
| 0.6 (0.08) | |
| Fruit number | WPP | 0.48 (0.08) | 0.46 (0.08) | 0.5 (0.11) | 0.51 (0.11) |
| APP | 0.27 (0.12) | 0.24 (0.03) | 0.27 (0.11) | 0.29 (0.11) | |
| Comb |
|
|
|
| |
| MG-GBLUP | 0.47 (0.08) | 0.45 (0.09) | 0.51 (0.11) | 0.52 (0.11) | |
The standard deviation of the predictive abilities is shown between brackets. Bold values represent the highest predictive ability value for a given trait at a given marker density.
Figure 3Predictive abilities for (A) harvest date and (B) fruit over-color in the FBo-Hi or REFPOP dataset with medium and high marker density. WP: within-population prediction AP: across-population prediction Comb: combination prediction MG-Comb: combination prediction with the MG-GBLUP method.
Figure 4Predictive abilities for harvest date and fruit over-color in the dataset of hybrids with medium and high marker density when the training set is composed of varying proportions of elite material and genetic resources of the FbBo-Hi or REFPOP dataset. max(predAbi): maximum predictive ability obtained for a given marker density with one of the three tested methods regardless of the training set size Random: predictive ability obtained when randomly choosing the genotypes included in the training set.