| Literature DB >> 31331290 |
Paolo Annicchiarico1, Nelson Nazzicari2, Luciano Pecetti2, Massimo Romani2, Luigi Russi3.
Abstract
BACKGROUND: A thorough verification of the ability of genomic selection (GS) to predict estimated breeding values for pea (Pisum sativum L.) grain yield is pending. Prediction for different environments (inter-environment prediction) has key importance when breeding for target environments featuring high genotype × environment interaction (GEI). The interest of GS would increase if it could display acceptable prediction accuracies in different environments also for germplasm that was not used in model training (inter-population prediction).Entities:
Keywords: Breeding value; Cross-population prediction; Genotype × environment interaction; Genotyping-by-sequencing; Pisum sativum; Predictive ability; Yield
Year: 2019 PMID: 31331290 PMCID: PMC6647272 DOI: 10.1186/s12864-019-5920-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Mean trait value of three pea test environments
| Trait | Lodi 2013–14 | Lodi 2014–15 | Perugia 2013–14 |
|---|---|---|---|
| Grain yield (t/ha) | 6.307 a | 4.586 b | 2.905 c |
| Onset of flowering (dd from Apr. 1) | 12.42 b | 15.36 a | 12.44 b |
| Lodging susceptibility (score 1 = min, 5 = max) | 2.50 b | 3.67 a | 2.18 b |
| Individual seed weight (g) | 0.213 a | 0.187 b | 0.190 b |
| Winter plant survival (proportion) | 0.991 a | 0.726 b | 0.995 a |
Row means with different letter differ at P < 0.05
Mean value, and genetic coefficient of variation (CVg), for trait values in three test environments of three pea RIL populations derived from three connected crosses (A × I, 102 lines; K × A, 100 lines; K × I, 104 lines)
| Mean valuec | CVg (%)d | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Traita | Environmentb | A × I | K × A | K × I | A × I | K × A | K × I | ||||||
| GY (t/ha) | Lo14 | 5.994 | a | 6.326 | a | 6.541 | a | 10.1 | ** | 17.5 | ** | 18.2 | ** |
| GY (t/ha) | Lo15 | 5.805 | a | 2.522 | b | 5.777 | a | 28.0 | ** | 51.3 | ** | 33.0 | ** |
| GY (t/ha) | Pg14 | 2.615 | b | 2.773 | b | 3.310 | a | 24.8 | ** | 20.7 | ** | 14.8 | ** |
| OF (dd from Apr. 1) | Lo14 | 10.26 | c | 13.84 | a | 13.14 | b | 18.5 | ** | 37.5 | ** | 33.2 | ** |
| OF (dd from Apr. 1) | Lo15 | 12.45 | c | 18.72 | a | 14.81 | b | 14.2 | ** | 28.3 | ** | 27.7 | ** |
| OF (dd from Apr. 1) | Pg14 | 10.58 | b | 13.43 | a | 13.17 | a | 28.3 | ** | 39.5 | ** | 41.8 | ** |
| LS (score 1 = min, 5 = max) | Lo14 | 2.53 | a | 2.43 | a | 2.58 | a | 20.9 | ** | 9.7 | NS | 22.9 | ** |
| LS (score 1 = min, 5 = max) | Lo15 | 3.66 | b | 3.52 | ab | 3.80 | a | 7.4 | ** | 0.0 | NS | 8.7 | ** |
| LS (score 1 = min, 5 = max) | Pg14 | 2.37 | a | 2.14 | a | 2.02 | a | 16.2 | ** | 13.7 | NS | 23.0 | ** |
| SW (g) | Lo14 | 0.207 | b | 0.230 | a | 0.201 | b | 9.6 | ** | 8.0 | ** | 9.3 | ** |
| SW (g) | Lo15 | 0.190 | a | 0.192 | a | 0.180 | b | 8.2 | ** | 5.8 | ** | 7.9 | ** |
| SW (g) | Pg14 | 0.183 | b | 0.207 | a | 0.181 | b | 9.4 | ** | 8.0 | ** | 9.9 | ** |
| WS (proportion) | Lo15 | 0.877 | a | 0.436 | b | 0.865 | a | 10.8 | ** | 32.2 | ** | 12.8 | ** |
a GY, grain yield; OF, onset of flowering; LS, lodging susceptibility; SW, individual seed weight; WS, winter plant survival
b Lo14, Lodi 2013–14; Lo15, Lodi 2014–15; Pg14, Perugia 2013–14
c Row means followed by different letter differ at P < 0.05
d NS and **: genetic variance not different from zero at P < 0.05 and different from zero at P < 0.01, respectively
Estimated components of variance relative to genotype (S) and genotype × environment interaction (S) and to RIL population (S), genotype within RIL population (S), RIL population × environment interaction (S) and genotype within RIL population × environment interaction (S), for traits of 306 pea lines belonging to three RIL populations tested in three environments (Lodi 2013–14; Lodi 2014–15; Perugia 2013–14)
| Analysis without RIL population factorb | Analysis with RIL population factorb | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Traita |
|
|
|
|
|
| |||||||
| GY (t/ha)2 | 0.567 | ** | 1.443 | ** | 0.39 | 0.078 | ** | 0.517 | ** | 1.122 | ** | 0.689 | ** |
| OF (dd from Apr. 1)2 | 18.88 | ** | 2.73 | ** | 6.91 | 3.98 | ** | 16.30 | ** | 1.18 | ** | 1.96 | ** |
| LS [score (1 = min, 5 = max)]2 | 0.113 | ** | 0.042 | ** | 2.68 | 0.000 | NS | 0.123 | ** | 0.015 | ** | 0.012 | NS |
| SW [(g)2 × 1000] | 0.333 | ** | 0.056 | ** | 5.94 | 0.118 | ** | 0.254 | ** | 0.043 | ** | 0.030 | ** |
| WS (proportion)2 | – | – | – | – | – | 0.063 | ** | 0.014 | ** | – | – | – | – |
a GY, grain yield; OF, onset of flowering; LS, lodging susceptibility; SW, individual seed weight; WS, winter plant survival. All traits assessed in three environments except winter plant survival, assessed only in Lodi 2014–15
b NS and **: component of variance not different from zero at P < 0.05 and different from zero at P < 0.01, respectively
Significance and extent (as genetic correlation r for genotype response across environments) of genotype × environment interaction across pairs of test environments, for traits of 306 pea lines belonging to three RIL populations
| Years 2013–14 vs 2014–15 in Lodia | Lodi vs Perugia in 2013-14a | |||
|---|---|---|---|---|
| Trait |
|
| ||
| Grain yield | ** | 0.325 | ** | 0.822 |
| Onset of flowering | ** | 0.889 | ** | 0.955 |
| Lodging susceptibility | ** | 0.639 | NS | 0.980 |
| Individual seed weight | ** | 0.879 | ** | 0.953 |
a NS and **: not significant at P < 0.05 and significant at P < 0.01, respectively; r always different from zero (P < 0.01)
Fig. 1Nominal grain yield of six top-performing pea inbred lines out of 306 derived from three connected crosses, three parent cultivars (Attika, Isard, Kaspa) and one commercial cultivar (Spacial) as a function of the environment score on the first genotype × environment interaction principal component axis (PC 1) [environments are Lodi 2013–14 (Lo14), Lodi 2014–15 (Lo15), and Perugia 2013–14 (Pg14); the graph includes the two top-yielding lines in each environment or across environments]
Fig. 2Intra-population predictive ability for pea grain yield in three environments, for all combinations of three regression models (BL, Bayesian Lasso; rrBLUP, Ridge regression BLUP; G-BLUP, genomic BLUP) and five genotype missing data thresholds. Data averaged across three pea RIL populations and 50 repetitions of 10-fold stratified cross-validation per individual analysis
Fig. 3Intra-population predictive ability for pea mean grain yield, onset of flowering, lodging susceptibility and individual seed weight across three environments and winter plant survival in one environment, for all combinations of three regression models (BL, Bayesian Lasso; rrBLUP, Ridge regression BLUP; G-BLUP, genomic BLUP) and five genotype missing data thresholds. Data averaged across two (lodging susceptibility) or three (other traits) RIL populations and 50 repetitions of 10-fold stratified cross-validation per individual analysis
Intra-population predictive ability (PA) for pea grain yield, onset of flowering, lodging susceptibility and individual seed weight averaged across three environments and winter plant survival in one environment, for different Bayesian Lasso model training and account of population structure. Values averaged across two (lodging susceptibility) or three (other traits) RIL populations
| Trait | Traininga | Structureb | PA |
|---|---|---|---|
| Grain yield | Single | No | 0.452 |
| All | No | 0.476 | |
| All | Yes | 0.474 | |
| Onset of flowering | Single | No | 0.710 |
| All | No | 0.747 | |
| All | Yes | 0.749 | |
| Lodging susceptibility | Single | No | 0.385 |
| All | No | 0.404 | |
| All | Yes | 0.411 | |
| Individual seed weight | Single | No | 0.695 |
| All | No | 0.696 | |
| All | Yes | 0.700 | |
| Winter plant survival | Single | No | 0.561 |
| All | No | 0.557 | |
| All | Yes | 0.585 |
a Single = model trained on the specific population; all = model trained on all populations joined in a single data set. Fifty repetitions of 10-fold stratified cross-validation per individual analysis
b No = no structure information; yes = structure information as RIL population fixed factor
Inter-environment predictive ability for four pea traits, using Bayesian Lasso modelling trained in one environments for prediction of independent lines in each of two other environments. Values averaged across two (lodging susceptibility) or three (other traits) RIL populations
| Training environmentb | ||||
|---|---|---|---|---|
| Traita | Lodi 2013–14 | Lodi 2014–15 | Perugia 2013–14 | Average |
| GY | 0.311 | 0.240 | 0.336 | 0.296 |
| OF | 0.680 | 0.614 | 0.669 | 0.654 |
| LS | 0.386 | 0.180 | 0.269 | 0.278 |
| SW | 0.668 | 0.616 | 0.629 | 0.638 |
a GY, grain yield; OF, onset of flowering; LS, lodging susceptibility; SW, individual seed weight
b Fifty repetitions of 10-fold stratified cross-validation per individual analysis
Inter-population predictive ability for same (intra-environment) or other environments (inter-environment) for five pea traits, using Bayesian Lasso modelling trained in one RIL population for predictions in one (lodging susceptibility) or two (other traits) other RIL populations, averaging results across validation RIL populations in one (winter plant survival) or three (other traits) environments. Inter-environment predictions for models trained in one environment and tested in each of two other environments, averaging results across training environments
| Intra-environmentb | Inter-environmentb | |||||||
|---|---|---|---|---|---|---|---|---|
| Traita | A × I | K × A | K × I | Average | A × I | K × A | K × I | Average |
| GY | 0.092 | 0.282 | 0.367 | 0.247 | 0.042 | 0.226 | 0.292 | 0.187 |
| OF | 0.359 | 0.455 | 0.522 | 0.445 | 0.287 | 0.454 | 0.457 | 0.399 |
| LS | 0.296 | – | 0.333 | 0.315 | 0.195 | – | 0.250 | 0.223 |
| SW | 0.303 | 0.253 | 0.322 | 0.293 | 0.274 | 0.263 | 0.296 | 0.277 |
| WS | 0.230 | 0.130 | 0.410 | 0.257 | – | – | – | – |
a GY, grain yield; OF, onset of flowering; LS, lodging susceptibility; SW, individual seed weight; WS, winter plant survival
bColumn headings indicate the training population
Correlation (r) of phenotypic data or genomic selection (GS)-modelled data based on two (test) environments with phenotypic data in another (validation) environment, square root of broad-sense heritability (H) when adopting two test environments, accuracy (r) of GS modelling trained in two environments for prediction in one validation environment, and GS vs phenotypic selection (PS) efficiency ratio based on predicted genetic gains per unit time for similar evaluation costs assuming two environments for PS and for generation of phenotyping data for intra-population (GSA) and inter-population (GSB) GS scenarios, for four pea traits. Data averaged across three environment combinations and two (lodging susceptibility) or three (other traits) RIL populations
|
| GSA/PS efficiency ratioe | GSB/PS efficiency ratioe | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Traita | Phenotypic data | GS-modelled datab |
| GSA
| GSB
| ||||
| GY | 0.377 | 0.402 | 0.632 | 0.390 | 1.801 | 3.602 | 0.262 | 1.209 | 2.418 |
| OF | 0.837 | 0.827 | 0.929 | 0.690 | 2.170 | 4.340 | 0.445 | 1.398 | 2.796 |
| LS | 0.398 | 0.436 | 0.609 | 0.485 | 2.323 | 4.647 | 0.420 | 2.014 | 4.029 |
| SW | 0.831 | 0.836 | 0.932 | 0.723 | 2.266 | 4.531 | 0.327 | 1.024 | 2.049 |
a GY, grain yield; OF, onset of flowering; LS, lodging susceptibility; SW, individual seed weight
b Using Bayesian Lasso modelling trained on all genotype data
c Assuming experiments with three replicates (as the current phenotyping experiments)
d Using Bayesian Lasso modelling for prediction of independent lines, using 50 repetitions of 10-fold stratified cross-validation per individual analysis
e As ratio (i r / t) / (i H / t), where i and i are standardized selection differentials for GS and PS, respectively, setting i = 1.46 i to approach same evaluation costs; and t and t are cycle duration for GS and PS, setting t = 0.5 and t = 1 (two test sites in the same test year) or t = 2 (two test years in the same site)
f Using Bayesian Lasso model training on data of one RIL population for prediction within each of two other populations
Fig. 4Top 100 genome-wide association scores, ranked in descending order, for single-nucleotide polymorphism (SNP) markers associated with five phenotypic traits of pea. GWAS was performed on stratified data, with each of two (lodging susceptibility) or three (other traits) RIL populations acting as a stratum