| Literature DB >> 35046967 |
Margherita Crosta1, Nelson Nazzicari1, Barbara Ferrari1, Luciano Pecetti1, Luigi Russi2, Massimo Romani1, Giovanni Cabassi1, Daniele Cavalli1, Adriano Marocco3, Paolo Annicchiarico1.
Abstract
Wider pea (Pisum sativum L.) cultivation has great interest for European agriculture, owing to its favorable environmental impact and provision of high-protein feedstuff. This work aimed to investigate the extent of genotype × environment interaction (GEI), genetically based trade-offs and polygenic control for crude protein content and grain yield of pea targeted to Italian environments, and to assess the efficiency of genomic selection (GS) as an alternative to phenotypic selection (PS) to increase protein yield per unit area. Some 306 genotypes belonging to three connected recombinant inbred line (RIL) populations derived from paired crosses between elite cultivars were genotyped through genotyping-by-sequencing and phenotyped for grain yield and protein content on a dry matter basis in three autumn-sown environments of northern or central Italy. Line variation for mean protein content ranged from 21.7 to 26.6%. Purely genetic effects, compared with GEI effects, were over two-fold larger for protein content, and over 2-fold smaller for grain and protein yield per unit area. Grain yield and protein content exhibited no inverse genetic correlation. A genome-wide association study revealed a definite polygenic control not only for grain yield but also for protein content, with small amounts of trait variation accounted for by individual loci. On average, the GS predictive ability for individual RIL populations based on the rrBLUP model (which was selected out of four tested models) using by turns two environments for selection and one for validation was moderately high for protein content (0.53) and moderate for grain yield (0.40) and protein yield (0.41). These values were about halved for inter-environment, inter-population predictions using one RIL population for model construction to predict data of the other populations. The comparison between GS and PS for protein yield based on predicted gains per unit time and similar evaluation costs indicated an advantage of GS for model construction including the target RIL population and, in case of multi-year PS, even for model training based on data of a non-target population. In conclusion, protein content is less challenging than grain yield for phenotypic or genome-enabled improvement, and GS is promising for the simultaneous improvement of both traits.Entities:
Keywords: Pisum sativum; crop quality; crude protein yield; genetic variation; genomic selection; genotype × environment interaction; grain yield; inter-population prediction
Year: 2022 PMID: 35046967 PMCID: PMC8761899 DOI: 10.3389/fpls.2021.718713
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Trait mean value in three test environments of 306 pea inbred lines belonging to three connected RIL populations.
| Trait | Lodi 2013-14a | Lodi 2014-15b | Perugia 2013-14a | Standard error of meansb |
|---|---|---|---|---|
| Yield (t/ha)c | 6.31a | 4.59b | 2.90c | 0.35 |
| Protein content (%) | 25.32a | 23.22c | 24.26b | 0.15 |
| Protein yield (t/ha) | 1.60a | 1.07b | 0.70c | 0.09 |
Row means followed by different letter differ at p < 0.05. Error degrees of freedom for standard error: 6.
Mean value and genetic coefficient of variation of three traits measured in three test environments on pea lines of three RIL populations derived from connected crosses (A × I, 102 lines; K × A, 100 lines; K × I, 104 lines).
| Trait | Environment | Mean value | ||||||
|---|---|---|---|---|---|---|---|---|
| A × I | K × A | K × I | Standard error of means | A × I | K × A | K × I | ||
| Yield (t/ha) | Lodi 2013–14 | 5.99 | 6.33 | 6.54 | 0.14 | 10.1 | 17.5 | 18.2 |
| Lodi 2014–15 | 5.80 | 2.52 | 5.78 | 0.18 | 28.0 | 51.3 | 33.0 | |
| Perugia 2013–14 | 2.61 | 2.77 | 3.31 | 0.08 | 24.8 | 20.7 | 14.8 | |
| Protein content (%) | Lodi 2013–14 | 24.72 | 25.55 | 25.69 | 0.10 | 3.7 | 3.9 | 3.3 |
| Lodi 2014–15 | 23.23 | 23.03 | 23.37 | 0.10 | 3.9 | 3.6 | 3.9 | |
| Perugia 2013–14 | 23.29 | 24.82 | 24.68 | 0.11 | 3.9 | 4.5 | 3.4 | |
| Protein yield (t/ha) | Lodi 2013–14 | 1.48 | 1.62 | 1.68 | 0.03 | 11.1 | 18.0 | 18.5 |
| Lodi 2014–15 | 1.34 | 0.58 | 1.35 | 0.04 | 30.6 | 53.5 | 34.0 | |
| Perugia 2013–14 | 0.61 | 0.69 | 0.82 | 0.02 | 25.6 | 21.8 | 14.3 | |
Row means followed by different letter differ at p < 0.05.
CVg = /m, where m = trait mean value. Relevant variance different from zero at p < 0.01.
Error degrees of freedom: 303.
Components of variance relative to genotype (), genotype × environment interaction (), RIL population (), genotype within RIL population (), RIL population × environment interaction (), and genotype within RIL population × environment interaction () for three traits in three test environments of 306 pea lines belonging to three connected RIL populations.
| Trait | Analysis without RIL population factor | Analysis with RIL population factor | |||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
| |
| Yield (t/ha) | 0.575 | 1.435 | 0.401 | 0.080 | 0.520 | 1.121 | 0.693 |
| Protein content (%) | 0.724 | 0.302 | 2.393 | 0.131 | 0.637 | 0.199 | 0.167 |
| Protein yield (t/ha) | 0.036 | 0.085 | 0.422 | 0.003 | 0.034 | 0.068 | 0.040 |
Relevant variance different from zero at p < 0.01.
Significance of genotype × environment interaction (GEI p value) and genetic correlation for line values across pairs of test environments (r) for traits of 306 pea lines belonging to three connected RIL populations.
| Genetic correlation | Lodi 2013–14 vs. Lodi 2014–15 | Lodi 2013–14 vs. Perugia 2013–14 | ||
|---|---|---|---|---|
| Trait | GEI |
| GEI |
|
| Yield (t/ha) |
| 0.35 |
| 0.79 |
| Protein content (%) |
| 0.73 |
| 0.92 |
| Protein yield (t/ha) |
| 0.34 |
| 0.80 |
p value of GEI significant at p < 0.01, or r.
Genetic correlation between grain yield (GY) and grain protein content (GPC), and phenotypic correlation between protein yield per unit area (PY) and its two component traits (GY and GPC), for 306 pea lines belonging to three connected RIL populations.
| Environment | Genetic correlation ± SE | Phenotypic correlation | |
|---|---|---|---|
| GY - GPC | PY - GY | PY - GPC | |
| Lodi 2013–14 | 0.12 ± 0.08 | 0.98 | 0.30 |
| Lodi 2014–15 | 0.18 ± 0.07 | 0.99 | 0.24 |
| Perugia 2013–14 | 0.14 ± 0.08 | 0.99 | 0.29 |
p < 0.01;
p < 0.05.
Not significant (p > 0.05).
Predictive ability for three traits of four genomic selection models in the intra-population, inter-environment scenario obtained by using two environments for model training and one for validation.
| Model | Grain yield | Protein content | Protein yield |
|---|---|---|---|
| Ridge regression BLUP | 0.403 | 0.529 | 0.406 |
| Bayesian C | 0.395 | 0.530 | 0.397 |
| Bayesian A | 0.394 | 0.531 | 0.396 |
| Bayesian Lasso | 0.398 | 0.524 | 0.397 |
Results averaged across three connected RIL populations and all possible validation environments.aValues of individual analyses averaged across results of a 10-fold stratified cross-validation scheme with 10 repetitions, relative to a total number of 306 lines.
Intra-population and inter-population inter-environment predictive ability for three pea traits obtained by Ridge regression BLUP modelling using two environments for model training and one for validation and, for inter-population predictions, one RIL population for model training aimed to predictions for the other populations.
| Trait | Intra-population inter-environment | Inter-population inter-environment | ||||||
|---|---|---|---|---|---|---|---|---|
| Validation environment | RIL population used for training | |||||||
| Lodi 2013–14 | Lodi 2014–15 | Perugia 2013–14 | Mean | A × I | K × A | K × I | Mean | |
| Yield (t/ha) | 0.39 | 0.45 | 0.36 | 0.40 | 0.08 | 0.28 | 0.27 | 0.21 |
| Protein content (%) | 0.60 | 0.45 | 0.53 | 0.53 | 0.27 | 0.21 | 0.32 | 0.27 |
| Protein yield (t/ha) | 0.40 | 0.46 | 0.36 | 0.41 | 0.08 | 0.25 | 0.27 | 0.20 |
Results relative to three RIL populations derived from connected crosses (A × I, 102 lines; K × A, 100 lines; K × I, 104 lines) averaged across all possible validation environments.aAveraged across results for each of three RIL populations based on a 10-fold stratified cross-validation scheme with 10 repetitions.
Correlation of phenotypic data or genomic selection (GS)-modelled data based on two test environments with data in a third (validation) environment, averaging results for all pairs test environments.
| Trait | Phenotypic data | Data predicted by GS |
|---|---|---|
| Yield (t/ha) | 0.46 | 0.48 |
| Protein content (%) | 0.75 | 0.70 |
| Protein yield (t/ha) | 0.49 | 0.51 |
Values averaged across results for each of three connected RIL populations and all possible validation environments.
Ratio of genomic selection (GS) to phenotypic selection (PS) efficiency for protein yield based on predicted genetic gains per unit time for similar evaluation costs assuming two environments for PS and for generation of phenotyping data for intra-population and inter-population GS scenarios.
| Trait |
| GSA | GSA/PS efficiency ratio | GSB | GSB/PS efficiency ratio | ||
|---|---|---|---|---|---|---|---|
| Protein yield (t/ha) | 0.676 | 0.511 | 2.192 | 4.383 | 0.252 | 1.084 | 2.167 |
H is the square root of the broad-sense heritability on a genotype mean basis; r is the GS predictive accuracy for intra-population (GSA) or inter-population (GSB) prediction scenarios; t is the duration of one cycle of PS (one or two years). Efficiency ratios averaged across separate analyses for all possible validation environments and three connected RIL populations.
Figure 1Manhattan plots showing the association score of SNP markers with grain yield (A) and grain protein content (B) along pea chromosomes in a GWAS based on 306 lines belonging to three connected RIL populations. The dashed line represents the Bonferroni correction threshold, while the solid line represents False Discovery Rate threshold in (A), and the threshold employed to select significantly associated markers in (B).