| Literature DB >> 36119585 |
Miguel Angel Raffo1, Pernille Sarup1,2, Jeppe Reitan Andersen2, Jihad Orabi2, Ahmed Jahoor2,3, Just Jensen1.
Abstract
Multi-trait and multi-environment analyses can improve genomic prediction by exploiting between-trait correlations and genotype-by-environment interactions. In the context of reaction norm models, genotype-by-environment interactions can be described as functions of high-dimensional sets of markers and environmental covariates. However, comprehensive multi-trait reaction norm models accounting for marker × environmental covariates interactions are lacking. In this article, we propose to extend a reaction norm model incorporating genotype-by-environment interactions through (co)variance structures of markers and environmental covariates to a multi-trait reaction norm case. To do that, we propose a novel methodology for characterizing the environment at different growth stages based on growth degree-days (GDD). The proposed models were evaluated by variance components estimation and predictive performance for winter wheat grain yield and protein content in a set of 2,015 F6-lines. Cross-validation analyses were performed using leave-one-year-location-out (CV1) and leave-one-breeding-cycle-out (CV2) strategies. The modeling of genomic [SNPs] × environmental covariates interactions significantly improved predictive ability and reduced the variance inflation of predicted genetic values for grain yield and protein content in both cross-validation schemes. Trait-assisted genomic prediction was carried out for multi-trait models, and it significantly enhanced predictive ability and reduced variance inflation in all scenarios. The genotype by environment interaction modeling via genomic [SNPs] × environmental covariates interactions, combined with trait-assisted genomic prediction, boosted the benefits in predictive performance. The proposed multi-trait reaction norm methodology is a comprehensive approach that allows capitalizing on the benefits of multi-trait models accounting for between-trait correlations and reaction norm models exploiting high-dimensional genomic and environmental information.Entities:
Keywords: genomic prediction; genotype by environment interaction; multi-environment; multi-trait; reaction norm
Year: 2022 PMID: 36119585 PMCID: PMC9481302 DOI: 10.3389/fpls.2022.939448
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Computation of growth stages and environmental covariates (ECs) for year 2014 in the locality of Skive. In total, the complete crop cycle was divided into 27 shorter periods of 100 GDD (27 growth stages). For each growth stage, a set of ECs summarizing environmental descriptors linked to water availability, radiation and temperature were computed as described in Supplementary material 1. The same approach was used for all year-location combinations. GDD: growth degree days, the GDD were estimated as the thermal sum of daily average temperature over 0°C.
Environmental covariates (ECs) description, modified from Lecomte (2005) and Heslot et al. (2014).
| Code | Variable description | Category |
| ave.glorad | Average global radiation (MJ/m2) | Radiation |
| ave.temp | Average temperature (°C) | Temperature |
| ave.vpd | Average vapour pressure deficit (VPD, kPa). VPD measures difference between air moisture and potential moisture and is related to water loss | Water/evapotranspiration |
| cumglorad | Accumulated global radiation (MJ/m2) | Radiation |
| ratrdtmp | Ratio between global radiation (MJ/m2) and temperature (°C) | Photothermal ratio |
| cumpospetp | Accumulated positive precipitation (mm) – evaporation (mm) | Water |
| cumnegpetp | Accumulated negative precipitation (mm) – evaporation (mm) | Water |
| cumpetp | Accumulated total precipitation (mm) – evaporation (mm) | Water |
| cumntdryd | Number of total dry days [precipitation (mm) ≤ evaporation (mm)] | Water |
| cumnsti4 | Accumulated temperature (°C) lower than −4°C | Frost/temperature |
| cumndt0 | Number of days with minimum temperature (°C) lower than 0°C | Frost/temperature |
| cumsti0 | Accumulated temperature (°C) lower than 0°C | Frost/temperature |
| cumprec | Accumulated precipitation (mm) | Water |
| cumvpd | Accumulated vapour pressure deficit (kPa) | Water/evapotranspiration |
| GDD | Growth degree days estimated as the thermal sum of daily average temperature over 0°C | Temperature |
| ndi10m | Number of days with radiation lower to 1,045 J/cm2 | Radiation |
| sri10m | Sum of daily radiation (MJ/m2) when radiation is lower to 1,045 J/cm2 | Radiation |
| pvt | Plant available water (%) | Soil/water |
| wscmm | Water storage capacity (mm) | Soil/water |
| claynor | Sand content (%) | Soil |
| fsandno | Fine sand content (%) | Soil |
| gsandno | Coarse sand content (%) | Soil |
| kulstof | Carbon content (%) | Soil |
| siltnor | Silt content (%) | Soil |
| Ks_250 | Saturated hydraulic conductivity (%) | Soil |
The ECs for the categories radiation, temperature, frost, photothermal ratio, water and evapotranspiration were computed each 100 GDD for the complete growth cycle starting from the sowing date. The ECs for the category soil were available at all localities for four depths: 0–30, 30–60, 60–100, 100–200 cm. The ECs from the different categories were used together to describe the environmental conditions in each year-location combination.
FIGURE 2Representation of spatial information in a field trial. The target and eight surrounding plots were used together to correct the spatial variability across the field. The trial borders’ effect was considered by adding virtual plots to complete the eight surrounding plots for all observations. Virtual plots were also added in empty X-Y coordinates (with no plot observation registered) to ensure all plots have the eight surrounding plots. Hence, the spatial effects on an individual plot is the sum of effects with the square centered on the plot itself plus the effects of the eight surrounding plots with a square centered on those plots.
FIGURE 3Heatmap of the spatial relationship matrix for locality Dyngby in the year 2017. A total of 311 plots were observed at Dyngby 2017, and the spatial relationship between plots are represented. Higher to lower relationships are represented from yellow to light-blue colors; the dark blue represents a lack of relationship between plots (not neighboring connections).
Summary of the effect included in the models and single-trait (ST) or multi-trait (MT) case.
| Models | Main effect | Interactions | ST/MT | |||
|
|
|
|
|
| ||
| M1 (baseline) |
|
|
| ST | ||
| M2 |
|
|
|
| ST | |
| M3 |
|
|
|
|
| ST |
| M4 |
|
|
|
| MT | |
| M5 |
|
|
|
|
| MT |
l line, g genomic [SNPs] additive effect, s spatial effect, f line × environment interaction, gw genomic [SNPs] additive × ECs interaction. ST, single trait model; MT, multi-trait model.
Descriptive statistics for the grain yield and protein content of F6 wheat breeding lines.
| Breeding cycle | No. of lines | No. of plots | Trait | Average (SD) | Min.–Max. values | Coef. of var. (%) |
| 1 | 321 | 1,274 | Yield | 8.82 (0.83) | 3.85–11.00 | 9.51 |
| Protein | 9.67 (0.93) | 7.50–15.10 | 9.65 | |||
| 2 | 230 | 2,258 | Yield | 8.61 (1.09) | 4.75–11.47 | 12.70 |
| Protein | 9.74 (0.87) | 7.50–14.30 | 8.95 | |||
| 3 | 336 | 3,289 | Yield | 8.31 (1.11) | 5.03–11.80 | 13.40 |
| Protein | 10.25 (0.69) | 8.40–13.00 | 6.73 | |||
| 4 | 159 | 918 | Yield | 9.09 (0.98) | 6.21–11.40 | 10.78 |
| Protein | 10.67 (0.57) | 9.00–12.60 | 5.34 | |||
| 5 | 358 | 1,674 | Yield | 8.59 (0.46) | 7.06–10.25 | 5.35 |
| Protein | 8.84 (0.41) | 7.60–10.20 | 4.46 | |||
| 6 | 257 | 1,977 | Yield | 9.36 (1.10) | 6.04–12.36 | 11.78 |
| Protein | 9.94 (0.51) | 8.50–12.00 | 5.13 | |||
| 7 | 354 | 3,040 | Yield | 9.37 (0.68) | 6.24–11.54 | 7.27 |
| Protein | 9.73 (0.59) | 8.40–12.50 | 6.04 | |||
| Total | 2,015 | 14,430 | Yield | 8.85 (1.03) | 3.85–12.35 | 11.63 |
| Protein | 9.84 (0.80) | 7.50–15.10 | 8.11 |
*The values presented correspond to the obtained F6 populations after successful phenotyping and genotyping.
**Units of measure: yield (grain yield, kg grain/8.25 m2), protein content (%); No., number; SD, standard deviation; Min, Minimum; Max, Maximum; Coef. of var., Coefficient of variation.
Posterior mean of variance components for grain yield (kg grain/8.25 m2).
| Models | Main effect | Interactions | Res. | |||
|
|
|
|
|
| ||
| M1 (baseline) | 0.093 (0.005) | 0.067 (0.003) | 0.122 (0.002) | 0.055 (0.002) | ||
| M2 | 0.049 (0.004) | 0.060 (0.008) | 0.067 (0.002) | 0.122 (0.002) | 0.055 (0.001) | |
| M3 | 0.050 (0.004) | 0.040 (0.008) | 0.065 (0.002) | 0.047 (0.002) | 0.112 (0.008) | 0.056 (0.002) |
| M4 | 0.051 (0.004) | 0.064 (0.008) | 0.067 (0.002) | 0.122 (0.002) | 0.056 (0.002) | |
| M5 | 0.051 (0.004) | 0.049 (0.007) | 0.065 (0.002) | 0.048 (0.003) | 0.112 (0.008) | 0.056 (0.002) |
*The values between parentheses are the posterior standard deviation (PSD) of the estimates.
l line, g genomic [SNPs] additive effect, s spatial effect, f line × environment interaction, gw genomic [SNPs] additive × ECs interaction, Res. residuals.
Posterior mean of variance components for protein content (%).
| Models | Main effect | Interactions | Res. | |||
|
|
|
|
|
| ||
| M1 (baseline) | 0.076 (0.004) | 0.050 (0.002) | 0.044 (0.002) | 0.049 (0.001) | ||
| M2 | 0.032 (0.004) | 0.069 (0.008) | 0.050 (0.002) | 0.044 (0.002) | 0.049 (0.001) | |
| M3 | 0.032 (0.004) | 0.058 (0.007) | 0.050 (0.002) | 0.018 (0.002) | 0.037 (0.003) | 0.048 (0.001) |
| M4 | 0.033 (0.004) | 0.067 (0.007) | 0.050 (0.002) | 0.045 (0.002) | 0.048 (0.001) | |
| M5 | 0.032 (0.004) | 0.058 (0.007) | 0.050 (0.002) | 0.020 (0.002) | 0.039 (0.003) | 0.047 (0.001) |
*The values between parentheses are the posterior standard deviation (PSD) of the estimates.
l line, g genomic [SNPs] additive effect, s spatial effect, f line × environment interaction, gw genomic [SNPs] additive × ECs interaction, Res. residuals.
Between-trait correlations for model effects of multi-trait models (M4 and M5).
| Models | Main effect | Interactions | Res. | |||
|
|
|
|
|
| ||
| M4 | −0.500 (0.066) | −0.404 (0.049) | −0.252 (0.027) | −0.687 (0.029) | −0.019 (0.019) | |
| M5 | −0.536 (0.063) | −0.419 (0.058) | −0.252 (0.027) | −0.597 (0.057) | −0.677 (0.032) | −0.016 (0.019) |
*The values between parentheses are the posterior standard deviation (PSD) of covariances estimates.
l line, g genomic [SNPs] additive effect, s spatial effect, f line × environment interaction, gw genomic [SNPs] additive × ECs interaction, Res. residuals.
FIGURE 4Barplot of predictive abilities (PAs) for grain yield in leave-one-year-location-out (CV1, upper panel) and leave-one-breeding-cycle-out (CV2, lower panel) cross-validations. M2: line + genomic [SNPs] additive effect + spatial effect + line × environment interaction. M3 expand M2 by adding a genomic [SNPs] additive × ECs interaction. M4 expand M2 to the multi-trait case. M4-TA is the M4 using trait-assisted (TA) genomic prediction. M5 expand M3 to the multi-trait case. M5-TA is the M4 using TA genomic prediction. Black bars are the 95% confidence interval. Differences in the letter above the bar represent significant differences between models (P-value < 0.01). Green lines are the theoretical maximum PAs.
FIGURE 5Barplot of predictive abilities (PAs) for protein content in leave-one-year-location-out (CV1, upper panel) and leave-one-breeding-cycle-out (CV2, lower panel) cross-validations. M2: line + genomic [SNPs] additive effect + spatial effect + line × environment interaction. M3 expand M2 by adding a genomic [SNPs] additive × ECs interaction. M4 expand M2 to the multi-trait case. M4-TA is the M4 using trait-assisted (TA) genomic prediction. M5 expand M3 to the multi-trait case. M5-TA is the M4 using TA genomic prediction. Black bars are the 95% confidence interval. Differences in the letter above the bar represent significant differences between models (P-value < 0.01). Green lines are the theoretical maximum PAs.
Slope of regression (b) of estimated genetic values with whole information on genetic values with partial information for grain yield in leave-one-year-location-out and leave-one-breeding-cycle-out cross-validations.
| Models | CV1: leave-one-year-location-out | CV2: leave-one-breeding-cycle-out | ||
| M2 | 1.02 | – | 0.80 | – |
| M4 | 1.02 | – | 0.80 | – |
| M4-TA | 1.01 | – | 0.88 | – |
| M3 | 1.02 | 0.87 | 0.89 | 0.79 |
| M5 | 1.03 | 0.85 | 0.87 | 0.79 |
| M5-TA | 1.02 | 0.93 | 0.92 | 0.98 |
M2: line + genomic [SNPs] additive effect + spatial effect + line × environment interaction. M3 expand M2 by adding a genomic [SNPs] additive × environmental covariates (ECs) interaction. M4 expand M2 to the multi-trait case. M4-TA is the M4 using trait-assisted (TA) genomic prediction. M5 expand M3 to the multi-trait case. M5-TA is the M4 using TA genomic prediction. g: genomic [SNPs] additive effect. gw: genomic [SNPs] additive × ECs interaction effect.
Slope of regression (b) of estimated genetic values with whole information on genetic values with partial information for protein content in leave-one-year-location-out and leave-one-breeding-cycle-out cross-validations.
| Models | CV1: leave-one-year-location-out | CV2: leave-one-breeding-cycle-out | ||
| M2 | 1.04 | – | 0.80 | – |
| M4 | 1.03 | – | 0.81 | – |
| M4-TA | 1.03 | – | 0.88 | – |
| M3 | 1.04 | 0.98 | 0.89 | 0.86 |
| M5 | 1.04 | 0.95 | 0.90 | 0.81 |
| M5-TA | 1.04 | 1.00 | 0.93 | 0.98 |
M2: line + genomic [SNPs] additive effect + spatial effect + line × environment interaction. M3 expand M2 by adding a genomic [SNPs] additive × environmental covariates (ECs) interaction. M4 expand M2 to the multi-trait case. M4-TA is the M4 using trait-assisted (TA) genomic prediction. M5 expand M3 to the multi-trait case. M5-TA is the M4 using TA genomic prediction. g: genomic [SNPs] additive effect. gw: genomic [SNPs] additive × ECs interaction.