| Literature DB >> 36171888 |
Hailiang Song1, Xue Wang2, Yi Guo2, Xiangdong Ding2.
Abstract
Genotype by environment (G × E) interaction is fundamental in the biology of complex traits and diseases. However, most of the existing methods for genomic prediction tend to ignore G × E interaction (GEI). In this study, we proposed the genomic prediction method G × EBLUP by considering GEI. Meanwhile, G × EBLUP can also detect the genome-wide single nucleotide polymorphisms (SNPs) subject to GEI. Using comprehensive simulations and analysis of real data from pigs and maize, we showed that G × EBLUP achieved higher efficiency in mapping GEI SNPs and higher prediction accuracy than the existing methods, and its superiority was more obvious when the GEI variance was large. For pig and maize real data, compared with GBLUP, G × EBLUP showed improvement by 3% in the prediction accuracy for backfat thickness, while our findings indicated that the trait of days to 100 kg of pig was not affected by GEI and G × EBLUP did not improve the accuracy of genomic prediction for the trait. A significant advantage was observed for G × EBLUP in maize; the prediction accuracy was improved by ∼5.0 and 7.7% for grain weight and water content, respectively. Furthermore, G × EBLUP was not influenced by the number of environment levels. It could determine a favourable environment using SNP Bayes factors for each environment, implying that it is a robust and useful method for market-specific animal and plant breeding. We proposed G × EBLUP, a novel method for the estimation of genomic breeding value by considering GEI. This method identified the genome-wide SNPs that were susceptible to GEI and yielded higher genomic prediction accuracies and lower mean squared error compared with the GBLUP method.Entities:
Keywords: G × E interaction; bayes factors; genomic prediction; snps; traits
Year: 2022 PMID: 36171888 PMCID: PMC9510768 DOI: 10.3389/fgene.2022.972557
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Descriptive statistics for pig and maize population traits.
|
|
|
|
|
|
|
|
|
|
| Pig | AGE (day) | 28,827 | 1778 | 5 | 170.8 | 13.9 | 124.0 | 211.0 |
| BFT (mm) | 28,827 | 1778 | 5 | 11.8 | 2.4 | 5.0 | 30.7 | |
| Maize | GW (kg) | 2676 | 681 | 11 | 6.75 | 1.39 | 0.407 | 11.24 |
| WC (%) | 2676 | 681 | 11 | 26.89 | 4.58 | 14.80 | 47.80 |
AGE: days to 100 kg; BFT: backfat thickness adjusted to 100 kg; GW: grain weight; WC: water content.
N-obs: number of observations.
N-env: number of environments.
Significant G × E interaction single nucleotide polymorphisms (SNPs) detected on simulated data using the proposed G × EWAS method and the five approaches, StructLMM, Bartlett, F-Killeen, L-mean and L-median under different variance of G × E interactions. The SNP numbers overlapping with simulated genotype–environment interaction quantitative trait locus (306) are in parentheses.
| Variance of G × E interactions | 0.25 | 1 | 2 |
|---|---|---|---|
| G × EWAS | 2435 (43) | 5081 (64) | 3981 (77) |
| StructLMM | 2472 (41) | 5092(62) | 4084 (77) |
| Bartlett | 507 (9) | 3495 (33) | 3976 (54) |
| F-killeen | 101 (6) | 144 (2) | 109 (3) |
| L-mean | 224 (6) | 1037 (8) | 606 (14) |
| L-median | 188 (6) | 808 (8) | 439 (9) |
FIGURE 1G × E marker genome-wide association analysis of pig and maize population traits. AGE: days to 100 kg; BFT: backfat thickness adjusted to 100 kg; GW: grain weight; WC: water content.
Genomic prediction accuracies and mean squared error (MSE) for GBLUP and G × EBLUP method under different G × E interaction p-values (E01∼E05).
| Data set | Trait | Content | GBLUP | P-value | ||||
|---|---|---|---|---|---|---|---|---|
| E01 | E02 | E03 | E04 | E05 | ||||
| Simulation | One | SNP number | 45,323 | 23,517 | 14,210 | 8844 | 5543 | 2186 |
| Accuracy | 0.737 | 0.735 | 0.739 | 0.749 | 0.738 | 0.712 | ||
| MSE | 1.818 | 1.820 | 1.810 | 1.715 | 1.813 | 1.894 | ||
| Pig | AGE | SNP number | 56,445 | 27,762 | 14,117 | 7420 | 3964 | 2117 |
| Accuracy | 0.225 | 0.223 | 0.226 | 0.226 | 0.226 | 0.224 | ||
| MSE | 179.81 | 179.918 | 179.722 | 179.703 | 179.677 | 179.797 | ||
| BFT | SNP number | 56,445 | 37,448 | 25,242 | 17,110 | 11,801 | 8098 | |
| Accuracy | 0.268 | 0.275 | 0.276 | 0.272 | 0.269 | 0.268 | ||
| MSE | 2.707 | 2.693 | 2.69 | 2.699 | 2.704 | 2.706 | ||
| Maize | GW | SNP number | 59,401 | 17,285 | 4279 | 834 | 421 | 143 |
| Accuracy | 0.288 | 0.290 | 0.306 | 0.294 | 0.271 | 0.269 | ||
| MSE | 46.323 | 46.317 | 46.174 | 46.178 | 46.223 | 46.347 | ||
| WC | SNP number | 59,401 | 28,636 | 12,123 | 4168 | 2132 | 875 | |
| Accuracy | 0.295 | 0.301 | 0.315 | 0.318 | 0.293 | 0.273 | ||
| MSE | 721.588 | 721.229 | 721.129 | 720.854 | 721.590 | 722.009 | ||
Cut-off p-values for G × E interaction single nucleotide polymorphisms on G × E.
One randomly selected replicate.
The variance of G × E interactions was 0.25.
FIGURE 2GBLUP and G × EBLUP method for the (A) accuracy and (B) mean squared error (MSE) of genomic prediction under different G × E interaction variances. Genomic prediction (C) accuracy and (D) MSE under different numbers of environment variables.
FIGURE 3GBLUP and G × EBLUP method for the (A,C) accuracy and (B,D) mean squared error (MSE) of genomic prediction in pigs and maize. AGE: days to 100 kg; BFT: backfat thickness adjusted to 100 kg; GW: grain weight; WC: water content. MSE is a relative value by assuming that the MSE of GBLUP method is equal to 0 because of the large MSE value.
FIGURE 4Bayes factors of 10 single nucleotide polymorphisms for (A) AGE, (B) BFT, (C) GW and (D) WC in different environmental factors. AGE: days to 100 kg; BFT: backfat thickness adjusted to 100 kg; GW: grain weight; WC: water content.
Average computation time for G × EBLUP and GBLUP to complete each fold of cross-validation.
| Date set | Trait | G × EBLUP | GBLUP |
|---|---|---|---|
| Simulation | V0.25 | 45 min 48 s | 15 min 3 s |
| V1 | 46 min 27 s | 15 min 12 s | |
| V2 | 46 min 13 s | 15 min 9 s | |
| Pig | AGE | 30 min 9 s | 2 min 14 s |
| BFT | 30 min 14 s | 2 min 18 s | |
| Maize | GW | 26 min 13 s | 1 min 7 s |
| WC | 26 min 8 s | 1 min 10 s |
V0.25, V1 and V2: The traits with variance of G × E interactions of 0.25, 1 and 2, respectively, in simulated data.