| Literature DB >> 32709618 |
Matías F Schrauf1, Johannes W R Martini2, Henner Simianer3, Gustavo de Los Campos4, Rodolfo Cantet5,6, Jan Freudenthal7, Arthur Korte7, Sebastián Munilla5,6.
Abstract
Genomic selection uses whole-genome marker models to predict phenotypes or genetic values for complex traits. Some of these models fit interaction terms between markers, and are therefore called epistatic. The biological interpretation of the corresponding fitted effects is not straightforward and there is the threat of overinterpreting their functional meaning. Here we show that the predictive ability of epistatic models relative to additive models can change with the density of the marker panel. In more detail, we show that for publicly available Arabidopsis and rice datasets, an initial superiority of epistatic models over additive models, which can be observed at a lower marker density, vanishes when the number of markers increases. We relate these observations to earlier results reported in the context of association studies which showed that detecting statistical epistatic effects may not only be related to interactions in the underlying genetic architecture, but also to incomplete linkage disequilibrium at low marker density ("Phantom Epistasis"). Finally, we illustrate in a simulation study that due to phantom epistasis, epistatic models may also predict the genetic value of an underlying purely additive genetic architecture better than additive models, when the marker density is low. Our observations can encourage the use of genomic epistatic models with low density panels, and discourage their biological over-interpretation.Entities:
Keywords: Additive Effects; Breeding; Epistasis; GenPred; Genomic Prediction; Genomics; Shared data resources
Mesh:
Year: 2020 PMID: 32709618 PMCID: PMC7466977 DOI: 10.1534/g3.120.401300
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Predictive accuracy of genomic models for the real and the first simulated trait of Arabidopsis (left panels) and Rice (right panels). Upper panels: Mean predictive accuracy. Lower panel: Difference in predictive accuracy of epistatic models relative to the additive model. Error bars mark 99% bootstrap confidence intervals. GRMs:
Difference between the epistatic and additive models predictive accuracy (r) with 99% bootstrap confidence intervals
| Traits | Panel | ||||||
|---|---|---|---|---|---|---|---|
| real | 0.066 | 0.057 | 0.074 | 0.046 | 0.040 | 0.053 | |
| 0.027 | 0.023 | 0.031 | 0.013 | 0.011 | 0.015 | ||
| −0.004 | −0.008 | 0.000 | 0.005 | 0.004 | 0.007 | ||
| −0.019 | −0.022 | −0.015 | −0.003 | −0.004 | −0.001 | ||
| −0.019 | −0.023 | −0.016 | −0.003 | −0.004 | −0.002 | ||
| simulated | 0.061 | 0.057 | 0.065 | 0.030 | 0.027 | 0.032 | |
| 0.040 | 0.037 | 0.042 | 0.020 | 0.019 | 0.021 | ||
| 0.004 | 0.002 | 0.006 | 0.006 | 0.005 | 0.007 | ||
| −0.009 | −0.011 | −0.007 | −0.001 | −0.002 | 0.000 | ||
| −0.011 | −0.013 | −0.009 | −0.002 | −0.003 | −0.001 | ||
| real | 0.099 | 0.091 | 0.106 | 0.046 | 0.041 | 0.051 | |
| 0.050 | 0.045 | 0.054 | 0.007 | 0.004 | 0.010 | ||
| 0.015 | 0.011 | 0.018 | −0.019 | −0.023 | −0.016 | ||
| 0.010 | 0.007 | 0.013 | −0.016 | −0.020 | −0.013 | ||
| 0.008 | 0.005 | 0.011 | −0.022 | −0.025 | −0.018 | ||
| simulated | 0.034 | 0.031 | 0.036 | 0.011 | 0.009 | 0.012 | |
| 0.013 | 0.012 | 0.014 | 0.000 | −0.001 | 0.001 | ||
| −0.009 | −0.010 | −0.008 | −0.005 | −0.006 | −0.004 | ||
| −0.013 | −0.014 | −0.012 | −0.027 | −0.028 | −0.025 | ||
| −0.013 | −0.014 | −0.012 | −0.007 | −0.008 | −0.006 | ||
Percentage of cross-validation folds, cross-validation replicates and simulated traits where the epistatic model achieved higher predictive accuracy (r) than the additive model
| Traits | Panel | folds | replicates | traits | folds | replicates | traits |
|---|---|---|---|---|---|---|---|
| real | 94 | 100 | — | 94 | 100 | — | |
| 98 | 100 | — | 95 | 100 | — | ||
| 41 | 0 | — | 81 | 100 | — | ||
| 4 | 0 | — | 33 | 0 | — | ||
| 3 | 0 | — | 29 | 0 | — | ||
| simulated | 88 | 100 | 100 | 84 | 100 | 100 | |
| 89 | 100 | 100 | 92 | 100 | 100 | ||
| 56 | 62 | 70 | 69 | 92 | 90 | ||
| 33 | 14 | 10 | 48 | 45 | 40 | ||
| 30 | 9 | 10 | 43 | 32 | 40 | ||
| real | 100 | 100 | — | 98 | 100 | — | |
| 99 | 100 | — | 72 | 100 | — | ||
| 86 | 100 | — | 5 | 0 | — | ||
| 78 | 100 | — | 12 | 0 | — | ||
| 76 | 100 | — | 4 | 0 | — | ||
| simulated | 92 | 100 | 100 | 72 | 86 | 90 | |
| 80 | 91 | 90 | 47 | 47 | 50 | ||
| 25 | 2 | 0 | 26 | 1 | 0 | ||
| 15 | 1 | 0 | 4 | 0 | 0 | ||
| 15 | 1 | 0 | 17 | 0 | 0 | ||
Figure 2Linkage disequilibrium (r) between contiguous markers for panels with varying densities. Connected dots trace quantiles 0.9, 0.7 and 0.5 of the LD distributions.
Percentage of variance components’ contribution to total genetic variance, for different GRMs, at the level of the markers. GRMs marked with (*) shown to illustrate dependence of K to bandwidth parameter (h) and are not used in the fitted models
| Species | GRM | % | % | % | % |
|---|---|---|---|---|---|
| Arabidopsis | G | 100.0 | 0.0 | 0.0 | 0.0 |
| H | 63.5 | 36.5 | 0.0 | 0.0 | |
| K(0.5) | 79.6 | 18.8 | 1.5 | 0.0 | |
| K(0.25)* | 63.2 | 31.7 | 4.8 | 0.3 | |
| Rice | H | 100.0 | 0.0 | 0.0 | 0.0 |
| G | 78.5 | 21.5 | 0.0 | 0.0 | |
| K(0.5) | 83.5 | 15.5 | 1.0 | 0.0 | |
| K(0.25)* | 70.8 | 25.9 | 3.2 | 0.2 |
Difference between the epistatic and additive models predictive accuracy (r) for all sets of fixed effects
| Traits | Panel | PC1 | PC5 | CL3 | CL9 | PC1 | PC5 | CL3 | CL9 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| real | 0.127 | 0.119 | 0.061 | 0.066 | 0.051 | 0.096 | 0.095 | 0.046 | 0.046 | 0.037 | |
| 0.035 | 0.035 | 0.028 | 0.027 | 0.026 | 0.015 | 0.015 | 0.014 | 0.013 | 0.014 | ||
| 0.003 | 0.003 | −0.003 | −0.004 | −0.004 | 0.007 | 0.007 | 0.006 | 0.005 | 0.005 | ||
| −0.012 | −0.013 | −0.019 | −0.019 | −0.019 | −0.001 | −0.001 | −0.002 | −0.003 | −0.003 | ||
| −0.013 | −0.014 | −0.019 | −0.019 | −0.020 | −0.001 | −0.001 | −0.003 | −0.003 | −0.003 | ||
| simulated | 0.068 | 0.067 | 0.048 | 0.061 | 0.048 | 0.034 | 0.034 | 0.021 | 0.030 | 0.022 | |
| 0.039 | 0.039 | 0.039 | 0.040 | 0.040 | 0.020 | 0.020 | 0.019 | 0.020 | 0.020 | ||
| 0.005 | 0.005 | 0.004 | 0.004 | 0.002 | 0.006 | 0.006 | 0.006 | 0.006 | 0.006 | ||
| −0.008 | −0.009 | −0.009 | −0.009 | −0.011 | −0.001 | −0.001 | −0.001 | −0.001 | −0.001 | ||
| −0.010 | −0.010 | −0.010 | −0.011 | −0.013 | −0.002 | −0.002 | −0.002 | −0.002 | −0.002 | ||
| real | 0.097 | 0.096 | 0.091 | 0.099 | 0.099 | 0.046 | 0.043 | 0.043 | 0.046 | 0.046 | |
| 0.049 | 0.050 | 0.050 | 0.050 | 0.050 | 0.006 | 0.005 | 0.006 | 0.007 | 0.006 | ||
| 0.014 | 0.014 | 0.015 | 0.015 | 0.015 | −0.020 | −0.020 | −0.020 | −0.019 | −0.019 | ||
| 0.009 | 0.009 | 0.009 | 0.010 | 0.010 | −0.017 | −0.017 | −0.017 | −0.016 | −0.016 | ||
| 0.008 | 0.008 | 0.008 | 0.008 | 0.009 | −0.022 | −0.022 | −0.022 | −0.022 | −0.022 | ||
| simulated | 0.034 | 0.034 | 0.027 | 0.034 | 0.033 | 0.011 | 0.012 | 0.009 | 0.011 | 0.010 | |
| 0.013 | 0.013 | 0.013 | 0.013 | 0.013 | 0.000 | 0.001 | 0.002 | 0.000 | 0.000 | ||
| −0.009 | −0.009 | −0.009 | −0.009 | −0.009 | −0.005 | −0.005 | −0.005 | −0.005 | −0.005 | ||
| −0.012 | −0.013 | −0.013 | −0.013 | −0.013 | −0.027 | −0.027 | −0.027 | −0.027 | −0.026 | ||
| −0.013 | −0.013 | −0.013 | −0.013 | −0.013 | −0.007 | −0.007 | −0.007 | −0.007 | −0.007 | ||