| Literature DB >> 34849749 |
Masao Ueki1, Gen Tamiya2,3,4.
Abstract
We propose a genetic prediction modeling approach for genome-wide association study (GWAS) data that can include not only marginal gene effects but also gene-environment (GxE) interaction effects-i.e., multiplicative effects of environmental factors with genes rather than merely additive effects of each. The proposed approach is a straightforward extension of our previous multiple regression-based method, STMGP (smooth-threshold multivariate genetic prediction), with the new feature being that genome-wide test statistics from a GxE interaction analysis are used to weight the corresponding variants. We develop a simple univariate regression approximation to the GxE interaction effect that allows a direct fit of the STMGP framework without modification. The sparse nature of our model automatically removes irrelevant predictors (including variants and GxE combinations), and the model is able to simultaneously incorporate multiple environmental variables. Simulation studies to evaluate the proposed method in comparison with other modeling approaches demonstrate its superior performance under the presence of GxE interaction effects. We illustrate the usefulness of our prediction model through application to real GWAS data from the Alzheimer's Disease Neuroimaging Initiative (ADNI).Entities:
Keywords: genetic prediction; gene–environment interaction; smooth thresholding
Mesh:
Year: 2021 PMID: 34849749 PMCID: PMC8664495 DOI: 10.1093/g3journal/jkab278
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Quantitative trait simulation with . Average predictive correlation coefficient (PCC) for eight models. For each scenario (shown in rows), high values are highlighted in red and low values in white. s: STMGP with E1 and E2 as covariates; sge1: STMGP with E1 and E2 as covariates and E1 as environmental variable for GxE interaction; sge2: STMGP with E1 and E2 as covariates and E2 as environmental variable for GxE interaction; sge12: STMGP with E1 and E2 as covariates, and E1 and E2 as environmental variables for GxE interaction; bg: BLUP with E1 and E2 as covariates; bge1: BLUP with E1 and E2 as covariates and E1 as environmental variable for GxE interaction; bge2: BLUP with E1 and E2 as covariates and E2 as environmental variable for GxE interaction; bge12: BLUP with E1 and E2 as covariates, and average of E1 and E2 as environmental variable for GxE interaction. Scenarios are denoted as _ dist, where dist means effect size distribution: Normal, NEG2, or Laplace.
Figure 2Quantitative trait simulation with . Average predictive correlation coefficient (PCC) for eight models. See Figure 1 for explanation of scenarios (shown in rows).
Figure 3Binary trait simulation with . Average area under the ROC curve (AUC) is shown for eight models. For each scenario (in rows), high values are highlighted in red and low values in white. s: STMGP with E1 and E2 as covariates; sge1: STMGP with E1 and E2 as covariates and E1 as environmental variable for GxE interaction; sge2: STMGP with E1 and E2 as covariates and E2 as environmental variable for GxE interaction; sge12: STMGP with E1 and E2 as covariates, and E1 and E2 as environmental variables for GxE interaction; bg: BLUP with E1 and E2 as covariates; bge1: BLUP with E1 and E2 as covariates and E1 as environmental variable for GxE interaction; bge2: BLUP with E1 and E2 as covariates and E2 as environmental variable for GxE interaction; bge12: BLUP with E1 and E2 as covariates, and average of E1 and E2 as environmental variable for GxE interaction. Scenarios are denoted as _ dist, where dist means effect size distribution: Normal, NEG2, or Laplace.
Figure 4Binary trait simulation with . Average area under the ROC curve (AUC) for eight models. See Figure 3 for explanation of scenarios (shown in rows).
Results of predicting four quantitative traits, FAQ, CDRSB, MMSE, and ADAS11
| Trait | Data | l0 | l | lge1 | lge2 | lge12 | s | sge1 | sge2 | sge12 | bg | bge1 | bge2 | bge12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FAQ | CV 1 | 0.07 | 0.16 | 0.15 | 0.17 | 0.16 | 0.11 | –0.01 | 0.15 | 0.05 | 0.14 | 0.13 | 0.13 | 0.12 |
| CV 2 | 0.17 | 0.35 | 0.33 | 0.36 | 0.34 | 0.26 | 0.24 | 0.32 | 0.31 | 0.32 | 0.35 | 0.33 | 0.33 | |
| CV 3 | 0.19 | 0.15 | 0.15 | 0.16 | 0.16 | 0.19 | 0.13 | 0.21 | 0.15 | 0.17 | 0.15 | 0.18 | 0.17 | |
| CV 4 | 0.01 | 0.26 | 0.26 | 0.27 | 0.27 | 0.31 | 0.18 | 0.24 | 0.19 | 0.23 | 0.28 | 0.25 | 0.23 | |
| CV 5 | 0.08 | 0.16 | 0.16 | 0.10 | 0.09 | 0.15 | 0.14 | 0.17 | 0.15 | 0.17 | 0.14 | 0.17 | 0.15 | |
| Mean | 0.10 | 0.21 | 0.21 | 0.21 | 0.20 | 0.20 | 0.14 | 0.22 | 0.17 | 0.21 | 0.21 | 0.21 | 0.20 | |
| SD | 0.08 | 0.09 | 0.08 | 0.10 | 0.10 | 0.08 | 0.09 | 0.07 | 0.09 | 0.07 | 0.10 | 0.08 | 0.09 | |
| CDRSB | CV 1 | 0.07 | 0.13 | 0.13 | 0.12 | 0.12 | 0.21 | 0.18 | 0.22 | 0.17 | 0.12 | 0.13 | 0.10 | 0.11 |
| CV 2 | 0.16 | 0.38 | 0.37 | 0.36 | 0.35 | 0.33 | 0.28 | 0.33 | 0.30 | 0.34 | 0.36 | 0.34 | 0.33 | |
| CV 3 | 0.22 | 0.26 | 0.26 | 0.26 | 0.26 | 0.28 | 0.26 | 0.26 | 0.25 | 0.25 | 0.25 | 0.26 | 0.27 | |
| CV 4 | 0.10 | 0.37 | 0.37 | 0.37 | 0.37 | 0.44 | 0.36 | 0.41 | 0.31 | 0.36 | 0.39 | 0.37 | 0.36 | |
| CV 5 | 0.19 | 0.27 | 0.26 | 0.25 | 0.22 | 0.27 | 0.25 | 0.28 | 0.27 | 0.27 | 0.25 | 0.27 | 0.27 | |
| Mean | 0.15 | 0.28 | 0.27 | 0.27 | 0.26 | 0.31 | 0.27 | 0.30 | 0.26 | 0.27 | 0.27 | 0.27 | 0.27 | |
| SD | 0.06 | 0.10 | 0.10 | 0.10 | 0.10 | 0.08 | 0.06 | 0.07 | 0.06 | 0.09 | 0.10 | 0.10 | 0.10 | |
| MMSE | CV 1 | 0.10 | 0.27 | 0.25 | 0.26 | 0.25 | 0.13 | 0.21 | 0.18 | 0.16 | 0.22 | 0.23 | 0.23 | 0.22 |
| CV 2 | 0.19 | 0.34 | 0.33 | 0.33 | 0.32 | 0.30 | 0.33 | 0.33 | 0.33 | 0.29 | 0.30 | 0.31 | 0.30 | |
| CV 3 | 0.30 | 0.35 | 0.35 | 0.35 | 0.35 | 0.28 | 0.26 | 0.34 | 0.35 | 0.37 | 0.38 | 0.36 | 0.36 | |
| CV 4 | 0.27 | 0.35 | 0.35 | 0.35 | 0.36 | 0.35 | 0.34 | 0.39 | 0.37 | 0.36 | 0.37 | 0.36 | 0.37 | |
| CV 5 | 0.17 | 0.28 | 0.26 | 0.28 | 0.25 | 0.25 | 0.23 | 0.26 | 0.22 | 0.29 | 0.28 | 0.29 | 0.27 | |
| Mean | 0.21 | 0.32 | 0.31 | 0.31 | 0.31 | 0.26 | 0.27 | 0.30 | 0.29 | 0.31 | 0.31 | 0.31 | 0.30 | |
| SD | 0.08 | 0.04 | 0.05 | 0.04 | 0.05 | 0.08 | 0.06 | 0.08 | 0.09 | 0.06 | 0.06 | 0.05 | 0.06 | |
| ADAS11 | CV 1 | 0.12 | 0.31 | 0.32 | 0.30 | 0.31 | 0.30 | 0.28 | 0.29 | 0.26 | 0.29 | 0.29 | 0.28 | 0.27 |
| CV 2 | 0.17 | 0.30 | 0.30 | 0.30 | 0.30 | 0.22 | 0.23 | 0.24 | 0.22 | 0.28 | 0.27 | 0.28 | 0.29 | |
| CV 3 | 0.15 | 0.29 | 0.30 | 0.29 | 0.30 | 0.22 | 0.26 | 0.24 | 0.26 | 0.29 | 0.29 | 0.29 | 0.29 | |
| CV 4 | 0.11 | 0.36 | 0.36 | 0.35 | 0.35 | 0.29 | 0.29 | 0.37 | 0.29 | 0.37 | 0.38 | 0.35 | 0.36 | |
| CV 5 | 0.22 | 0.34 | 0.32 | 0.33 | 0.32 | 0.30 | 0.28 | 0.34 | 0.24 | 0.33 | 0.31 | 0.32 | 0.31 | |
| Mean | 0.15 | 0.32 | 0.32 | 0.31 | 0.32 | 0.27 | 0.27 | 0.30 | 0.25 | 0.31 | 0.31 | 0.30 | 0.30 | |
| SD | 0.04 | 0.03 | 0.03 | 0.03 | 0.02 | 0.04 | 0.02 | 0.06 | 0.03 | 0.04 | 0.04 | 0.03 | 0.03 |
Prediction of each target trait is evaluated by the prediction correlation coefficient (PCC) from 5-fold cross-validation.
Data used to calculate PCC (CV 1–CV 5 denote each cross-validated dataset from 5-fold cross-validation) for each model are shown in row together with mean and standard deviation (SD).
Linear regression with SEX and EDU as predictors.
Linear regression with SEX, EDU, AGE, and APOE4 as predictors.
Linear regression with SEX, EDU, AGE, APOE4, and APOE4xSEX as predictors.
Linear regression with SEX, EDU, AGE, APOE4, and APOE4xEDU as predictors.
Linear regression with SEX, EDU, AGE, APOE4, APOE4xSEX, and APOE4xEDU as predictors.
STMGP with SEX, EDU, AGE, and APOE4 as covariates.
STMGP with SEX, EDU, AGE, and APOE4 as covariates, and SEX as environmental variable for GxE interaction.
STMGP with SEX, EDU, AGE, and APOE4 as covariates, and EDU as environmental variable for GxE interaction.
STMGP with SEX, EDU, AGE, and APOE4 as covariates, and AGE and EDU as environmental variables for GxE interaction.
BLUP with SEX, EDU, AGE, and APOE4 as covariates.
BLUP with SEX, EDU, AGE, and APOE4 as covariates, and SEX as environmental variable for GxE interaction.
BLUP with SEX, EDU, AGE, and APOE4 as covariates, and EDU as environmental variable for GxE interaction.
BLUP with SEX, EDU, AGE, and APOE4 as covariates, and average of AGE and EDU as environmental variable for GxE interaction.