| Literature DB >> 27578615 |
H Robert Frost1, Li Shen2, Andrew J Saykin2, Scott M Williams3, Jason H Moore3,4.
Abstract
Although gene-environment (G× E) interactions play an important role in many biological systems, detecting these interactions within genome-wide data can be challenging due to the loss in statistical power incurred by multiple hypothesis correction. To address the challenge of poor power and the limitations of existing multistage methods, we recently developed a screening-testing approach for G× E interaction detection that combines elastic net penalized regression with joint estimation to support a single omnibus test for the presence of G× E interactions. In our original work on this technique, however, we did not assess type I error control or power and evaluated the method using just a single, small bladder cancer data set. In this paper, we extend the original method in two important directions and provide a more rigorous performance evaluation. First, we introduce a hierarchical false discovery rate approach to formally assess the significance of individual G× E interactions. Second, to support the analysis of truly genome-wide data sets, we incorporate a score statistic-based prescreening step to reduce the number of single nucleotide polymorphisms prior to fitting the first stage penalized regression model. To assess the statistical properties of our method, we compare the type I error rate and statistical power of our approach with competing techniques using both simple simulation designs as well as designs based on real disease architectures. Finally, we demonstrate the ability of our approach to identify biologically plausible SNP-education interactions relative to Alzheimer's disease status using genome-wide association study data from the Alzheimer's Disease Neuroimaging Initiative (ADNI).Entities:
Keywords: gene-environment interactions; hierarchical FDR; penalized regression; screening testing
Mesh:
Year: 2016 PMID: 27578615 PMCID: PMC5108431 DOI: 10.1002/gepi.21997
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Figure 1Workflow for the SPUR screening‐testing G× E detection method as presented in Frost et al. (2015)
Figure 2Workflow for SPUR extended to support score statistic prescreening and hierarchical FDR
Estimated type I error rates at and and power at for the simulation study detailed in Section 3.2.2
| Type I Error Rate | Power ( | |||
|---|---|---|---|---|
| Method |
|
| Model 1 | Model 2 |
| One step | 0.051 | 0.010 | 0.019 | 0.061 |
| Screening testing, marginal assoc. filter | 0.052 | 0.011 | 0.381 | 0.244 |
| Screening testing, gene‐env. correl. filter | 0.051 | 0.010 | 0.000 | 0.590 |
| SPUR, marginal assoc. filter | 0.062 | 0.013 | 0.697 | 0.312 |
| SPUR, gene‐env. correl. filter | 0.057 | 0.012 | 0.000 | 0.665 |
| SPUR, hierarchical FDR | NA | NA | 0.659 | 0.636 |
| SPUR, global | NA | NA | 0.903 | 0.828 |
These SPUR variants reflect the results from FDR or hierarchical FDR control so lack comparable type I error control values.
Estimated type I error rates at and power at for the disease‐based simulation studies detailed in Section 3.2.3
| Type I Error Rate | Power | |||
|---|---|---|---|---|
| Disease | Method |
|
|
|
| Breast cancer | One step | 0.051 | 0.010 | 0.033 |
| Screening testing, marginal assoc. filter | 0.0528 | 0.012 | 0.020 | |
| Screening testing, gene‐env. correl. filter | 0.0525 | 0.011 | 0.021 | |
| SPUR, marginal assoc. filter | 0.055 | 0.012 | 0.161 | |
| SPUR, gene‐env. correl. filter | 0.052 | 0.012 | 0.134 | |
| SPUR, hierarchical FDR | NA | NA | 0.187 | |
| SPUR, global | NA | NA | 0.280 | |
| Type 2 diabetes | One step | 0.052 | 0.011 | 0.019 |
| Screening testing, marginal assoc. filter | 0.052 | 0.010 | 0.100 | |
| Screening testing, gene‐env. correl. filter | 0.051 | 0.010 | 0.015 | |
| SPUR, marginal assoc. filter | 0.056 | 0.012 | 0.081 | |
| SPUR, gene‐env. correl. filter | 0.055 | 0.011 | 0.015 | |
| SPUR, hierarchical FDR | NA | NA | 0.074 | |
| SPUR, global | NA | NA | 0.084 | |
| Rheumatoid arthritis | One step | 0.051 | 0.010 | 0.012 |
| Screening testing, marginal assoc. filter | 0.51 | 0.010 | 0.157 | |
| Screening testing, gene‐env. correl. filter | 0.050 | 0.010 | 0.042 | |
| SPUR, marginal assoc. filter | 0.055 | 0.011 | 0.144 | |
| SPUR, gene‐env. correl. filter | 0.055 | 0.011 | 0.034 | |
| SPUR, hierarchical FDR | NA | NA | 0.126 | |
| SPUR, global | NA | NA | 0.207 | |
Ten most significant education‐SNP interactions computed for the ADNI data using the one step and standard screening‐testing methods
| Method | dbSNP ID |
|
| FDR |
|---|---|---|---|---|
| One step |
| 4.006 |
| 0.9999452 |
|
| −3.816 |
| 0.9999452 | |
|
| 3.812 |
| 0.9999452 | |
|
| 3.801 |
| 0.9999452 | |
|
| 3.757 |
| 0.9999452 | |
|
| −3.753 |
| 0.9999452 | |
|
| 3.749 |
| 0.9999452 | |
|
| −3.737 |
| 0.9999452 | |
|
| 3.727 |
| 0.9999452 | |
|
| 3.727 |
| 0.9999452 | |
| Screening testing (marginal assoc. filter) |
| −2.3760 | 0.01748 | 0.4372200 |
|
| −2.2520 | 0.02429 | 0.4372200 | |
|
| −1.7700 | 0.07667 | 0.7074000 | |
|
| −1.7260 | 0.08438 | 0.7074000 | |
|
| 1.5490 | 0.12150 | 0.7074000 | |
|
| 1.5040 | 0.13260 | 0.7074000 | |
|
| −1.4400 | 0.14980 | 0.7074000 | |
|
| 1.4140 | 0.15720 | 0.7074000 | |
|
| −1.3290 | 0.18390 | 0.7164000 | |
|
| −1.2840 | 0.19900 | 0.7164000 | |
| Screening testing (gene‐env. correl. filter) |
| 3.163 | 0.001563 | 0.0562680 |
|
| 2.611 | 0.009021 | 0.1141560 | |
|
| 2.593 | 0.009513 | 0.1141560 | |
|
| 2.391 | 0.016800 | 0.1399200 | |
|
| −2.268 | 0.023320 | 0.1399200 | |
|
| −2.268 | 0.023320 | 0.1399200 | |
|
| 2.125 | 0.033560 | 0.1725943 | |
|
| −2.027 | 0.042630 | 0.1836831 | |
|
| 1.946 | 0.051640 | 0.1836831 | |
|
| 1.921 | 0.054700 | 0.1836831 |
Twenty most significant education‐SNP interactions computed via the extended SPUR method
| Marginal | Correlation | |||||||
|---|---|---|---|---|---|---|---|---|
| LR | LR | |||||||
| dbSNP ID | Associated gene |
|
| FDR | dbSNP ID |
|
| FDR |
|
| LOC105373456 | 2.13 | 0.000731 | 0.0323 |
| 0.381 | 0.00705 | 0.239 |
|
| FAM188B | 2.35 | 0.00115 | 0.0323 |
| 0.315 | 0.0105 | 0.239 |
|
|
| −1.33 | 0.00403 | 0.0705 |
| −0.267 | 0.0147 | 0.239 |
|
|
| −1.26 | 0.00949 | 0.0771 |
| −0.259 | 0.0227 | 0.266 |
|
| RNF150 | 1.48 | 0.0102 | 0.0771 |
| −0.336 | 0.0345 | 0.29 |
|
|
| 1.4 | 0.0118 | 0.0771 |
| −0.33 | 0.039 | 0.29 |
|
|
| −0.791 | 0.0127 | 0.0771 |
| −0.274 | 0.0602 | 0.377 |
|
|
| −0.976 | 0.0132 | 0.0771 |
| 0.203 | 0.089 | 0.432 |
|
| Intergenic | −0.9 | 0.0187 | 0.0934 |
| 0.228 | 0.0907 | 0.432 |
| (LOC100288868, RPL21P46) | ||||||||
|
| Intergenic | 1.62 | 0.0204 | 0.0934 |
| −0.18 | 0.117 | 0.497 |
| (LOC100128712, | ||||||||
|
| ‐ | −1.14 | 0.0269 | 0.106 |
| 0.207 | 0.138 | 0.524 |
|
| PPIH | −0.851 | 0.0292 | 0.106 |
| 0.159 | 0.15 | 0.524 |
|
| LOC101927437 | 0.839 | 0.0305 | 0.106 |
| 0.154 | 0.165 | 0.529 |
|
| ‐ | 0.643 | 0.0363 | 0.116 |
| −0.17 | 0.198 | 0.532 |
|
| ‐ | 0.989 | 0.0418 | 0.124 |
| 0.154 | 0.205 | 0.532 |
|
| FAM3C | −0.535 | 0.0545 | 0.149 |
| −0.202 | 0.206 | 0.532 |
|
| ‐ | 0.758 | 0.0573 | 0.149 |
| −0.14 | 0.234 | 0.562 |
|
| AXDND1 | 0.497 | 0.093 | 0.219 |
| −0.148 | 0.256 | 0.562 |
|
| ‐ | 0.746 | 0.0966 | 0.219 |
| −0.137 | 0.261 | 0.562 |
|
| LRRC20 | 0.58 | 0.0996 | 0.219 |
| −0.123 | 0.321 | 0.63 |
SNPs with published associations in NHGRI‐EBI GWAS Catalog (Welter et al., 2014) are marked in bold. Genes associated with significant education‐SNP interactions, at , that have a plausible AD association are marked in bold.