| Literature DB >> 27386793 |
H Robert Frost1, Christopher I Amos1, Jason H Moore2.
Abstract
Statistical interactions between markers of genetic variation, or gene-gene interactions, are believed to play an important role in the etiology of many multifactorial diseases and other complex phenotypes. Unfortunately, detecting gene-gene interactions is extremely challenging due to the large number of potential interactions and ambiguity regarding marker coding and interaction scale. For many data sets, there is insufficient statistical power to evaluate all candidate gene-gene interactions. In these cases, a global test for gene-gene interactions may be the best option. Global tests have much greater power relative to multiple individual interaction tests and can be used on subsets of the markers as an initial filter prior to testing for specific interactions. In this paper, we describe a novel global test for gene-gene interactions, the global epistasis test (GET), that is based on results from random matrix theory. As we show via simulation studies based on previously proposed models for common diseases including rheumatoid arthritis, type 2 diabetes, and breast cancer, our proposed GET method has superior performance characteristics relative to existing global gene-gene interaction tests. A glaucoma GWAS data set is used to demonstrate the practical utility of the GET method.Entities:
Keywords: gene-gene interaction; global test; random matrix theory
Mesh:
Substances:
Year: 2016 PMID: 27386793 PMCID: PMC5132142 DOI: 10.1002/gepi.21990
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Partitioned sample correlation matrices for data simulated with five SNPs and an interaction between SNPs 1 and 2, that is, marker variables G 1 and G 2
| Average | Average | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| ||
|
| 1 | − | − | − | − |
| 1 | − | − | − | − |
|
| − | 1 | − | − | − |
|
| 1 | − | − | − |
|
| 0.05 | 0.04 | 1 | − | − |
| 0.06 | 0.09 | 1 | − | − |
|
| 0.06 | 0.05 | 0.07 | 1 | − |
| 0.07 | 0.07 | 0.07 | 1 | − |
|
| 0.06 | 0.05 | 0.05 | 0.09 | 1 |
| 0.06 | 0.08 | 0.09 | 0.06 | 1 |
The mean case and control correlation coefficients for the first two SNPs are in bold.
Figure 1Comparison of the empirical density of the test statistic defined in (5) (solid line) and the Tracy‐Widom law of order 1 distribution (dotted line). In plot (a), the density was computed from 2,500 simulated data sets each containing 100 SNPs without marginal or interaction effects (see text in Section 2.2.2 for simulation details). In plot (b), the density was computed from 2,500 simulated data sets each containing 100 SNPs with no interaction effects and a marginal association for the first five SNPs.
Results for the type I error control simulation as detailed in Section 3.1
| Simulation settings | Type I error rate | |||
|---|---|---|---|---|
| Marginal assoc. | (No. of subjects)/(No. of SNPs) | SNP‐SNP | GET | Benchmark method |
| ✓ | 250/50=5 | 0.0 | 0.044 | 0.056 |
| ✓ | 500/50=10 | 0.0 | 0.024 | 0.044 |
| ✓ | 1,000/50=20 | 0.0 | 0.015 | 0.046 |
| ✓ | 2,000/50=40 | 0.0 | 0.017 | 0.049 |
| ✓ | 4,000/50=80 | 0.0 | 0.017 | 0.052 |
| ✓ | 250/50=5 | 0.1 | 0.033 | 1.000 |
| ✓ | 500/50=10 | 0.1 | 0.029 | 1.000 |
| ✓ | 1,000/50=20 | 0.1 | 0.025 | 1.000 |
| ✓ | 2,000/50=40 | 0.1 | 0.026 | 1.000 |
| ✓ | 4,000/50=80 | 0.1 | 0.018 | 1.000 |
| 250/50=5 | 0.0 | 0.035 | 0.047 | |
| 500/50=10 | 0.0 | 0.018 | 0.042 | |
| 1,000/50=20 | 0.0 | 0.027 | 0.050 | |
| 2,000/50=40 | 0.0 | 0.014 | 0.044 | |
| 4,000/50=80 | 0.0 | 0.019 | 0.060 | |
| 250/50=5 | 0.1 | 0.039 | 1.000 | |
| 500/50=10 | 0.1 | 0.025 | 1.000 | |
| 1,000/50=20 | 0.1 | 0.023 | 1.000 | |
| 2,000/50=40 | 0.1 | 0.024 | 1.000 | |
| 4,000/50=80 | 0.1 | 0.017 | 1.000 | |
Results for the power simulation as detailed in Section 2.3.2.
| Simulation settings | Power | ||
|---|---|---|---|
| Model no. | (No. of subjects)/(No. of SNPs) | GET | Benchmark method |
| 1 | 250/50=5 | 0.039 | 0.056 |
| 1 | 500/50=10 | 0.056 | 0.066 |
| 1 | 1,000/50=20 | 0.086 | 0.120 |
| 1 | 2,000/50=40 | 0.304 | 0.165 |
| 1 | 4,000/50=80 | 0.817 | 0.335 |
| 2 | 250/50=5 | 0.055 | 0.060 |
| 2 | 500/50=10 | 0.052 | 0.069 |
| 2 | 1,000/50=20 | 0.094 | 0.090 |
| 2 | 2,000/50=40 | 0.344 | 0.123 |
| 2 | 4,000/50=80 | 0.921 | 0.270 |
| 3 | 250/50=5 | 0.048 | 0.064 |
| 3 | 500/50=10 | 0.044 | 0.057 |
| 3 | 1,000/50=20 | 0.108 | 0.055 |
| 3 | 2,000/50=40 | 0.469 | 0.089 |
| 3 | 4,000/50=80 | 0.945 | 0.121 |
In the table, model no. refers to one of the numbered simulation models detailed in Section 2.3.2.
Estimated type I error rates at empirical power at the disease‐based simulation studies detailed in Section 2.3.3.
| Disease model | No. of subjects | Method | Type I error rate | Power |
|---|---|---|---|---|
| Breast cancer | 625 | Benchmark | 0.038 | 0.042 |
| 625 | GET | 0.058 | 0.466 | |
| 1,250 | Benchmark | 0.015 | 0.060 | |
| 1,250 | GET | 0.052 | 0.893 | |
| 2,500 | Benchmark | 0.031 | 0.225 | |
| 2,500 | GET | 0.062 | 1.000 | |
| Type 2 diabetes | 625 | Benchmark | 0.032 | 0.036 |
| 625 | GET | 0.040 | 0.202 | |
| 1,250 | Benchmark | 0.027 | 0.048 | |
| 1,250 | GET | 0.042 | 0.400 | |
| 2,500 | Benchmark | 0.018 | 0.143 | |
| 2,500 | GET | 0.063 | 0.806 | |
| Rheumatoid arthritis | 625 | Benchmark | 0.028 | 0.034 |
| 625 | GET | 0.048 | 0.279 | |
| 1,250 | Benchmark | 0.024 | 0.049 | |
| 1,250 | GET | 0.051 | 0.619 | |
| 2,500 | Benchmark | 0.025 | 0.124 | |
| 2,500 | GET | 0.058 | 0.965 |
Global gene‐gene interaction detection results for the GLAUGEN GWAS data using GET and the benchmark method using the procedure detailed in Section 2.3.4.
| Phenotype | No. of cases | No. of controls | GET FDR | Benchmark FDR |
|---|---|---|---|---|
| Primary open‐angle glaucoma (POAG) | 976 | 1,136 | 0.0094 | 0.175 |
| Paracentral vision loss (VFPA) | 127 | 510 | 0.414 | 0.853 |
| Peripheral vision loss (VFPE) | 357 | 175 | ∼0 | 0.0073 |
| Maximum untreated intraocular pressure (IOP) | 624 | 549 |
| 0.464 |
| Pattern standard deviation (VFPSD) | 432 | 433 | ∼0 | 0.0018 |
| Recent vertical cup/disk ratio (VCDR) | 678 | 606 | 0.00094 | 0.0128 |
SNP‐SNP interactions at a level‐specific FDR according to the interaction testing method detailed in Section 2.3.1.
| Phenotype | SNP 1 (gene) | SNP 2 (gene) | FDR |
|---|---|---|---|
| VFPE | rs13396549 | rs9863361 | 0.0012 |
| (PARD3B) | (∼3 kb from ncRNA LOC105734230) | ||
| IOP | rs10246477 | rs12324434 | 0.082 |
| (SEMA3E) | (DYX1C1) | ||
| VFPSD | rs2419666 | rs7914325 | 0.021 |
| (∼6 kb from CNV nsv995491) | (ABLIM1) | ||
| VCDR | rs481154 | rs11154524 | 0.029 |
| (DNM3) | (SAMD3) |
No significant SNP‐SNP interactions were found at for the primary open‐angle glaucoma (POAG) or paracentral vision loss (VFPA) phenotypes. For each SNP in the interactions, the rs number and associated gene, if one exists according to dbSNP, are listed. If no gene association exists for the SNP in dbSNP, the closest gene is indicated.