William Murk, Andrew T DeWan.
Abstract
The identification of statistical SNP-SNP interactions may help explain the genetic etiology of many human diseases, but exhaustive genome-wide searches for these interactions have been difficult, due to a lack of power in most datasets. We aimed to use data from the Resource for Genetic Epidemiology Research on Adult Health and Aging (GERA) study to search for SNP-SNP interactions associated with 10 common diseases. FastEpistasis and BOOST were used to evaluate all pairwise interactions among approximately N = 300,000 single nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) ≥ 0.15, for the dichotomous outcomes of allergic rhinitis, asthma, cardiac disease, depression, dermatophytosis, type 2 diabetes, dyslipidemia, hemorrhoids, hypertensive disease, and osteoarthritis. A total of N = 45,171 subjects were included after quality control steps were applied. These data were divided into discovery and replication subsets; the discovery subset had > 80% power, under selected models, to detect genome-wide significant interactions (P < 10⁻¹²). Interactions were also evaluated for enrichment in particular SNP features, including functionality, prior disease relevancy, and marginal effects. No interaction in any disease was significant in both the discovery and replication subsets. Enrichment analysis suggested that, for some outcomes, interactions involving SNPs with marginal effects were more likely to be nominally replicated, compared to interactions without marginal effects. If SNP-SNP interactions play a role in the etiology of the studied conditions, they likely have weak effect sizes, involve lower-frequency variants, and/or involve complex models of interaction that are not captured well by the methods that were utilized.
Keywords: GERA; epistasis; gene-gene interaction; statistical interaction
Year: 2016 PMID: 27185397 PMCID: PMC4938657 DOI: 10.1534/g3.116.028563
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
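The genome-wide threshold for interactions quoted in the abstract (P below 10 to the power of -12) is close to what a Bonferroni correction over all SNP pairs would give. The sketch below reproduces that arithmetic. It assumes exactly 300,000 SNPs and a family-wise error rate of 0.05, neither of which is stated in this record, so it is a plausibility check rather than the authors' derivation.

```python
import math

# Assumptions (not stated in the record): exactly 300,000 SNPs and a
# family-wise error rate of 0.05 with a plain Bonferroni correction.
N_SNPS = 300_000
ALPHA = 0.05

n_pairs = math.comb(N_SNPS, 2)        # number of unordered SNP-SNP pairs
pairwise_threshold = ALPHA / n_pairs  # per-test significance threshold

print(f"pairs tested: {n_pairs:,}")
print(f"Bonferroni threshold: {pairwise_threshold:.2e}")
```

Under these assumptions the per-test threshold comes out slightly above 10 to the power of -12, which is consistent in order of magnitude with the threshold the abstract reports.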
Subject characteristics
| Condition | Dataset | N, Cases | N, Controls | Min. Age (Years), Range, Cases | Min. Age (Years), Range, Controls | Min. Age (Years), Median, Cases | Min. Age (Years), Median, Controls | Male, %, Cases | Male, %, Controls |
|---|---|---|---|---|---|---|---|---|---|
| Allergic rhinitis | Discovery | 10,258 | 30,933 | 18–84 | 18–84 | 64 | 59 | 31.1 | 37.2 |
| Allergic rhinitis | Replication | 976 | 3004 | 18–84 | 18–84 | 59 | 59 | 29.8 | 35.9 |
| Asthma | Discovery | 6486 | 34,669 | 18–84 | 18–84 | 64 | 59 | 28.3 | 37.1 |
| Asthma | Replication | 988 | 3028 | 18–84 | 18–84 | 64 | 59 | 26.3 | 37.4 |
| Cardiac disease | Discovery | 11,069 | 28,979 | 34–84 | 34–84 | 69 | 59 | 50.1 | 30.5 |
| Cardiac disease | Replication | 1004 | 3013 | 34–84 | 34–84 | 69 | 59 | 46.9 | 31.2 |
| Depression | Discovery | 4824 | 36,162 | 24–84 | 24–84 | 59 | 64 | 24.6 | 37.3 |
| Depression | Replication | 978 | 2992 | 24–84 | 24–84 | 59 | 59 | 24.3 | 36.9 |
| Dermatophytosis | Discovery | 5163 | 36,083 | 18–84 | 18–84 | 64 | 59 | 43.6 | 34.2 |
| Dermatophytosis | Replication | 989 | 2936 | 18–84 | 18–84 | 64 | 59 | 45.2 | 35.2 |
| Diabetes, type 2 | Discovery | 4563 | 35,573 | 34–84 | 34–84 | 64 | 59 | 49.4 | 33.9 |
| Diabetes, type 2 | Replication | 986 | 2943 | 34–84 | 34–84 | 64 | 59 | 48.8 | 33.7 |
| Dyslipidemia | Discovery | 23,061 | 17,021 | 34–84 | 34–84 | 64 | 59 | 40.2 | 29.8 |
| Dyslipidemia | Replication | 986 | 2997 | 34–84 | 34–84 | 64 | 59 | 38.3 | 30.5 |
| Hemorrhoids | Discovery | 6199 | 34,356 | 29–84 | 29–84 | 64 | 59 | 40.7 | 34.8 |
| Hemorrhoids | Replication | 1006 | 3117 | 29–84 | 29–84 | 64 | 59 | 41.6 | 34.0 |
| Hypertensive disease | Discovery | 21,713 | 18,332 | 34–84 | 34–84 | 69 | 54 | 40.1 | 31.6 |
| Hypertensive disease | Replication | 984 | 3036 | 34–84 | 34–84 | 64 | 59 | 38.1 | 30.8 |
| Osteoarthritis | Discovery | 15,454 | 23,578 | 39–84 | 39–84 | 69 | 59 | 31.9 | 38.7 |
| Osteoarthritis | Replication | 961 | 2985 | 39–84 | 39–84 | 69 | 59 | 32.5 | 38.8 |
Min. age, the minimum possible age of a subject (see the Materials and Methods section).
Most significant marginal associations, by condition
| Condition | RSID | Chr | A1 | A0 | Discovery, Unadjusted, OR (95% C.I.) | Discovery, Unadjusted, P | Discovery, Adjusted, OR (95% C.I.) | Discovery, Adjusted, P | Replication, Adjusted, OR (95% C.I.) | Replication, Adjusted, P | Genome-wide Sig.? | Rep.? | Anno. | Gene |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Allergic rhinitis | rs2160203 | 2 | G | A | 0.90 (0.87, 0.94) | 6.61E-08 | 0.90 (0.87, 0.94) | 5.75E-08 | 0.93 (0.82, 1.05) | 2.16E-01 | Yes | No | R, D, G | |
| Asthma | rs17612802 | 6 | C | T | 0.87 (0.83, 0.90) | 6.01E-13 | 0.86 (0.83, 0.90) | 2.46E-13 | 0.84 (0.76, 0.94) | 1.81E-03 | Yes | Yes | | (HLA-DQB1; see footnote) |
| Cardiac disease | rs6843082 | 4 | G | A | 1.06 (1.02, 1.10) | 4.57E-03 | 1.11 (1.07, 1.16) | 7.41E-07 | 0.97 (0.86, 1.13) | 8.51E-01 | No | No | ||
| Depression | rs7797095 | 7 | T | C | 1.13 (1.07, 1.19) | 1.12E-05 | 1.14 (1.08, 1.20) | 4.11E-06 | 0.94 (0.82, 1.09) | 4.18E-01 | No | No | G | |
| Dermatophytosis | rs35626362 | 4 | T | C | 0.90 (0.86, 0.94) | 2.43E-05 | 0.89 (0.85, 0.94) | 1.12E-05 | 1.05 (0.93, 1.19) | 3.92E-01 | No | No | ||
| Diabetes, type 2 | rs4506565 | 10 | T | A | 1.35 (1.29, 1.41) | 1.55E-37 | 1.35 (1.28, 1.41) | 3.22E-36 | 1.31 (1.17, 1.46) | 1.92E-06 | Yes | Yes | D, G | |
| Dyslipidemia | rs1367117 | 2 | A | G | 1.19 (1.16, 1.23) | 4.18E-30 | 1.24 (1.20, 1.28) | 1.83E-37 | 1.06 (0.94, 1.19) | 3.41E-01 | Yes | No | EX, D, G | |
| Hemorrhoids | rs6106205 | 20 | C | T | 1.11 (1.07, 1.16) | 5.52E-07 | 1.11 (1.07, 1.16) | 6.57E-07 | 1.01 (1.00, 1.00) | 9.99E-01 | No | No | G | |
| Hypertensive dis. | rs1275985 | 2 | C | T | 1.09 (1.06, 1.12) | 1.02E-08 | 1.11 (1.07, 1.14) | 2.82E-10 | 1.17 (1.07, 1.34) | 1.84E-03 | Yes | Yes | G | |
| Osteoarthritis | rs6925021 | 6 | A | G | 0.94 (0.91, 0.97) | 2.96E-04 | 0.92 (0.89, 0.95) | 1.92E-06 | 1.05 (0.94, 1.19) | 3.62E-01 | No | No | ||
Most significant marginal effects, by condition, ranked by significance in each discovery adjusted analysis. RSID, reference SNP cluster ID; A1, nonreferent allele; A0, referent allele; OR, odds ratio; C.I., confidence interval; P, P-value; Genome-wide Sig.?, whether or not the P-value from the discovery adjusted analysis was less than 10⁻⁷; Rep.?, whether or not the marginal effect was nominally replicated; Anno., annotation assigned to the respective SNP, coded as follows: R, regulatory; D, disease-gene; G, any-gene; EX, exonic. M (marginal) is not shown, since all listed SNPs have that annotation.
Although not formally assigned a gene annotation due to its distance, this SNP (rs17612802, asthma) is located 6.9 kb from the HLA-DQB1 gene.
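The "Genome-wide Sig.?" column can be checked directly against the threshold given in the table note. The short sketch below applies that threshold to the discovery adjusted P-values copied from the table; the variable names are illustrative and not taken from the paper.

```python
# Discovery adjusted P-values, copied from the marginal-association table.
discovery_adjusted_p = {
    "Allergic rhinitis": 5.75e-08,
    "Asthma": 2.46e-13,
    "Cardiac disease": 7.41e-07,
    "Depression": 4.11e-06,
    "Dermatophytosis": 1.12e-05,
    "Diabetes, type 2": 3.22e-36,
    "Dyslipidemia": 1.83e-37,
    "Hemorrhoids": 6.57e-07,
    "Hypertensive dis.": 2.82e-10,
    "Osteoarthritis": 1.92e-06,
}

# Threshold for marginal effects as stated in the table note.
GENOME_WIDE_THRESHOLD = 1e-7

genome_wide_significant = {
    condition
    for condition, p_value in discovery_adjusted_p.items()
    if p_value < GENOME_WIDE_THRESHOLD
}

for condition in sorted(genome_wide_significant):
    print(condition)
```

The five conditions selected this way are the same five marked "Yes" in the table.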
Most significant interactions (overall and among those that were nominally replicated), by condition; FastEpistasis with logistic regression
| Condition/Rank | SNP1 RSID | SNP1 Chr | SNP1 A1 | SNP1 A0 | SNP2 RSID | SNP2 Chr | SNP2 A1 | SNP2 A0 | Discovery FE, P | Discovery Unadj., P | Discovery Adj., OR (95% C.I.) | Discovery Adj., P | Replication Unadj., P | Replication Adj., OR (95% C.I.) | Replication Adj., P | Rep.? | Anno1 | Anno2 | Gene1 | Gene2 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Allergic rhinitis | ||||||||||||||||||||
| 1 | rs493725 | 11 | C | T | rs12456095 | 18 | A | G | 1.49E-10 | 1.52E-10 | 1.21 (1.14, 1.28) | 8.61E-11 | 6.10E-01 | 0.94 (0.78, 1.13) | 5.29E-01 | No | G | |||
| 74 | rs10710098 | 12 | – | A | rs837473 | 12 | G | A | 1.69E-09 | 2.33E-09 | 1.22 (1.14, 1.30) | 2.08E-09 | 2.39E-02 | 1.27 (1.03, 1.57) | 2.48E-02 | Yes | G | |||
| Asthma | ||||||||||||||||||||
| 1 | rs6764801 | 3 | G | T | rs4574345 | 4 | C | T | 2.57E-11 | 2.84E-11 | 1.27 (1.19, 1.37) | 2.50E-11 | 1.11E-01 | 1.20 (0.98, 1.47) | 7.62E-02 | No | ||||
| 19 | rs7480608 | 11 | A | C | rs996197 | 20 | A | C | 8.41E-10 | 8.31E-10 | 0.79 (0.73, 0.85) | 3.53E-10 | 5.40E-02 | 0.81 (0.65, 1.00) | 4.68E-02 | Yes | G | |||
| Cardiac disease | ||||||||||||||||||||
| 1 | rs1407721 | 1 | T | G | rs455326 | 5 | A | G | 5.09E-10 | 6.86E-10 | 1.22 (1.15, 1.29) | 1.64E-11 | 4.28E-01 | 1.01 (0.84, 1.22) | 8.97E-01 | No | ||||
| 100 | rs9322768 | 6 | C | T | rs6463841 | 7 | G | A | 7.47E-09 | 6.54E-09 | 0.86 (0.82, 0.91) | 4.19E-09 | 6.08E-02 | 0.80 (0.69, 0.94) | 7.20E-03 | Yes | G | |||
| Depression | ||||||||||||||||||||
| 1 | rs16912862 | 9 | G | A | rs4769180 | 13 | C | T | 7.78E-11 | 5.40E-11 | 0.79 (0.74, 0.85) | 1.81E-11 | 9.92E-01 | 1.00 (0.85, 1.17) | 9.56E-01 | No | G | |||
| 3 | rs7587468 | 2 | G | A | rs13120959 | 4 | T | G | 1.16E-10 | 6.72E-11 | 0.80 (0.75, 0.86) | 1.26E-10 | 5.17E-02 | 0.84 (0.71, 0.99) | 3.51E-02 | Yes | G | |||
| Dermatophytosis | ||||||||||||||||||||
| 1 | rs4456135 | 1 | C | T | rs12162346 | 2 | T | C | 1.76E-12 | 2.02E-12 | 1.29 (1.20, 1.38) | 4.29E-13 | 1.43E-01 | 0.87 (0.74, 1.04) | 1.23E-01 | No | G | |||
| 17 | rs4318363 | 2 | C | T | rs7896441 | 10 | G | A | 3.02E-10 | 4.45E-10 | 1.23 (1.16, 1.32) | 1.98E-10 | 3.54E-02 | 1.19 (1.01, 1.39) | 3.27E-02 | Yes | G | |||
| Diabetes, type 2 | ||||||||||||||||||||
| 1 | rs6677074 | 1 | A | C | rs34332506 | 3 | C | T | 4.84E-10 | 5.33E-10 | 0.79 (0.73, 0.85) | 1.02E-10 | 8.66E-01 | 1.00 (0.84, 1.18) | 9.81E-01 | No | G | G | ||
| 8 | rs4986223 | 18 | C | T | rs59493447 | 22 | T | G | 3.29E-10 | 8.47E-10 | 1.33 (1.22, 1.45) | 3.11E-10 | 2.83E-02 | 1.26 (1.01, 1.57) | 3.65E-02 | Yes | G | D, G | ||
| Dyslipidemia | ||||||||||||||||||||
| 1 | rs3860935 | 9 | T | C | rs12243792 | 10 | A | G | 8.90E-11 | 8.99E-11 | 1.20 (1.14, 1.26) | 1.09E-11 | 7.29E-01 | 1.03 (0.86, 1.24) | 7.56E-01 | No | G | G, M | ||
| 57 | rs1655483 | 11 | G | A | rs2617815 | 19 | G | A | 4.51E-09 | 4.18E-09 | 1.18 (1.12, 1.25) | 1.69E-09 | 3.22E-03 | 1.28 (1.05, 1.56) | 1.33E-02 | Yes | R, G | |||
| Hemorrhoids | ||||||||||||||||||||
| 1 | rs12043442 | 1 | C | T | rs16858754 | 3 | T | C | 4.92E-11 | 2.00E-11 | 0.71 (0.65, 0.79) | 1.05E-11 | 3.08E-01 | 0.88 (0.69, 1.13) | 3.21E-01 | No | ||||
| 7 | rs2564056 | 2 | C | T | rs13421607 | 2 | T | C | 1.14E-10 | 1.99E-10 | 1.22 (1.15, 1.30) | 2.44E-10 | 1.31E-02 | 1.25 (1.06, 1.48) | 8.07E-03 | Yes | ||||
| Hypertensive disease | ||||||||||||||||||||
| 1 | rs10187912 | 2 | A | G | rs9929738 | 16 | G | A | 9.09E-10 | 9.27E-10 | 0.82 (0.77, 0.87) | 1.41E-11 | 1.91E-01 | 0.92 (0.74, 1.15) | 4.63E-01 | No | ||||
| 87 | rs7519626 | 1 | C | T | rs17342461 | 4 | C | T | 1.52E-08 | 1.39E-08 | 0.87 (0.83, 0.91) | 2.20E-09 | 9.65E-02 | 0.84 (0.71, 0.99) | 4.12E-02 | Yes | ||||
| Osteoarthritis | ||||||||||||||||||||
| 1 | rs7316595 | 12 | C | T | rs2066936 | 19 | G | A | 1.05E-09 | 1.05E-09 | 0.85 (0.81, 0.89) | 1.86E-11 | 9.77E-01 | 1.06 (0.90, 1.24) | 5.03E-01 | No | G | G | ||
| 6 | rs272051 | 2 | G | A | rs7630522 | 3 | T | C | 1.08E-08 | 1.01E-08 | 1.18 (1.13, 1.25) | 1.33E-10 | 5.87E-02 | 1.22 (1.01, 1.46) | 3.42E-02 | Yes | ||||
Interactions were first analyzed with FastEpistasis and then subjected to a follow-up analysis with logistic regression. For each condition, the most significant of all interactions is listed, followed by the most significant interaction that was nominally replicated. Interactions were ranked by significance in the adjusted logistic regression analysis (discovery). Numbers in the leftmost column indicate the overall ranks of the interactions for the respective condition. Blanks in the “Anno” or “Gene” columns indicate no annotation or gene assigned to the respective SNP. All “adjusted” analyses were adjusted for the first two principal components, birth year category, and sex. SNP, single nucleotide polymorphism; Rep.?, whether or not the interaction was nominally replicated; Anno1/Anno2, annotation assigned to SNP1 or SNP2, respectively; Gene1/Gene2, gene assigned to SNP1 or SNP2, respectively; RSID, reference SNP cluster ID; Chr, chromosome number; A1, nonreferent allele; A0, referent allele; FE, P, P-value from the FastEpistasis analysis; Unadj., P, P-value from the unadjusted logistic regression analysis; Adj., OR, interaction odds ratio and 95% confidence interval, from the adjusted logistic regression analysis; Adj., P, P-value from the adjusted logistic regression analysis; G, any-gene; D, disease-gene; M, marginal; R, regulatory.
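The interaction odds ratios, confidence intervals, and P-values in this table can be cross-checked against one another. The sketch below recovers an approximate two-sided P-value from a reported odds ratio and its 95% confidence interval. It assumes a Wald-type interval that is symmetric on the log-odds scale, which this record does not confirm; the function name `wald_p_from_or_ci` is illustrative.

```python
import math

def wald_p_from_or_ci(odds_ratio, lower, upper, z_crit=1.959964):
    """Approximate two-sided P-value from an OR and its 95% CI.

    Assumes the interval is exp(log(OR) +/- z_crit * SE), i.e. a Wald
    interval; this is an assumption, not something stated in the record.
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail probability: erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))

# Allergic rhinitis, rank 74, replication (adjusted):
# OR 1.27 (1.03, 1.57), reported P = 2.48E-02.
p_approx = wald_p_from_or_ci(1.27, 1.03, 1.57)
print(f"approximate P = {p_approx:.3f}")
```

The result lands near the reported value; small differences are expected because the published odds ratio and interval are rounded to two decimals.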
Most significant interactions (overall and among those that were nominally replicated), by condition; BOOST
| Condition/Rank | SNP1 RSID | SNP1 Chr | SNP1 A1 | SNP1 A0 | SNP2 RSID | SNP2 Chr | SNP2 A1 | SNP2 A0 | Discovery P | Replication P | Rep.? | Anno1 | Anno2 | Gene1 | Gene2 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Allergic rhinitis | |||||||||||||||
| 1 | rs13403689 | 2 | T | G | rs11086806 | 20 | G | T | 3.92E-11 | 1.52E-01 | No | G | G | ||
| 31 | rs7581504 | 2 | A | G | rs11086806 | 20 | G | T | 7.50E-10 | 4.97E-02 | Yes | G, M | G | ||
| Asthma | |||||||||||||||
| 1 | rs1461773 | 6 | A | G | rs1362930 | 7 | T | C | 2.55E-11 | 6.31E-01 | No | G | |||
| 30 | rs6722509 | 2 | C | T | rs11709714 | 3 | A | G | 8.78E-10 | 3.73E-02 | Yes | G | G | ||
| Cardiac disease | |||||||||||||||
| 1 | rs12943579 | 17 | G | A | rs886617 | 22 | T | C | 6.54E-11 | 7.48E-01 | No | G | |||
| 26 | rs2580405 | 2 | T | G | rs2355635 | 10 | C | T | 4.51E-10 | 3.72E-02 | Yes | ||||
| Depression | |||||||||||||||
| 1 | rs2651975 | 12 | C | A | rs9940287 | 16 | T | C | 4.85E-11 | 6.67E-02 | No | G | |||
| 18 | rs6414384 | 3 | G | A | rs10843021 | 12 | T | C | 6.16E-10 | 2.27E-02 | Yes | G | |||
| Dermatophytosis | |||||||||||||||
| 1 | rs74378451 | 10 | G | A | rs1536032 | 13 | A | G | 1.20E-12 | 9.44E-01 | No | G | M | ||
| 18 | rs400883 | 4 | G | T | rs12897227 | 14 | A | G | 3.22E-10 | 2.61E-03 | Yes | G | G | ||
| Diabetes, type 2 | |||||||||||||||
| 1 | rs1327614 | 1 | G | A | rs12895385 | 14 | C | T | 2.33E-11 | 6.92E-01 | No | ||||
| 2 | rs11900922 | 2 | C | T | rs770116 | 12 | T | C | 2.69E-11 | 7.99E-03 | Yes | D, G | |||
| Dyslipidemia | |||||||||||||||
| 1 | rs7646670 | 3 | T | C | rs72841214 | 17 | T | C | 2.80E-11 | 6.73E-01 | No | ||||
| 10 | rs251162 | 5 | G | A | rs3783322 | 14 | G | A | 2.84E-10 | 4.95E-02 | Yes | G | |||
| Hemorrhoids | |||||||||||||||
| 1 | rs11684491 | 2 | G | A | rs6792001 | 3 | A | G | 1.18E-11 | 2.97E-02 | Yes | ||||
| Hypertensive disease | |||||||||||||||
| 1 | rs3128854 | 6 | A | G | rs75377761 | 6 | C | T | 1.09E-11 | 5.66E-01 | No | G | G | ||
| 15 | rs2310357 | 4 | T | C | rs4449525 | 5 | G | A | 2.51E-10 | 1.56E-02 | Yes | R, G | |||
| Osteoarthritis | |||||||||||||||
| 1 | rs57799846 | 1 | A | G | rs8046139 | 16 | C | T | 1.85E-11 | 5.33E-01 | No | G | |||
| 14 | rs4408841 | 3 | A | G | rs4858960 | 3 | A | C | 2.78E-10 | 1.69E-02 | Yes | ||||
For each condition, the most significant of all interactions is listed, followed by the most significant interaction that was nominally replicated. Interactions were ranked by significance in the BOOST analysis (discovery). Numbers in the leftmost column indicate the overall ranks of the interactions for the respective condition. Blanks in the “Anno” or “Gene” columns indicate no annotation or gene assigned to the respective SNP. SNP, single nucleotide polymorphism; Discovery P, P-value from the discovery BOOST analysis; Replication P, P-value from the replication BOOST analysis; Rep.?, whether or not the interaction was nominally replicated; Anno1/Anno2, annotation assigned to SNP1 or SNP2, respectively; Gene1/Gene2, gene assigned to SNP1 and SNP2, respectively; RSID, reference SNP cluster ID; Chr, chromosome number; A1, nonreferent allele; A0, referent allele; G, any-gene; M, marginal; D, disease-gene; R, regulatory.
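This table reports only P-values, which fits a genotype-based test on a 3 × 3 × 2 table of counts (SNP1 genotype by SNP2 genotype by disease status). The sketch below shows one such test: a 4-degree-of-freedom likelihood-ratio test comparing the observed counts with a log-linear model that has no three-way interaction term, fitted by iterative proportional fitting. It illustrates the general technique only. It is not the BOOST implementation, the function names are illustrative, and the counts are hypothetical.

```python
import math

def fit_homogeneous_association(obs, n_iter=100):
    """Fit the no-three-way-interaction log-linear model by iterative
    proportional fitting. obs[i][j][k] is the count for genotype i at SNP1,
    genotype j at SNP2, and disease status k. Assumes all counts are positive."""
    I, J, K = len(obs), len(obs[0]), len(obs[0][0])
    fit = [[[1.0] * K for _ in range(J)] for _ in range(I)]
    for _ in range(n_iter):
        for i in range(I):  # match the SNP1 x SNP2 margins
            for j in range(J):
                scale = sum(obs[i][j]) / sum(fit[i][j])
                for k in range(K):
                    fit[i][j][k] *= scale
        for i in range(I):  # match the SNP1 x status margins
            for k in range(K):
                scale = (sum(obs[i][j][k] for j in range(J))
                         / sum(fit[i][j][k] for j in range(J)))
                for j in range(J):
                    fit[i][j][k] *= scale
        for j in range(J):  # match the SNP2 x status margins
            for k in range(K):
                scale = (sum(obs[i][j][k] for i in range(I))
                         / sum(fit[i][j][k] for i in range(I)))
                for i in range(I):
                    fit[i][j][k] *= scale
    return fit

def interaction_lrt(obs):
    """Return (G2, P) for the 4-df likelihood-ratio test of interaction."""
    fit = fit_homogeneous_association(obs)
    g2 = 2.0 * sum(o * math.log(o / f)
                   for oi, fi in zip(obs, fit)
                   for oj, fj in zip(oi, fi)
                   for o, f in zip(oj, fj))
    g2 = max(g2, 0.0)  # guard against tiny negative rounding error
    p = math.exp(-g2 / 2) * (1 + g2 / 2)  # chi-square upper tail, df = 4
    return g2, p

# Hypothetical counts, not from the study: a table with no interaction,
# and a copy with one case cell inflated to create an interaction.
a, b, c = [40, 20, 10], [3, 2, 1], [10, 5]
null_table = [[[ai * bj * ck for ck in c] for bj in b] for ai in a]
alt_table = [[row[:] for row in plane] for plane in null_table]
alt_table[2][2][1] = 150

g2_null, p_null = interaction_lrt(null_table)
g2_alt, p_alt = interaction_lrt(alt_table)
print(f"no interaction:   G2 = {g2_null:.3f}, P = {p_null:.3g}")
print(f"with interaction: G2 = {g2_alt:.3f}, P = {p_alt:.3g}")
```

The closed-form tail probability is specific to 4 degrees of freedom, which is what a 3 × 3 × 2 table gives; tables with empty cells would need extra handling that this sketch omits.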