| Literature DB >> 25733798 |
Rajesh Talluri1, Sanjay Shete2.
Abstract
Epistasis helps to explain how multiple single-nucleotide polymorphisms (SNPs) interact to cause disease. A variety of tools have been developed to detect epistasis. In this article, we explore the strengths and weaknesses of an information theory approach for detecting epistasis and compare it to the logistic regression approach through simulations. We consider several scenarios to simulate the involvement of SNPs in an epistasis network with respect to linkage disequilibrium patterns among them and the presence or absence of main and interaction effects. We conclude that the information theory approach more efficiently detects interaction effects when main effects are absent, whereas, in general, the logistic regression approach is appropriate in all scenarios but results in higher false positives. We compute epistasis networks for SNPs in the FSD1L gene using a two-phase head and neck cancer genome-wide association study involving 2,185 cases and 4,507 controls to demonstrate the practical application of the methods.Entities:
Keywords: epistasis; head and neck cancer; information theory; networks; regression
Year: 2015 PMID: 25733798 PMCID: PMC4332043 DOI: 10.4137/CIN.S17289
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Epistasis networks for the four scenarios simulated on the basis of network 1. (A) The true simulated epistasis network. (B) Epistasis network for simulation scenario 1 – information theory approach. (C) Epistasis network for simulation scenario 1 – logistic regression approach. (D) Epistasis network for simulation scenario 2 – information theory approach. (E) Epistasis network for simulation scenario 2 – logistic regression approach. (F) Epistasis network for simulation scenario 3 – information theory approach. (G) Epistasis network for simulation scenario 3 – logistic regression approach. (H) Epistasis network for simulation scenario 4 – information theory approach. (I) Epistasis network for simulation scenario 4 – logistic regression approach.
Figure 2Epistasis networks for the four scenarios simulated on the basis of network 2. (A) The true simulated epistasis network. (B) Epistasis network for simulation scenario 1 – information theory approach. (C) Epistasis network for simulation scenario 1 – logistic regression approach. (D) Epistasis network for simulation scenario 2 – information theory approach. (E) Epistasis network for simulation scenario 2 – logistic regression approach. (F) Epistasis network for simulation scenario 3 – information theory approach. (G) Epistasis network for simulation scenario 3 – logistic regression approach. (H) Epistasis network for simulation scenario 4 – information theory approach. (I) Epistasis network for simulation scenario 4 – logistic regression approach.
Figure 3Epistasis networks for the four scenarios simulated on the basis of network 3. (A) The true simulated epistasis network. (B) Epistasis network for simulation scenario 1 – information theory approach. (C) Epistasis network for simulation scenario 1 – logistic regression approach. (D) Epistasis network for simulation scenario 2 – information theory approach. (E) Epistasis network for simulation scenario 2 – logistic regression approach. (F) Epistasis network for simulation scenario 3 – information theory approach. (G) Epistasis network for simulation scenario 3 – logistic regression approach. (H) Epistasis network for simulation scenario 4 – information theory approach. (I) Epistasis network for simulation scenario 4 – logistic regression approach.
Details of the four simulation scenarios.
| SIMULATION SCENARIO | MAIN EFFECTS | INTERACTION EFFECTS | LINKAGE DISEQUILIBRIUM |
|---|---|---|---|
| Scenario 1 | None | (1,2), (3,4), (5,6), (7,8), (9,10) | No |
| None | (1,2), (1,4), (5,6), (5,8), (9,10) | No | |
| None | (1,2) | No | |
| Scenario 2 | 1, 3, 9 | (1,2), (3,4), (5,6), (7,8), (9,10) | No |
| 1, 3, 9 | (1,2), (1,4), (5,6), (5,8), (9,10) | No | |
| 1, 3, 9 | (1,2) | No | |
| Scenario 3 | None | (1,2), (3,4), (5,6), (7,8), (9,10) | Yes |
| None | (1,2), (1,4), (5,6), (5,8), (9,10) | Yes | |
| None | (1,2) | Yes | |
| Scenario 4 | 1, 3, 9 | (1,2), (3,4), (5,6), (7,8), (9,10) | Yes |
| 1, 3, 9 | (1,2), (1,4), (5,6), (5,8), (9,10) | Yes | |
| 1, 3, 9 | (1,2) | Yes |
Notes: All the main effects and interaction effects that were present were simulated with an odds ratio of 2.
Figure 4Epistasis network for the phase 1 head and neck cancer GWAS. (A) Epistasis network – information theory approach. (B) Epistasis network – logistic regression approach.
Figure 5Epistasis network for the phase 2 head and neck cancer GWAS. (A) Epistasis network – information theory approach. (B) Epistasis network – logistic regression approach.