| Literature DB >> 20815886 |
Lara Sucheston1, Pritam Chanda, Aidong Zhang, David Tritchler, Murali Ramanathan.
Abstract
BACKGROUND: Multifactorial diseases such as cancer and cardiovascular diseases are caused by the complex interplay between genes and environment. The detection of these interactions remains challenging due to computational limitations. Information theoretic approaches use computationally efficient directed search strategies and thus provide a feasible solution to this problem. However, the power of information theoretic methods for interaction analysis has not been systematically evaluated. In this work, we compare power and Type I error of an information-theoretic approach to existing interaction analysis methods.Entities:
Mesh:
Year: 2010 PMID: 20815886 PMCID: PMC2996983 DOI: 10.1186/1471-2164-11-487
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of simulation sets used to test power to detect GGI and type I error.
| Model | Sample size | Number of SNPs | Number of Interactions | MAF | ||
|---|---|---|---|---|---|---|
| 1-GH | 2 | 0.5 | 0.05 | 0.013 | ||
| 2-GH | 0.025 | |||||
| 3-GH | 400 | 10 | 2 | 0.25 | 0.06 | 0.007 |
| 3 | 1 | 0.03 | ||||
| 4-GH | 2 | 0.1 | 0.025 | 0.003 | ||
| 4 | 1 | 0.012 | ||||
| 5A | 0.3 | 0.62 | ||||
| 5B | 0.3 | |||||
| 5C | 0.15 | |||||
| 6A | 0.1 | 0.22 | ||||
| 6B | 200 | 7 | 1 | 0.5 | 0.11 | |
| 6C | 0.056 | |||||
| 7A | 0.01 | 0.02 | ||||
| 7B | 0.01 | |||||
| 7C | 0.005 | |||||
| 1-GH and 7C | 600, 1200 & 2400 | 10 | 3 | 0.5 | 0.037 | 0.013 |
| 1-GH | 2400 | 865 | 2 | 0.5 | 0.05 | 0.013 |
| 2-GH | 0.025 | |||||
| 3-GH | 865 | 2 | 0.25 | 0.06 | 0.007 | |
| 3 | 1 | 0.03 | ||||
| 4-GH | 865 | 2 | 0.1 | 0.025 | 0.003 | |
| 4 | 1 | 0.012 | ||||
GH: Genetic Heterogenity; K= population prevalence; h= Broad sense heritability;
a Penetrance is modeled as in Table 2 [16]
b Penetrance is modeled as in Table 3 [2]
c Kp values for Interactions 1 and 2 are each 0.05 (penetrance table from Model 1-GH from [16]) and for interaction 3 Kp is 0.01 (penetrance table from Model 7C from [2])
d [27]
Penetrance tables for comparison of KWII to MDR.
| 0.0 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 | ||
| 0.1 | 0.0 | 0.1 | 0.0 | 0.05 | 0.0 | ||
| 0.0 | 0.1 | 0.0 | 0.1 | 0.1 | 0.0 | ||
| 0.08 | 0.07 | 0.05 | 0.07 | 0.05 | 0.02 | ||
| 0.1 | 0.0 | 0.1 | 0.05 | 0.09 | 0.01 | ||
| 0.03 | 0.1 | 0.04 | 0.02 | 0.01 | 0.03 | ||
The penetrance values are based on the models in [16].
Penetrance tables comparison of KWII to RPM.
| 0.2 | 0.0 | 1.0 | 0.23 | 0.09 | 0.79 | 0.25 | 0.15 | 0.65 | |||
| 0.0 | 0.6 | 0.0 | 0.09 | 0.51 | 0.09 | 0.15 | 0.45 | 0.15 | |||
| 1.0 | 0.0 | 0.2 | 0.79 | 0.09 | 0.23 | 0.65 | 0.15 | 0.25 | |||
| 0.0 | 0.0 | 0.4 | 0.03 | 0.03 | 0.31 | 0.05 | 0.05 | 0.25 | |||
| 0.0 | 0.2 | 0.0 | 0.03 | 0.17 | 0.03 | 0.05 | 0.15 | 0.05 | |||
| 0.4 | 0.0 | 0.0 | 0.31 | 0.03 | 0.03 | 0.25 | 0.05 | 0.05 | |||
| 0.0 | 0.0 | 0.04 | 0.003 | 0.003 | 0.031 | 0.005 | 0.005 | 0.025 | |||
| 0.1 | 0.02 | 0.0 | 0.003 | 0.017 | 0.003 | 0.005 | 0.015 | 0.005 | |||
| 0.04 | 0.0 | 0.0 | 0.031 | 0.003 | 0.003 | 0.025 | 0.005 | 0.005 | |||
The penetrance values are based on the models in [2].
Penetrance tables for comparing KWII to the other four competing methods.
| 0.02 | 0.053 | 0.02 | 0.02 | 0.053 | 0.02 | 0.035 | 0.035 | 0.042 | |||
| 0.053 | 0.02 | 0.053 | 0.053 | 0.02 | 0.053 | 0.035 | 0.038 | 0.035 | |||
| 0.02 | 0.053 | 0.02 | 0.02 | 0.053 | 0.02 | 0.042 | 0.035 | 0.035 | |||
The overall disease prevalence is K= 0.037. Only pairwise penetrances for SNP directly involved in Interactions 1, 2, and 3 are shown. The penetrance values for Interaction 1 and Interaction 2 are from [16] whereas those for Interaction 3 are from [2].
Figure 1Figure 1A and B show the KWII spectra corresponding for Model 3 and Model 3-GH. Note the x-axis scales differ between Figures 1A and B. To improve clarity, a subset of uninformative combinations is not included in the plot; this is indicated with the break. The error bars are standard deviations.
Power and proportion of false positive comparison of the KWII to MDR.
| Model | α | Interaction 1 | Interactions 1 & 2* | MDR PFP | ||
|---|---|---|---|---|---|---|
| KWII | MDR | KWII | MDR | |||
| 0.01 | 98.7 | 19.9 | 98.1 | 0.7 | 0.0047 | |
| 0.001 | 94.3 | 1.3 | 89.4 | 0.9 | 0.0003 | |
| 0.0001 | 85.6 | 0.6 | 72.9 | 0.4 | 0.0002 | |
| 0.01 | 100 | 36.0 | 100 | 61.7 | 0.0116 | |
| 0.001 | 99.7 | 12.2 | 99.5 | 33.9 | 0.0029 | |
| 0.0001 | 98.1 | 5.0 | 96.3 | 23.3 | 0.0013 | |
| 0.01 | 100 | 91.3 | - | - | 0.0254 | |
| 0.001 | 100 | 56.1 | - | - | 0.0112 | |
| 0.0001 | 100 | 33.1 | - | - | 0.0067 | |
| 0.01 | 58.3 | 5.3 | 32.3 | 1.5 | 0.0028 | |
| 0.001 | 28.2 | 1.4 | 8.2 | 0.6 | 0.0010 | |
| 0.0001 | 15.3 | 0.1 | 2.2 | 0.3 | 0.0001 | |
| 0.01 | 99.6 | 54.0 | - | - | 0.0164 | |
| 0.001 | 97.5 | 8.0 | - | - | 0.0042 | |
| 0.0001 | 91.5 | 0.5 | - | - | 0.0010 | |
| 0.01 | 48.2 | 0.7 | 22.1 | 2.0 | 0.0019 | |
| 0.001 | 19.6 | 0 | 3.4 | 0.6 | 0.0005 | |
| 0.0001 | 9.1 | 0 | 0.9 | 0.3 | 0.0001 | |
*Models 3 and 4 had only a two SNP interaction (Interaction 1) present and thus power values are not applicable.
The simulations are based on models in [16].
Comparison of the power and proportion of false positives of KWII to RPM.
| Model | α | Power % | RPM PFP* | |||
|---|---|---|---|---|---|---|
| KWII | RPM | |||||
| 0.3 | 0.62 | 5A | 0.001 | 100 | 100 | 0.0010 |
| 0.0001 | 100 | 100 | 0.0002 | |||
| 0.3 | 5B | 0.001 | 100 | 100 | 0.0018 | |
| 0.0001 | 100 | 100 | 0.0004 | |||
| 0.15 | 5C | 0.001 | 97.7 | 93.0 | 0.0014 | |
| 0.0001 | 90.1 | 84.2 | 0.0003 | |||
| 0.1 | 0.22 | 6A | 0.001 | 100 | 100 | 0.0011 |
| 0.0001 | 100 | 100 | 0.0002 | |||
| 0.11 | 6B | 0.001 | 100 | 100 | 0.0014 | |
| 0.0001 | 100 | 100 | 0.0004 | |||
| 0.056 | 6C | 0.001 | 86.6 | 75.2 | 0.0013 | |
| 0.0001 | 73.6 | 56.5 | 0.0002 | |||
| 0.01 | 0.02 | 7A | 0.001 | 100 | 100 | 0.0010 |
| 0.0001 | 100 | 100 | 0.0002 | |||
| 0.01 | 7B | 0.001 | 99.3 | 98.4 | 0.0013 | |
| 0.0001 | 98.4 | 95.8 | 0.0003 | |||
| 0.005 | 7C | 0.001 | 72.6 | 58.4 | 0.0016 | |
| 0.0001 | 51.6 | 40.5 | 0.0005 | |||
* α is calibrated to the empirical false positive rate of KWII
The simulations are based on the models in [2].
Figure 2Figure 2A-F are receiver-operating characteristic plots showing the dependence of the power of the KWII on proportion of false positives for models 1-GH, 2-GH, 3, 3-GH, 4, 4-GH, 5C, 6C and 7A-7B in Table 1. Models 5A, 5B, 6A, 6B and 7C had power greater than 99% over the range of proportion of false positives examined and are not shown. The open circles in Figure 2A-3 D represent the power for detecting one of the two interacting pairs of loci and the open squares represent the power for detecting both loci. The filled circles in Figures 2C and 2 D correspond to the corresponding model without genetic heterogeneity whereas in Figures 2E and 2F the filled circles are used to distinguish between the different models. The power of the KWII at α-values of 0.001 and 0.0001 are summarized in Table 5 and 6.
Figure 3Figures 3A-C compare the power of KWII (green bars) to that of MDR (red bars), RPM (orange bars) and blue for logistic regression at α-values of 0.001 and 0.0001. The sample size was 1200. Figure 3A corresponds to Interaction 1, Figure 3B corresponds to Interaction 3 and Figure 3C represents the power to detect all three interactions. The penetrance matrices for the combinations of SNP pairs involved in Interactions 1, 2 and 3 are shown in Table 4. The bar corresponding to MDR in Figure 3C is apparently not visible because the power was low.
Comparison of the power and false positive proportions of KWII to MDR, RPM, and regression approaches.
| Interaction | Sample Size | α | Power % | |||
|---|---|---|---|---|---|---|
| KWII | MDR | RPM | Logistic | |||
| 600 | 0.001 | 68.9 | 7.8 | 48.8 | 68.3 | |
| 0.0001 | 42.2 | 3.7 | 29.5 | 43.3 | ||
| 1200 | 0.001 | 98.9 | 39.1 | 94.9 | 99.1 | |
| 0.0001 | 95.6 | 10.0 | 87.8 | 95.4 | ||
| 2400 | 0.001 | 100 | 65.3 | 100 | 100 | |
| 0.0001 | 100 | 13.8 | 100 | 100 | ||
| 600 | 0.001 | 15.1 | 0.04 | 7.2 | 14.8 | |
| 0.0001 | 4.8 | 0.03 | 3.2 | 5.2 | ||
| 1200 | 0.001 | 47.6 | 3.6 | 28.1 | 48.7 | |
| 0.0001 | 26.6 | 0.04 | 15.2 | 24.6 | ||
| 2400 | 0.001 | 95.1 | 6.6 | 84.4 | 94.0 | |
| 0.0001 | 83.5 | 0.05 | 70.8 | 84.2 | ||
| 600 | 0.001 | 7.6 | 0.01 | 2.2 | 7.5 | |
| 0.0001 | 15.2 | 0.01 | 0.5 | 14.1 | ||
| 1200 | 0.001 | 48.1 | 0.03 | 25.7 | 47.8 | |
| 0.0001 | 26.6 | 0.01 | 11.5 | 22.1 | ||
| 2400 | 0.001 | 95.1 | 2.9 | 84.4 | 94.0 | |
| 0.0001 | 83.5 | 0.01 | 70.8 | 84.2 | ||
| 600 | 0.001 | - | 0.0021 | 0.0010 | 0.0013 | |
| 0.0001 | - | 0.0010 | 0.0001 | 0.0001 | ||
| 1200 | 0.001 | - | 0.0075 | 0.0013 | 0.0016 | |
| 0.0001 | - | 0.0011 | 0.0003 | 0.0002 | ||
| 2400 | 0.001 | - | 0.0103 | 0.0009 | 0.0013 | |
| 0.0001 | - | 0.0014 | 0.0002 | 0.0002 | ||
*α is calibrated to the empirical false positive rate of KWII.
The model used for the simulations contained three interacting pairs of SNP as summarized in Table 1: Interaction 1 and Interaction 2 in were based on [16] whereas Interaction 3 was based on [2].