| Literature DB >> 20186329 |
Todd L Edwards1, Stephen D Turner, Eric S Torstenson, Scott M Dudek, Eden R Martin, Marylyn D Ritchie.
Abstract
The initial presentation of multifactor dimensionality reduction (MDR) featured cross-validation to mitigate over-fitting, computationally efficient searches of the epistatic model space, and variable construction with constructive induction to alleviate the curse of dimensionality. However, the method was unable to differentiate association signals arising from true interactions from those due to independent main effects at individual loci. This issue leads to problems in inference and interpretability for the results from MDR and the family-based compliment the MDR-pedigree disequilibrium test (PDT). A suggestion from previous work was to fit regression models post hoc to specifically evaluate the null hypothesis of no interaction for MDR or MDR-PDT models. We demonstrate with simulation that fitting a regression model on the same data as that analyzed by MDR or MDR-PDT is not a valid test of interaction. This is likely to be true for any other procedure that searches for models, and then performs an uncorrected test for interaction. We also show with simulation that when strong main effects are present and the null hypothesis of no interaction is true, that MDR and MDR-PDT reject at far greater than the nominal rate. We also provide a valid regression-based permutation test procedure that specifically tests the null hypothesis of no interaction, and does not reject the null when only main effects are present. The regression-based permutation test implemented here conducts a valid test of interaction after a search for multilocus models, and can be applied to any method that conducts a search to find a multilocus model representing an interaction.Entities:
Mesh:
Year: 2010 PMID: 20186329 PMCID: PMC2826406 DOI: 10.1371/journal.pone.0009363
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Rejection of the null hypothesis in MDR-type algorithms when only main effects are present is more evident as sample and effect sizes increase.
| 500 families | 2000 families | |||
| Relative Risk | MDR-PDT | MDR-PDT LR | MDR-PDT | MDR-PDT LR |
| 1.5 | 0 | 0 | 0 | 0 |
| 2 | 0 | 0 | 1 | 0 |
| 4 | 2 | 0 | 24 | 0 |
| 6 | 4 | 0 | 50 | 0 |
This behavior is not observed in the LR test, where one significant result was observed in 800 replicates, compared with 591 total significant results for MDR and MDR-PDT over all parameters.
Figure 1Power of different approaches for 2-locus models.
This figure shows the power of MDR LR, MDR-PDT LR, and exhaustive logistic regression (LR) with a Bonferroni correction for 1225 2-locus models under six 2-locus purely epistatic genetic scenarios (Table 1) and three sample sizes for LR, MDR LR, and MDR-PDT LR. LR and MDR LR simulations were with 50 SNPs and MDR-PDT LR simulations were with 20 SNPs.
Figure 2Power of different approaches for 3-locus models.
Power of MDR LR, MDR-PDT LR, and exhaustive logistic regression (LR) with a Bonferroni correction for 20,825 2 and 3-locus models under six 3-locus purely epistatic genetic scenarios (Table 1) and three sample sizes for LR, MDR LR, and MDR-PDT LR. LR and MDR LR simulations were with 50 SNPs and MDR-PDT LR simulations were with 20 SNPs.
Models examined in the simulation study.
| Loci | MAF | Heritability | Odds Ratio |
| 2 | 0.2 | 0.030 | 1.53 |
| 2 | 0.2 | 0.048 | 1.79 |
| 2 | 0.2 | 0.09 | 3.00 |
| 2 | 0.4 | 0.03 | 1.56 |
| 2 | 0.4 | 0.05 | 1.79 |
| 2 | 0.4 | 0.10 | 2.85 |
| 3 | 0.2 | 0.03 | 1.58 |
| 3 | 0.2 | 0.05 | 2.10 |
| 3 | 0.2 | 0.10 | 3.20 |
| 3 | 0.4 | 0.03 | 1.52 |
| 3 | 0.4 | 0.05 | 2.23 |
| 3 | 0.4 | 0.12 | 3.50 |
Example 2×2 table for the calculation of the purely epistatic disease model odds ratio.
| Case | Control | |
| High-Risk | A | B |
| Low-Risk | C | D |