| Literature DB >> 18466478 |
Amina Barhdadi1, Marie-Pierre Dubé.
Abstract
Large genetic association studies based on hundreds of thousands of single-nucleotide polymorphisms (SNPs) are a popular option for the study of complex diseases. The evaluation of gene x gene interactions in such studies is a sensible method of capturing important genetic effects. The number of tests required to consider all pairs of SNPs, however, can lead to a computational burden, and efficient strategies to reduce the number of tests performed are desirable. In this study, we compare two-stage strategies for pairwise SNP interactions testing. Those approaches rely on the selection of SNPs based on the single-locus test results obtained at the first stage. In the simultaneous approach, SNPs that fall below the marginal significance thresholds (p = 0.05 and p = 0.1) in stage 1 are selected and tested for within-group pairwise interaction in stage 2. With the conditional approach, SNPs that reach Bonferroni-adjusted significance at the first stage are tested in pairwise combinations with all SNPs in the data set. We compared the performance of those strategies by using Replicate 1 of the simulated data set of the Genetic Analysis Workshop 15 Problem 3. Most interactions detected resulted from SNP pairs within 1000 kb of each other. The remaining were false positives involving SNPs with excessively strong marginal signals. Our results highlight the need to account for locus proximity in the evaluation of interaction effects and emphasize the importance of marginal signal strength in logistic regression-based interaction modeling. We found that modeling additive genetic effects alone was sufficient to capture underlying dominance interaction effects in the data.Entities:
Year: 2007 PMID: 18466478 PMCID: PMC2367578 DOI: 10.1186/1753-6561-1-s1-s135
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Marginal SNP association results for 1500 cases of rheumatoid arthritis and 2000 controls from the simulated Replicate 1 of GAW15, detected by using an additive and dominance genetic effects logistic regression model in [Eq. (1)].
Number of significant interactions detected in Stage 2 of the simultaneous and the conditional designs by logistic regression modeling [Eq. (3) and (4)].
| Full model without covariates [Eq. (3)] | Additive effects with covariates [Eq. (4)] | ||||
| Simultaneous design | 863,970 tests | 399,171 tests | |||
| <1,000 kb | 46 | 0 | 0 | 1 | 206 |
| >1,000 kb | 11 | 0 | 0 | 0 | 70 |
| Simultaneous design | 925,480 tests | 869,221 tests | |||
| <1,000 kb | 46 | 0 | 0 | 1 | 209 |
| >1,000 kb | 11 | 0 | 0 | 0 | 93 |
| Conditional design | 4,069,398 tests | 3,656,028 tests | |||
| <1,000 kb | 49 | 0 | 0 | 0 | 353 |
| >1,000 kb | 75 | 0 | 0 | 4 | 617 |