| Literature DB >> 20018019 |
Zheyang Wu1, Chatchawit Aporntewan, David H Ballard, Ji Young Lee, Joon Sang Lee, Hongyu Zhao.
Abstract
The interaction among multiple genes and environmental factors can affect an individual's susceptibility to disease. Some genes may not show strong marginal associations when they affect disease risk through interactions with other genes. As a result, these genes may not be identified by single-marker methods that are widely used in genome-wide association studies. To explore this possibility in real data, we carried out a two-stage model selection procedure of joint single-nucleotide polymorphism (SNP) analysis to detect genes associated with rheumatoid arthritis (RA) using Genetic Analysis Workshop 16 genome-wide association study data. In the first stage, the genetic markers were screened through an exhaustive two-dimensional search, through which promising SNP and SNP pairs were identified. Then, LASSO was used to choose putative SNPs from the candidates identified in the first stage. We then use the RA data collected by the Wellcome Trust Case Control Consortium to validate the putative genetic factors. Balancing computational load and statistical power, this method detects joint effects that may fail to emerge from single-marker analysis. Based on our proposed approach, we not only replicated the identification of important RA risk genes, but also found novel genes and their epistatic effects on RA. To our knowledge, this is the first two-dimensional scan based analysis for a real genome-wide association study.Entities:
Year: 2009 PMID: 20018019 PMCID: PMC2795926 DOI: 10.1186/1753-6561-3-s7-s29
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Top log-likelihood test statistics for the marginal models. Blue solid curve shows the 395 top values of marginal LLR from original data; the green dot curve shows the top marginal LLR excluding the SNPs in chr6; the red dash curve shows the top marginal 395 LLRs from all SNPs after permuting RA disease status.
Figure 2Log marginal association ranks for SNP pairs with significant conditional additive effect. Each dot represents a SNP pair (totally 494 SNP-pairs). Displacement of a dot along the x- and y-axes indicate the log of the marginal association ranks for the corresponding two SNPs. A lower marginal rank indicates a larger marginal effect.
Figure 3Histograms of D' of SNP pairs with significant conditional additive effect. Bars show the distribution of the number of SNP-pairs (totally 494 SNP pairs) over the values of D'. A large D' indicates significant linkage disequilibrium between two SNPs.
Figure 4Top log-likelihood test statistics for the large interaction effects. The blue solid curve shows the top 208 decreasingly ordered LLR test statistics, among which 160 values corresponds to SNP-pairs that exist on different chromosomes or have D' < 0.2, as represented by the green dot curve. The red dash curve shows the top 208 LLR test statistics for the interaction terms after a permutation of the disease status.
Figure 5Marginal association ranks of SNP pairs with large interaction effects. Each dot represents a SNP pair (totally 208 SNP pairs). Displacement of a dot along the x- and y-axes indicate the log of the marginal association ranks for the corresponding two SNPs.
WTCCC-data-validated SNPs and SNP pairs (epistases) associated with RA
| Chr | SNP | Location | Gene Code | ± kbpa | ORb | Marg. Rankd | |
|---|---|---|---|---|---|---|---|
| 1 | rs3811019 | 114183625 | 0 | 1.46 | 6.93 × 10-7 | 1150 | |
| 6 | rs1265777 | 32381136 | 0 | 1.01 | 9.84 × 10-1 | 436 | |
| 6 | rs574710 | 32396168 | 0 | 0.89 | 7.44 × 10-1 | 488 | |
| 6 | rs539703 | 32396440 | 0 | 1.05 | 9.34 × 10-1 | 440 | |
| 6 | rs2894249 | 32433813 | 0 | 0.79 | 3.76 × 10-4 | 245 | |
| 6 | rs2076533 | 32471505 | 0 | 2.03 | 2.00 × 10-16 | 630 | |
| 6 | rs3763308 | 32482618 | 0 | 0.42 | 1.91 × 10-8 | 959 | |
| 6 | rs9268645 | 32516505 | 0 | 0.85 | 2.90 × 10-2 | 278 | |
| 6 | rs7194 | 32520458 | 0 | 1.02 | 7.96 × 10-1 | 110 | |
| 6 | rs9273363 | 32734250 | 5 | 0.73 | 4.14 × 10-9 | 709 | |
| 6 | rs6908943 | 32743274 | 5 | 0.69 | 2.92 × 10-7 | 1131 | |
| 8 | SNP_A-4193342 | 97922693 | 0 | 1.38 | 1.68 × 10-7 | 416671 | |
| 22 | rs16981203 | 24729414 | 0 | 1.3 | 1.08 × 10-5 | 501 |
a± kbp, location of the SNPs. "0" indicates the SNP is physically located within the corresponding gene; "5" indicates the SNP is located outside the gene but is less than 5 kbp away.
bOR, the joint odds ratios and p-values in the full model containing all of the selected variables.
cp-value, for the full model containing all of the selected variables.
dMarg. Rank, marginal ranks of the SNPs by single-marker analysis in the WTCCC data. The rank >1015 corresponds to the Bonferroni p-value > 0.1 in a single-marker study.
SNP-pairs with large epistatic effects validated with WTCCC data
| Chr1 | SNP1 | Location1 | Gene1 | Marg. Rank1a | Chr1 | SNP1 | Location1 | Gene1 | Marg. Rank2a | |
|---|---|---|---|---|---|---|---|---|---|---|
| 6 | rs2244579 | 31544618 | 2933 | 6 | rs206015 | 32290737 | NOTCH4 | 623 | 1.39 × 10-5 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs9276440 | 32822761 | HLA-DQA2 | 3042 | 5.33 × 10-5 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs9276432 | 32820362 | HLA-DQA2 | 2165 | 1.06 × 10-4 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs9276429 | 32820082 | HLA-DQA2 | 2386 | 1.08 × 10-4 | |
| 6 | rs2248880 | 31341489 | 130792 | 6 | rs9273363 | 32734250 | HLA-DQB1 | 709 | 1.14 × 10-4 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs9276431 | 32820225 | HLA-DQA2 | 2484 | 1.40 × 10-4 | |
| 6 | rs9263794 | 31237998 | 4325 | 6 | rs438475 | 32294223 | NOTCH4 | 1243 | 7.56 × 10-4 | |
| 6 | rs1265074 | 31221193 | 20388 | 6 | rs438475 | 32294223 | NOTCH4 | 1243 | 2.20 × 10-3 | |
| 6 | rs2244579 | 31544618 | 2933 | 6 | rs438475 | 32294223 | NOTCH4 | 1243 | 2.27 × 10-3 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs9273363 | 32734250 | HLA-DQB1 | 709 | 3.49 × 10-3 | |
| 6 | rs2844615 | 31350938 | 10590 | 6 | rs2596477 | 31435702 | HLA-B | 76640 | 2.36 × 10-2 | |
| 6 | rs4394275 | 31426156 | 4705 | 6 | rs2227127 | 32819760 | HLA-DQA2 | 585 | 3.16 × 10-2 | |
| 6 | rs2736172 | 31698877 | 406 | 6 | rs438475 | 32294223 | NOTCH4 | 1243 | 3.37 × 10-2 | |
| 6 | rs1063635 | 31487910 | 897 | 6 | rs206015 | 32290737 | NOTCH4 | 623 | 3.41 × 10-2 |
Each row gives the annotations of SNP-pairs with validated epistatic effects.
aMarg. Rank 1 and 2 give the ranks of single-marker association strengths of the SNPs 1 and 2 in the WTCCC data. The rank > 1015 corresponds to the Bonferroni p-value > 0.1 in a single-marker study.