| Literature DB >> 21226955 |
Jingyuan Zhao1, Simone Gupta, Mark Seielstad, Jianjun Liu, Anbupalam Thalamuthu.
Abstract
BACKGROUND: Single Nucleotide Polymorphism (SNP) analysis only captures a small proportion of associated genetic variants in Genome-Wide Association Studies (GWAS) partly due to small marginal effects. Pathway level analysis incorporating prior biological information offers another way to analyze GWAS's of complex diseases, and promises to reveal the mechanisms leading to complex diseases. Biologically defined pathways are typically comprised of numerous genes. If only a subset of genes in the pathways is associated with disease then a joint analysis including all individual genes would result in a loss of power. To address this issue, we propose a pathway-based method that allows us to test for joint effects by using a pre-selected gene subset. In the proposed approach, each gene is considered as the basic unit, which reduces the number of genetic variants considered and hence reduces the degrees of freedom in the joint analysis. The proposed approach also can be used to investigate the joint effect of several genes in a candidate gene study.Entities:
Mesh:
Year: 2011 PMID: 21226955 PMCID: PMC3033801 DOI: 10.1186/1471-2105-12-17
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The flowchart of the various steps of our method.
The information of the three pathway patterns used in simulation
| TNFR2 Signaling Pathway (pattern 1) | 15 | 2 | 4 | 0.340, 0.416, 0.298, 0.317 | 1.070, 1.081, 1.117, 1.130 |
| Fructose and mannose metabolism (pattern 2) | 35 | 4 | 6 | 0.163, 0.353, 0.359, 0.359, 0.424, 0.332 | 1.113, 1.077, 1.063, 1.101, 1.105, 1.088 |
| Cytokine-cytokine receptor interaction (pattern 3) | 208 | 19 | 39 | 0.409, 0.491, 0.491, 0.275, 0.397, 0.315, 0.210, 0.073, 0.372, 0.415, 0.281, 0.219, 0.472, 0.305, 0.355, 0.435, 0.112, 0.323, 0.373, 0.044, 0.063, 0.227, 0.430, 0.070, 0.161, 0.355, 0.161, 0.034, 0.156, 0.417, 0.420, 0.169, 0.210, 0.175, 0.183, 0.269, 0.194, 0.177, 0.499 | 1.100, 1.065, 1.064, 1.090, 1.055, 1.086, 1.167, 1.172, 1.057, 1.084, 1.141, 1.081, 1.095, 1.149, 1.072, 1.092, 1.087, 1.062, 1.132, 1.134, 1.162, 1.094, 1.060, 1.247, 1.078, 1.062, 1.096, 1.249, 1.123, 1.070, 1.089, 1.073, 1.087, 1.093, 1.074, 1.064, 1.087, 1.123, 1.071 |
Figure 2Power versus relative risk in "TNFR2 Signaling Pathway" (pattern 1). The power of several methods is shown for different relative risks. ERR denotes the estimated relative risk from the real data set. The power are based on 500 replicates and defined as the proportion of replicates declared significant at the significant level 0.05.
Figure 3Power versus relative risk in "Fructose and mannose metabolism" (pattern 2).
Figure 4Power versus relative risk in "Cytokine-cytokine receptor interaction" (pattern 3).
Type I error of different methods
| TNFR2 Signaling Pathway (pattern 1) | Fructose and mannose metabolism (pattern 2) | Cytokine-cytokine receptor interaction (pattern 3) | |
|---|---|---|---|
| Type I error (95% CI) | Type I error (95% CI) | Type I error (95% CI) | |
| FPC_AIC | 0.060 (0.039,0.081) | 0.058 (0.038,0.078) | 0.054 (0.034,0.074) |
| FPC_BIC | 0.050 (0.031,0.069) NA | 0.048 (0.029,0.067) NA | 0.048 (0.029,0.067) NA |
| FPC_FULL | 0.058 (0.038,0.078) | 0.054 (0.034,0.074) | 0.048 (0.029,0.067) |
| AML | 0.062 (0.041,0.083) | 0.042 (0.024,0.060) | 0.034 (0.018,0.050) |
| MSS | 0.058 (0.038,0.078) | 0.050 (0.031,0.069) | 0.042 (0.024,0.060) |
| RTP_5 | 0.058 (0.038,0.078) | 0.046 (0.028,0.064) | 0.042 (0.024,0.060) |
| RTP_10 | 0.064 (0.043,0.085) | 0.048 (0.029,0.067) | 0.044 (0.026,0.062) |
For each of the methods, we presented the type I error, the 95% confidence interval of all estimated probabilities, the p-value of McNemar's test for the difference of type I error between FPC_BIC and other methods. The type I error was calculated on the basis of 500 replicates at significance level 0.05.
PSR and FDR for Gene subset identification of different methods
| PSR | FDR | PSR | FDR | PSR | FDR | |
|---|---|---|---|---|---|---|
| FPC_AIC | 0.880 | 0.524 | 0.473 | 0.665 | 0.535 | 0.739 |
| FPC_BIC | 0.318 | 0.031 | 0.243 | 0.093 | 0.192 | 0.090 |
| Gene-test | 0.310 | 0.024 | 0.185 | 0.075 | 0.085 | 0.025 |
The pathways detected by at least one method in GAIN data set
| 1. Cytokine Network | <0.001 | <0.001 | 0.020 | 0.888 |
| 2. Jak-STAT signaling pathway | <0.001 | <0.001 | 0.001 | 0.104 |
| 3. Dendritic cells in regulating TH1 and TH2 Development | <0.001 | <0.001 | 0.108 | 0.634 |
| 4. Cytokines and Inflammatory Response | <0.001 (0.005, 0.020) | 0.001 | 0.127 | 0.710 |
| 5. Cytokine-cytokine receptor interaction | <0.001 | <0.001 | 0.004 | 0.005 |
| 6. Role of BRCA1, BRCA2 and ATR in Cancer Susceptibility | <0.001 | <0.001 | 0.243 | 0.014 |
| 7. TNFR2 Signaling Pathway | 0.001 | <0.001 0.004, 0.021 | 0.249 | 0.303 |
| 8. Calcium signaling pathway | 0.022 | 0.001 | <0.001 (0.009, 0.009) | 0.002 |
| 9. Axon guidance | 0.017 | 0.040 | 0.001 | 0.238 |
| 10. ECM-receptor interaction | 0.792 | 0.392 | <0.001 (0.004, 0.012) | 0.197 |
| 11. Cell adhesion molecules (CAMs) | 0.332 | 0.011 | 0.001 | 0.023 |
All the p-values, false discovery rates (FDRs) and family-wise error rates (FWERs) were obtained via 1000 permutations. Gene p-value in pathway enrichment method was the p-value of the most significant SNP within gene. The AML method was based on the test statistic of Cochan-Armitage trend test.
The identified genes by AIC or BIC in their 5 common detected pathways
| Cytokine Network | 17 | 5/1 | |
| Jak-STAT signaling pathway | 126 | 25/5 | |
| Dendritic cells in regulating TH1 and TH2 Development | 19 | 4/1 | |
| Cytokine-cytokine receptor interaction | 208 | 49/6 | |
| Role of BRCA1, BRCA2 and ATR in Cancer Susceptibility | 21 | 3/1 |
Column "subset size" (# by AIC/# by BIC) presents the subset sizes selected by AIC and BIC, respectively. Column "selected genes" presents the identified genes by AIC and BIC, where genes with underline denote the ones selected by both AIC and BIC, and genes with no underline denote the ones selected only by AIC.