| Literature DB >> 23134571 |
Abstract
BACKGROUND: Genome-wide association studies (GWAS) have generated a wealth of valuable genotyping data for complex diseases/traits. A large proportion of these data are embedded with many weakly associated markers that have been missed in traditional single marker analyses, but they may provide valuable insights in dissecting the genetic components of diseases. Gene set analysis (GSA) augmented by protein-protein interaction network data provides a promising way to examine GWAS data by analyzing the combined effects of multiple genes/markers, each of which may have only individually weak to moderate association effects. A critical issue in GSA of GWAS data is the definition of gene-wise P values based on multiple SNPs mapped to a gene.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23134571 PMCID: PMC3481439 DOI: 10.1186/1471-2164-13-S6-S15
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Q-Q plot of gene-wise . The three rows of panels represent the method of VEGAS-all, VEGAS-top, and minP, respectively. In each row (method), the left panel displays the Q-Q plot, while the right panel shows the distribution of gene-wise P values versus gene length. In the right panel, gene length (X-axis) was separated to bins, each of which has 100 genes; the Y-axis shows the proportion of significant genes (denoted as "SigGenes") in each length bin. SigGenes were defined as those with gene-wise P values <0.05.
Summary of the module search results by the strict searching strategy
| Gene-wise method | # modules | # significant modules | # module genes |
|---|---|---|---|
| VEGAS-all | 256 | 13 | 144 |
| VEGAS-top | 572 | 33 | 323 |
| minP | 703 | 36 | 319 |
Analysis results of module gene sets and KEGG pathways by ALIGATOR (P<0.2)
| Gene set | ||
|---|---|---|
| VEGAS-all module genes | 1.0 × 10-4 | 0.007 |
| VEGAS-top module genes | 1.0 × 10-4 | 0.007 |
| minP module genes | 1.0 × 10-4 | 0.007 |
| Asthma (hsa05310) | 8.0 × 10-4 | 0.040 |
| Tryptophan metabolism (hsa00380) | 1.7 × 10-3 | 0.068 |
| Gap junction (hsa04540) | 2.3 × 10-3 | 0.076 |
| Cytokine-cytokine receptor interaction (hsa04060) | 2.9 × 10-3 | 0.082 |
| Intestinal immune network for IgA production (hsa04672) | 5.5 × 10-3 | 0.137 |
$P values were adjusted by Benjamini & Hochberg (BH) method [20].
Figure 2Proportion of significant genes in each module of gene sets. Significant genes were defined as those with gene-wise P values <0.05. The proportion of significant genes in module genes and in the background genes in the network were shown for each scenario of using VEGAS-all, VEGAS-top, and minP as input genes, respectively.
Summary of module genes having positive association results in previous studies
| Gene | Module gene set$ | # SNPs | Start (bp) | Stop (bp) | ||||
|---|---|---|---|---|---|---|---|---|
| a, m | 125 | 1 | 160306204 | 160606437 | 0.005 | 0.007 | 1.07 × 10-4 | |
| t, m | 14 | 1 | 184907591 | 184916179 | 0.013 | 8.5 × 10-4 | 1.07 × 10-5 | |
| t | 11 | 1 | 229731021 | 229768892 | 0.476 | 0.306 | 0.113 | |
| m | 106 | 1 | 229829183 | 230243641 | 0.349 | 0.100 | 0.004 | |
| a | 47 | 3 | 57969166 | 58133017 | 0.030 | 0.034 | 0.007 | |
| t | 40 | 3 | 121028235 | 121295203 | 0.141 | 0.073 | 0.011 | |
| m | 193 | 5 | 58300622 | 59225378 | 0.277 | 0.018 | 6.16 × 10-5 | |
| a | 12 | 5 | 132037271 | 132046267 | 0.069 | 0.019 | 0.007 | |
| a | 52 | 6 | 15631017 | 15771250 | 0.020 | 0.022 | 0.008 | |
| t, m | 48 | 6 | 30420915 | 30422611 | 0.014 | 0.001 | 1.04 × 10-4 | |
| m | 43 | 6 | 31648071 | 31650077 | 0.450 | 0.075 | 0.003 | |
| m | 43 | 6 | 31651328 | 31654091 | 0.598 | 0.184 | 0.003 | |
| a, t, m | 86 | 6 | 152053323 | 152466101 | 0.091 | 0.027 | 0.004 | |
| t | 13 | 6 | 170705395 | 170723872 | 0.031 | 0.019 | 0.010 | |
| t, m | 72 | 7 | 55054218 | 55242525 | 0.556 | 0.294 | 0.026 | |
| a, t, m | 17 | 8 | 102000089 | 102034745 | 0.857 | 0.685 | 0.268 | |
| a, t, m | 3 | 9 | 139153429 | 139183029 | 0.016 | 0.005 | 0.004 | |
| m | 59 | 10 | 123492614 | 123677536 | 0.491 | 0.155 | 9.28 × 10-4 | |
| t, m | 26 | 12 | 66834816 | 66839788 | 0.068 | 0.007 | 0.002 | |
| m | 45 | 12 | 116135361 | 116283965 | 0.643 | 0.315 | 0.028 | |
| t | 68 | 13 | 41520888 | 41701888 | 0.106 | 0.031 | 0.002 | |
| a, t | 6 | 14 | 104306731 | 104333125 | 0.076 | 0.056 | 0.026 | |
| t, m | 11 | 17 | 7512444 | 7531588 | 0.540 | 0.219 | 0.075 | |
| t | 129 | 17 | 61729387 | 62237324 | 0.995 | 0.857 | 0.029 | |
| m | 13 | 19 | 40474877 | 40496547 | 0.097 | 0.037 | 0.003 | |
| t, m | 21 | 20 | 30813851 | 30860823 | 0.026 | 0.002 | 1.75 × 10-4 | |
| m | 37 | 22 | 35007271 | 35113927 | 0.050 | 0.017 | 5.75 × 10-4 |
a: VEGAS-all; t: VEGAS-top; m: minP.
Comparative results by DAPPLE for the top 30 genes in VEGAS-all, VEGAS-top, and minP data sets
| VEGAS-all | VEGAS-top | minP | |
|---|---|---|---|
| # of direct interactions | 4 | 1 | 3 |
| # genes to prioritize | 6 | 2 | 3 |
| Mean associated protein direct connectivity | 1.6 | 1.0 | 2.0 |
| Mean associated protein indirect connectivity | 58.1 | 6.1 | 7.5 |
| # module genes | 18 | 21 | 19 |
Enriched KEGG pathways for module genes using the hypergeometric test
| Pathway name (KEGG ID) | Pathway size | # module genes | ||
|---|---|---|---|---|
| Neurotrophin signaling pathway (hsa04722) | 116 | 10 | 6.96 × 10-5 | 0.007 |
| Adipocytokine signaling pathway (hsa04920) | 55 | 6 | 6.34 × 10-4 | 0.065 |
| Cell cycle (hsa04110) | 114 | 8 | 1.56 × 10-3 | 0.158 |
| Chronic myeloid leukemia (hsa05220) | 70 | 6 | 2.28 × 10-3 | 0.227 |
| Vasopressin-regulated water reabsorption (hsa04962) | 34 | 4 | 4.10 × 10-3 | 0.405 |
| Cell cycle (hsa04110) | 114 | 20 | 8.61 × 10-8 | 1.15 × 10-5 |
| Neurotrophin signaling pathway (hsa04722) | 116 | 17 | 1.14 × 10-5 | 1.51 × 10-3 |
| Endometrial cancer (hsa05213) | 49 | 8 | 1.31 × 10-3 | 0.173 |
| RIG-I-like receptor signaling pathway (hsa04622) | 50 | 8 | 1.50 × 10-3 | 0.196 |
| Chronic myeloid leukemia (hsa05220) | 70 | 9 | 3.69 × 10-3 | 0.479 |
| T cell receptor signaling pathway (hsa04660) | 104 | 17 | 2.68 × 10-6 | 3.43 × 10-4 |
| Neurotrophin signaling pathway (hsa04722) | 116 | 18 | 2.91 × 10-6 | 3.69 × 10-4 |
| Antigen processing and presentation (hsa04612) | 59 | 12 | 8.58 × 10-6 | 1.08 × 10-3 |
| Chronic myeloid leukemia (hsa05220) | 70 | 13 | 1.03 × 10-5 | 1.28 × 10-3 |
| Non-small cell lung cancer (hsa05223) | 51 | 11 | 1.14 × 10-5 | 1.42 × 10-3 |