| Literature DB >> 25429300 |
Tao He1, Ping-Shou Zhong1, Yuehua Cui2.
Abstract
Single variant analysis in genome-wide association studies (GWAS) has been proven to be successful in identifying thousands of genetic variants associated with hundreds of complex diseases. However, these identified variants only explain a small fraction of inheritable variability in many diseases, suggesting that other resources, such as multilevel genetic variations, may contribute to disease susceptibility. In this work, we proposed to combine genetic variants that belong to a gene set, such as at gene- and pathway-level to form an integrated signal aimed to identify major players that function in a coordinated manner conferring disease risk. The integrated analysis provides novel insight into disease etiology while individual signals could be easily missed by single variant analysis. We applied our approach to a genome-wide association study of type 2 diabetes (T2D) with male and female data analyzed separately. Novel sex-specific genes and pathways were identified to increase the risk of T2D. We also demonstrated the performance of signal integration through simulation studies.Entities:
Keywords: gene-based analysis; p-value combination; pathway-based analysis; sex-specific analysis; type 2 diabetes
Year: 2014 PMID: 25429300 PMCID: PMC4228910 DOI: 10.3389/fgene.2014.00395
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Manhattan plot of single SNP . The horizontal dotted line represents the genome-wide significance threshold which is labeled in the figure.
Figure 2Manhattan plot of KEGG gene . The horizontal dotted line represents the genome-wide gene-level significance threshold which is labeled in the figure.
List of enriched KEGG Pathways in Female and Male population.
| KEGG maturity onset diabetes of the young | 25 | ✓ | – |
| KEGG pathways in cancer | 315 | ✓ | – |
| KEGG TGF beta signaling pathway | 86 | ✓ | – |
| KEGG hedgehog signaling pathway | 55 | ✓ | – |
| KEGG type II diabetes mellitus | 46 | ✓ | – |
| KEGG melanoma | 67 | ✓ | – |
| KEGG sphingolipid metabolism | 37 | ✓ | – |
| KEGG type I diabetes mellitus | 41 | ✓ | – |
| KEGG MAPK signaling pathway | 256 | ✓ | – |
| KEGG one carbon pool by folate | 17 | ✓ | – |
| KEGG alpha linolenic acid metabolism | 19 | ✓ | – |
| KEGG thyroid cancer | 29 | – | ✓ |
| KEGG arrhythmogenic right ventricular cardiomyopathy ARVC | 72 | – | ✓ |
| KEGG adherens junction | 73 | – | ✓ |
| KEGG basal cell carcinoma | 55 | ✓ | ✓ |
| KEGG colorectal cancer[ | 61 | ✓ | ✓ |
Pathway signal decreased greatly when TCF7L2 gene was deleted in female.
Pathway signal decreased greatly when TCF7L2 gene was deleted in male.
Figure 3Manhattan plot of KEGG pathway . The horizontal solid line represents the genome-wide pathway level significance threshold which is labeled in the figure.
Figure 4Three simulation scenarios. OR, odds ratio; IWG, interactions within a gene; IBG, interactions between genes. β = log OR for isolated nodes (SNPs), and Ψ = log OR for any two nodes connected with edge.
List of different simulation scenarios.
| NR5A2 | 1 | 2494 | 95 | M | 6S | 6S (+I) |
| GCK | 7 | 2645 | 21 | M | 5S | 5S (+I) |
| NEUROG3 | 10 | 50674 | 36 | M | 6S | 6S (+I) |
| HNF1A | 12 | 6927 | 29 | M | 6S | 6S (+I) |
| HNF1B | 17 | 6928 | 44 | M | 9S | 9S (+I) |
S, small effect (OR = 1.1); M, moderate effect (OR = 1.4); +I: with interactions.
Gene-level simulation results with ARTP and SCC under different scenarios.
| 500:500 | ScenarioA | 0.731 ( | ||||
| ScenarioB | 0.820 ( | 0.122 ( | 0.692 ( | 0.420 ( | ||
| ScenarioC | 0.505 ( | 0.116 ( | 0.172 ( | 0.308 ( | ||
| Under the null | 0.044 (0.053) | 0.045 (0.056) | 0.051 (0.053) | 0.048 (0.054) | 0.051 (0.062) | |
| 1000:1000 | ScenarioA | |||||
| ScenarioB | 0.990 ( | 0.189 ( | 0.954 ( | 0.757 ( | ||
| ScenarioC | 0.836 ( | 0.205 ( | 0.276 ( | 0.596 ( | ||
| Under the null | 0.042 (0.060) | 0.044 (0.049) | 0.061 (0.048) | 0.053 (0.065) | 0.046 (0.057) |
The results using the SCC method are given in the parenthesis while higher power between the two is highlighted with bold font.
Pathway-level simulation results under three different scenarios.
| ScenarioA | 0.754 | 0.757 | 0.996 | |||
| ScenarioB | 0.755 | 0.529 | 0.994 | 0.971 | ||
| ScenarioC | 0.383 | 0.294 | 0.863 | 0.747 | ||
| Under the null | 0.051 | 0.042 | 0.050 | 0.045 | 0.043 | 0.048 |
The highest power among the three methods (ARTP, MGSEA and SCC) is highlighted with bold font.