| Literature DB >> 22792057 |
Peilin Jia1, Lily Wang, Ayman H Fanous, Carlos N Pato, Todd L Edwards, Zhongming Zhao.
Abstract
With the recent success of genome-wide association studies (GWAS), a wealth of association data has been accomplished for more than 200 complex diseases/traits, proposing a strong demand for data integration and interpretation. A combinatory analysis of multiple GWAS datasets, or an integrative analysis of GWAS data and other high-throughput data, has been particularly promising. In this study, we proposed an integrative analysis framework of multiple GWAS datasets by overlaying association signals onto the protein-protein interaction network, and demonstrated it using schizophrenia datasets. Building on a dense module search algorithm, we first searched for significantly enriched subnetworks for schizophrenia in each single GWAS dataset and then implemented a discovery-evaluation strategy to identify module genes with consistent association signals. We validated the module genes in an independent dataset, and also examined them through meta-analysis of the related SNPs using multiple GWAS datasets. As a result, we identified 205 module genes with a joint effect significantly associated with schizophrenia; these module genes included a number of well-studied candidate genes such as DISC1, GNA12, GNA13, GNAI1, GPR17, and GRIN2B. Further functional analysis suggested these genes are involved in neuronal related processes. Additionally, meta-analysis found that 18 SNPs in 9 module genes had P(meta)<1 × 10⁻⁴, including the gene HLA-DQA1 located in the MHC region on chromosome 6, which was reported in previous studies using the largest cohort of schizophrenia patients to date. These results demonstrated our bi-directional network-based strategy is efficient for identifying disease-associated genes with modest signals in GWAS datasets. This approach can be applied to any other complex diseases/traits where multiple GWAS datasets are available.Entities:
Mesh:
Year: 2012 PMID: 22792057 PMCID: PMC3390381 DOI: 10.1371/journal.pcbi.1002587
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Materials and MethodscMaterials and Methodsgenes for schizophrenia.
Figure 2Distribution of module scores (Z) from two GWAS datasets.
Each circle in the plot represents a module. The circles in red indicate those selected modules (see text). X-axis: module scores from the discovery GWAS dataset. Y-axis: module scores from the evaluation GWAS dataset.
Results of meta-analysis using GAIN, nonGAIN, and ISC GWAS datasets (P meta<1×10−4 and P heterogeneity≥0.05).
| SNP ID | Module Genes | Chr. | Position | Allele |
| Beta | s.e. |
|
|
|
|
| rs9272219 |
| 6 | 32710247 | T/G | 1.46×10−6 | −0.15 | 0.03 | 0.06 | 0.06 | 1.58×10−5 | 0.76 |
| rs10244946 |
| 7 | 1887594 | A/G | 4.30×10−6 | −0.16 | 0.03 | 1.81×10−4 | 0.18 | 2.36×10−3 | 0.27 |
| rs3778994 |
| 7 | 2142381 | A/C | 6.79×10−6 | −0.15 | 0.03 | 4.54×10−4 | 0.61 | 1.20×10−4 | 0.07 |
| rs10275045 |
| 7 | 1887352 | T/C | 9.79×10−6 | −0.13 | 0.03 | 1.44×10−4 | 0.20 | 3.61×10−3 | 0.16 |
| rs4721190 |
| 7 | 1921258 | A/G | 1.39×10−5 | −0.15 | 0.03 | 3.07×10−4 | 0.17 | 6.42×10−3 | 0.32 |
| rs2056480 |
| 7 | 1920827 | A/G | 1.44×10−5 | −0.12 | 0.03 | 4.15×10−5 | 0.31 | 5.69×10−3 | 0.07 |
| rs9272535 |
| 6 | 32714734 | A/G | 1.58×10−5 | −0.16 | 0.04 | 0.07 | 0.07 | 8.27×10−5 | 0.41 |
| rs3132649 |
| 6 | 30429036 | A/G | 1.64×10−5 | −0.20 | 0.05 | 0.01 | 0.46 | 6.46×10−7 | 0.00 |
| rs10224497 |
| 7 | 2116493 | G/A | 1.75×10−5 | −0.14 | 0.03 | 4.32×10−5 | 0.91 | 8.46×10−4 | 0.02 |
| rs741326 |
| 2 | 70912343 | G/A | 2.65×10−5 | −0.12 | 0.03 | 0.31 | 0.09 | 4.14×10−5 | 0.46 |
| rs12646184 |
| 4 | 95402239 | T/C | 3.21×10−5 | 0.12 | 0.03 | 2.85×10−5 | 0.11 | 0.03 | 0.05 |
| rs2071278 |
| 6 | 32273422 | G/A | 3.23×10−5 | −0.16 | 0.04 | 0.10 | 0.98 | 2.78×10−6 | 0.06 |
| rs2664871 |
| 4 | 95365304 | T/C | 4.69×10−5 | 0.12 | 0.03 | 3.89×10−5 | 0.12 | 0.04 | 0.05 |
| rs172531 |
| 1 | 8418177 | G/A | 5.62×10−5 | 0.12 | 0.03 | 0.01 | 0.75 | 4.03×10−5 | 0.06 |
| rs2087170 |
| 4 | 95381983 | G/T | 5.83×10−5 | 0.14 | 0.03 | 5.59×10−5 | 0.10 | 0.12 | 0.11 |
| rs3757440 |
| 7 | 2239462 | G/A | 6.01×10−5 | −0.14 | 0.04 | 6.41×10−4 | 0.56 | 2.35×10−3 | 0.14 |
| rs301791 |
| 1 | 8390959 | T/A | 6.45×10−5 | 0.12 | 0.03 | 4.88×10−3 | 0.78 | 9.90×10−5 | 0.06 |
| rs301801 |
| 1 | 8418532 | C/T | 6.66×10−5 | 0.12 | 0.03 | 0.01 | 0.73 | 4.51×10−5 | 0.06 |
| rs302719 |
| 1 | 8412907 | G/T | 7.01×10−5 | 0.12 | 0.03 | 0.01 | 0.73 | 4.84×10−5 | 0.06 |
| rs349171 |
| 3 | 62026751 | T/C | 9.39×10−5 | −0.16 | 0.04 | 0.50 | 0.08 | 9.58×10−5 | 0.35 |
| rs8336 |
| 4 | 95430633 | T/C | 9.81×10−5 | −0.13 | 0.03 | 1.43×10−3 | 0.18 | 0.02 | 0.47 |
Rows were ordered by P meta.
Figure 3Meta-analysis results of the two most significant genes.
Figures were generated using the LocusZoom online tool. X-axis is the genome coordinate. Y-axis is the -logP meta values. Each point represents a SNP. The color of points is according to their level of linkage disequilibrium (LD) with the index SNPs. In this case, the index SNP is the most significant one in each panel. The LD measure is r2 based on the HapMap CEU population (release 22).
Enriched pathways for module genes by Ingenuity Pathway Analysis.
| Ingenuity Canonical Pathways | -log( | Molecules |
| Huntington's Disease Signaling | 7.17 |
|
| Wnt/β-catenin Signaling | 6.52 |
|
| Androgen Signaling | 4.97 |
|
| CREB Signaling in Neurons | 4.86 |
|
| Prolactin Signaling | 4.82 |
|
| TGF-β Signaling | 4.40 |
|
| Calcium Signaling | 4.31 |
|
| Gα12/13 Signaling | 4.12 |
|
| Synaptic Long Term Depression | 3.97 |
|
| Dopamine-DARPP32 Feedback in cAMP Signaling | 3.80 |
|
P values adjusted by Benjamini & Hochberg (BH) method [33].