| Literature DB >> 25371288 |
Marina Evangelou1, Deborah J Smyth, Mary D Fortune, Oliver S Burren, Neil M Walker, Hui Guo, Suna Onengut-Gumuscu, Wei-Min Chen, Patrick Concannon, Stephen S Rich, John A Todd, Chris Wallace.
Abstract
Pathway analysis can complement point-wise single nucleotide polymorphism (SNP) analysis in exploring genomewide association study (GWAS) data to identify specific disease-associated genes that can be candidate causal genes. We propose a straightforward methodology that can be used for conducting a gene-based pathway analysis using summary GWAS statistics in combination with widely available reference genotype data. We used this method to perform a gene-based pathway analysis of a type 1 diabetes (T1D) meta-analysis GWAS (of 7,514 cases and 9,045 controls). An important feature of the conducted analysis is the removal of the major histocompatibility complex gene region, the major genetic risk factor for T1D. Thirty-one of the 1,583 (2%) tested pathways were identified to be enriched for association with T1D at a 5% false discovery rate. We analyzed these 31 pathways and their genes to identify SNPs in or near these pathway genes that showed potentially novel association with T1D and attempted to replicate the association of 22 SNPs in additional samples. Replication P-values were skewed (P=9.85×10-11) with 12 of the 22 SNPs showing P<0.05. Support, including replication evidence, was obtained for nine T1D associated variants in genes ITGB7 (rs11170466, P=7.86×10-9), NRP1 (rs722988, 4.88×10-8), BAD (rs694739, 2.37×10-7), CTSB (rs1296023, 2.79×10-7), FYN (rs11964650, P=5.60×10-7), UBE2G1 (rs9906760, 5.08×10-7), MAP3K14 (rs17759555, 9.67×10-7), ITGB1 (rs1557150, 1.93×10-6), and IL7R (rs1445898, 2.76×10-6). The proposed methodology can be applied to other GWAS datasets for which only summary level data are available.Entities:
Keywords: genomewide association data; meta-analysis; pathway analysis
Mesh:
Year: 2014 PMID: 25371288 PMCID: PMC4258092 DOI: 10.1002/gepi.21853
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135
Summary statistics of the database genes within the two GWAS and the meta-analysis data of Barrett et al. [2009]. The “Theoretical” represents the genes of each pathway database as these were downloaded. These numbers are reduced when SNP coverage within the studies is taken into account
| Database | Study | Minimum | Median | Mean | Maximum |
|---|---|---|---|---|---|
| BioCarta | Theoretical | 1 | 15 | 16.97 | 84 |
| Meta-analysis | 1 | 13 | 14.79 | 76 | |
| WTCCC | 1 | 13 | 14.75 | 76 | |
| T1DGC | 1 | 13 | 14.70 | 76 | |
| Reactome | Theoretical | 1 | 16.50 | 46.31 | 1,740 |
| Meta-analysis | 0 | 15 | 37.23 | 1,506 | |
| WTCCC | 0 | 14 | 36.75 | 1,497 | |
| T1DGC | 0 | 15 | 37.15 | 1,496 |
The names of the methods applied to the data. FM stands for Fisher's method and ARTP stands for adaptive rank truncated product method. The gene statistics computed are the minimum P-value statistic and the Fisher's method statistic, which were adjusted either using a phenotype permutation procedure or using the reference genotype data for generating the corresponding SNP P-values
| Name | Gene statistic | Procedure | Pathway analysis method |
|---|---|---|---|
| FM-(MIN) | Minimum | Phenotype permutation | FM |
| FM-(FM) | Fisher's statistic | Phenotype permutation | FM |
| FM-(MIN | Minimum | Reference genotype data | FM |
| FM-(FM | Fisher's statistic | Reference genotype data | FM |
| ARTP-(MIN) | Minimum | Phenotype permutation | ARTP |
| ARTP-(FM) | Fisher's statistic | Phenotype permutation | ARTP |
| ARTP-(MIN | Minimum | Reference genotype data | ARTP |
| ARTP(FM | Fisher's statistic | Reference genotype data | ARTP |
Spearman correlations of the P-values computed for each tested pathway, for each tested database for both T1DGC and WTCCC GWAS
| Database | Methods compared | Spearman correlation |
|---|---|---|
| T1DGC | ||
| BioCarta | FM-(MIN) vs FM-(MIN) | 0.9939 |
| FM-(FM) vs FM-(FM) | 0.9755 | |
| ARTP-(MIN) vs ARTP-(MIN) | 0.9854 | |
| ARTP-(FM) vs ARTP-(FM) | 0.9496 | |
| Reactome | FM-(MIN) vs FM-(MIN) | 0.9878 |
| FM-(FM) vs FM-(FM) | 0.8378 | |
| ARTP-(MIN) vs ARTP-(MIN) | 0.9389 | |
| ARTP-(FM) vs ARTP-(FM) | 0.8833 | |
| WTCCC | ||
| BioCarta | FM-(MIN) vs FM-(MIN) | 0.9761 |
| FM-(FM) vs FM-(FM) | 0.9739 | |
| ARTP-(MIN) vs ARTP-(MIN) | 0.9793 | |
| ARTP-(FM) vs ARTP-(FM) | 0.9303 | |
| Reactome | FM-(MIN) vs FM-(MIN) | 0.9784 |
| FM-(FM) vs FM-(FM) | 0.9729 | |
| ARTP-(MIN) vs ARTP-(MIN) | 0.9638 | |
| ARTP-(FM) vs ARTP-(FM) | 0.9210 | |
Type-I error of the gene statistics combined with Fisher's method for the different pathway analysis methods
| Method | ||||
|---|---|---|---|---|
| Pathway size | FM-(MIN) | FM-(FM) | FM-(MIN | FM-(FM |
| 20 | 0.044 | 0.053 | 0.045 | 00057 |
| 50 | 0.043 | 0.041 | 0.043 | 0.044 |
| 100 | 0.059 | 0.071 | 0.060 | 0.058 |
| 200 | 0.053 | 0.054 | 0.048 | 0.049 |
| 500 | 0.042 | 0.053 | 0.046 | 0.055 |
| 1000 | 0.044 | 0.051 | 0.048 | 0.050 |
Pathways with FDR P-values of FM-(MIN) method less than 0.05
| Number | Pathway | FDR |
|---|---|---|
| i | Activation of Csk by cAMP-dependent Protein Kinase Inhibits Signaling through the T Cell Receptor | 0.0436 |
| ii | IL-2 Receptor Beta Chain in T cell Activation | 0.0293 |
| iii | HIV Induced T Cell Apoptosis | 0.0106 |
| iv | CTL mediated immune response against target cells | 0.0323 |
| v | Antigen Dependent B Cell Activation | 0.0364 |
| vi | IL-10 Anti-inflammatory Signaling Pathway | 0.0115 |
| vii | Stathmin and breast cancer resistance to antimicrotubule agents | 0.0460 |
| viii | T Helper Cell Surface Molecules | 0.0021 |
| ix | NO2-dependent IL 12 Pathway in NK cells | 0.0372 |
| x | T Cytotoxic Cell Surface Molecules | 0.0014 |
| xi | IL 17 Signaling Pathway | 0.0387 |
| xii | The Co-Stimulatory Signal During T-cell Activation | 0.0003 |
| xiii | Lck and Fyn tyrosine kinases in initiation of TCR Activation | 0.0012 |
| xiv | Role of Tob in T-cell activation | 0.0414 |
| xv | T Cell Receptor and CD3 Complex | 0.0414 |
| xvi | Selective expression of chemokine receptors during T-cell polarization | 0.0395 |
| xvii | B Lymphocyte Cell Surface Molecules | 0.0375 |
| xviii | Monocyte and its Surface Molecules | 0.0460 |
| xix | Adhesion Molecules on Lymphocyte | 0.0429 |
| xx | Double Stranded RNA Induced Gene Expression | 0.0375 |
| xxi | IFN alpha signaling pathway | 0.0375 |
| xxii | Immune System | 0.0216 |
| xxiii | Adaptive Immune System | 0.0216 |
| xxiv | Integrin cell surface interactions | 0.0216 |
| xxv | Semaphorin interactions | 0.0299 |
| xxvi | Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell | 0.0012 |
| xxvii | Effects of PIP2 hydrolysis | 0.0216 |
| xxviii | Interleukin-6 signaling | 0.0445 |
| xxix | Signal regulatory protein (SIRP) family interactions | 0.0216 |
| xxx | Catecholamine biosynthesis | 0.0299 |
| xxxi | GRB7 events in ERBB2 signaling | 0.0264 |
Genes of the enriched pathways that have not been reported previously as associated with T1D, but have case-control meta-analysis minimum SNP P-values less than 10−4. The most significant SNP assigned to each gene with its meta-analysis P-value are given in columns 3 and 4. Columns 5 and 6 present the meta-analysis P-value of additional cohorts if available and the combined P-value with Barrett et al. [2009] P-value, respectively. The seventh column presents any other immune (either autoimmune or autoinflammatory) disease(s) that the genes are associated with (ImmunoBase, 26/08/2014). The disease symbols correspond to: UC, Ulcerative colitis; CEL, Celiac disease; PSO, psoriasis; IBD, inflammatory bowel disease; ALO, Alopecia; and CRO, Crohn's disease. The last column presents the pathway number that each gene belongs to (Table5)
| Gene | Chr | Most significant SNP | Additional cohorts meta-analysis SNP | Combined | Gene is located or is a candidate gene within an immune disease region | Pathway membership | |
|---|---|---|---|---|---|---|---|
| 1p34.3 | rs6703605 | xxii, xxiii | |||||
| 1q21.3 | rs6427658 | 0.3260 | AS, JIA, RA | xxii, xxviii | |||
| 1q24.3 | rs10912276 | 0.0290 | CEL, CRO, IBD | ii, iii, iv, v | |||
| 1q31.3 | rs2182419 | 0.9593 | RA | i, viii, x, xiii, xvii, xxii, xxiii, xxv | |||
| 2q31.1 | rs16860458 | xxiv | |||||
| 3p25.1 | rs2450855 | 0.7642 | ii, xxii, xxiii | ||||
| 4p14 | rs4321646 | xxii | |||||
| 4p14 | rs4321646 | xxii | |||||
| 4q27 | rs4502701 | 0.3697 | xxvii | ||||
| 5p13.2 | rs1445898 | 0.0146 | MS, PBC, UC, T1D | xxii | |||
| 5q33.1 | rs4246045 | 0.8456 | CRO, UC, IBD | xxii, xxiii | |||
| 6p21.31 | rs2237093 | xxii | |||||
| 6p25.3 | rs2048698 | 0.0296 | CEL, PSO, RA | xxii | |||
| 6q21 | rs11964650 | 0.0183 | UC, CRO, IBD | xiii, xxii, xxiii, xxv | |||
| 8p23.1 | rs1296023 | xxii, xxiii | |||||
| 10p11.22 | rs1557150 | 0.0119 | xviii, xix, xxii, xxiii, xxiv, xxv, xxvi | ||||
| 10p11.22 | rs722988 | 0.0013 | xxv | ||||
| 11p11.2 | rs2293576 | 0.7537 | MS | xxii, xxiii | |||
| 11q13.1 | rs694739 | 0.0031 | CRO, MS, UC, ALO, IBD | ii, xxii, xxiii | |||
| 11q23.3 | rs11216829 | 0.5807 | 0.0002 | xxii, xxiii, xxiv, xxvi | |||
| 12q13.13 | rs11170466 | xxii, xxiii, xxiv, xxvi | |||||
| 12q13.2 | rs11171710 | xxvii | |||||
| 13q32.1 | rs9302086 | xx | |||||
| 13q12.3 | rs1360485 | 0.3819 | xxii | ||||
| 14q23.1 | rs1111107 | RA | xxvii | ||||
| 16p13.13 | rs149310 | 0.0064 | CEL, CRO, JIA, MS, PBC, PSO, UC, IBD | ii, xxii, xxiii | |||
| 17p13.1 | rs16956936 | 0.0834 | xx | ||||
| 17p13.2 | rs9906760 | 0.0047 | xxii, xxiii | ||||
| 17q21.31 | rs17759555 | 0.0047 | MS | xx, xxii, xxiii | |||
| 19p13.3 | rs12982646 | 0.2323 | xxii, xxiii | ||||
| 19p13.3 | rs12982646 | 0.2323 | xxii, xxiii, xxiv, xxvi | ||||
| 19q13.32 | rs411560 | MS | xxii, xxiii |