| Literature DB >> 23071489 |
Irina Dinu1, Surakameth Mahasirimongkol, Qi Liu, Hideki Yanai, Noha Sharaf Eldin, Erin Kreiter, Xuan Wu, Shahab Jabbari, Katsushi Tokunaga, Yutaka Yasui.
Abstract
In genome-wide association studies (GWAS), the association between each single nucleotide polymorphism (SNP) and a phenotype is assessed statistically. To further explore genetic associations in GWAS, we considered two specific forms of biologically plausible SNP-SNP interactions, 'SNP intersection' and 'SNP union,' and analyzed the Crohn's Disease (CD) GWAS data of the Wellcome Trust Case Control Consortium for these interactions using a limited form of logic regression. We found strong evidence of CD-association for 195 genes, identifying novel susceptibility genes (e.g., ISX, SLCO6A1, TMEM183A) as well as confirming many previously identified susceptibility genes in CD GWAS (e.g., IL23R, NOD2, CYLD, NKX2-3, IL12RB2, ATG16L1). Notably, 37 of the 59 chromosomal locations indicated for CD-association by a meta-analysis of CD GWAS, involving over 22,000 cases and 29,000 controls, were represented in the 195 genes, as well as some chromosomal locations previously indicated only in linkage studies, but not in GWAS. We repeated the analysis with two smaller GWASs from the Database of Genotype and Phenotype (dbGaP): in spite of differences of populations and study power across the three datasets, we observed some consistencies across the three datasets. Notable examples included TMEM183A and SLCO6A1 which exhibited strong evidence consistently in our WTCCC and both of the dbGaP SNP-SNP interaction analyses. Examining these specific forms of SNP interactions could identify additional genetic associations from GWAS. R codes, data examples, and a ReadMe file are available for download from our website: http://www.ualberta.ca/~yyasui/homepage.html.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23071489 PMCID: PMC3470545 DOI: 10.1371/journal.pone.0043035
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Forty genes with the strongest evidence for association with Crohn's Disease risk, with chromosomal locations, numbers of SNPs, approximate p-values, and Bayes factors.
| Gene Name | Chromosome | #SNPs | p-value | C.BF |
| ISX | 22q12 | 84 | <3.8×10−6 | 148.5 |
| SEMA6A | 5q23 | 152 | <3.8×10−6 | 96.2 |
| GTF3C4 | 9q34 | 4 | <3.8×10−6 | 91.8 |
| PTGFRN | 1p13 | 15 | <3.8×10−6 | 85.5 |
| ADRA1B | 5q33 | 45 | <3.8×10−6 | 82.3 |
| MYLK3 | 16q11 | 2 | <3.8×10−6 | 77.0 |
| HTR3B | 11q23 | 10 | <3.8×10−6 | 75.7 |
| RRP15 | 1q41 | 29 | <3.8×10−6 | 75.4 |
| RGL1 | 1q25 | 20 | <3.8×10−6 | 69.9 |
| SORBS1 | 10q23 | 46 | <3.8×10−6 | 65.5 |
| CALCOCO1 | 12q13 | 15 | <3.8×10−6 | 57.9 |
| TMEM156 | 4p14 | 13 | <3.8×10−6 | 52.7 |
| XRCC6BP1 | 12q14 | 38 | <3.8×10−6 | 45.9 |
| FXR1 | 3q28 | 7 | <3.8×10−6 | 37.7 |
| GARNL1 | 14q13 | 4 | <3.8×10−6 | 34.9 |
| GPR161 | 1q24 | 7 | <3.8×10−6 | 30.9 |
| SORCS1 | 10q23-q25 | 265 | <3.8×10−6 | 30.6 |
| SAC | 1q24 | 13 | <3.8×10−6 | 28.4 |
| LRP1B | 2q21 | 241 | <3.8×10−6 | 27.2 |
| C18orf62 | 18q23 | 79 | <3.8×10−6 | 25.9 |
| CSRP1 | 1q32 | 17 | <3.8×10−6 | 24.2 |
| POU6F2 | 7p14 | 58 | <3.8×10−6 | 22.6 |
| LEF1 | 4q23-q25 | 31 | <3.8×10−6 | 22.3 |
| SEL1L | 14q31 | 170 | <3.8×10−6 | 21.9 |
| SVIP | 11p14 | 88 | <3.8×10−6 | 21.7 |
| VRK1 | 14q32 | 128 | <3.8×10−6 | 19.3 |
| GLRX3 | 10q26 | 79 | <3.8×10−6 | 18.4 |
| ID4 | 6p22 | 79 | <3.8×10−6 | 15.3 |
| CDH10 | 5p14 | 107 | <3.8×10−6 | 14.9 |
| NOD2 | 16q21 | 5 | <3.8×10−6 | 14.6 |
| NHLRC1 | 6p22 | 7 | <3.8×10−6 | 14.0 |
| FMN2 | 1q43 | 60 | <3.8×10−6 | 14.0 |
| IL23R | 1p31 | 11 | <3.8×10−6 | 13.6 |
| PTGER4 | 5p13 | 46 | <3.8×10−6 | 13.5 |
| CTNNA3 | 10q22 | 257 | <3.8×10−6 | 13.3 |
| PNPLA6 | 19p13 | 5 | <3.8×10−6 | 13.0 |
| FBXO15 | 18q22 | 94 | <3.8×10−6 | 12.5 |
| ATG16L1 | 2q37 | 7 | <3.8×10−6 | 12.4 |
| RTP2 | 3q27 | 4 | <3.8×10−6 | 12.0 |
| KCNIP4 | 4p15 | 154 | <3.8×10-6 | 11.8 |
indicates genes in the chromosomal locations where the WTCCC single-SNP analysis showed strong evidence.
indicates genes in the chromosomal locations where the WTCCC single-SNP analysis showed moderate evidence.
indicates chromosomal locations are those with three or more genes in the 195 genes (see Table S1) showing strong evidence in our WTCCC logic-regression-based analysis, but without strong or moderate evidence in the single-SNP analysis of WTCCC.
Logic structures, frequencies, and associated Crohn's Disease odds ratios of the ISX gene (p-value<3.8×10−6).
| rs11089728CC | rs9610191 TT | rs17778240 TT | rs17778240 TT | rs5999715AC | Logic-based Risk Groups | ||
| Genotype Freq | Case N = 1748 | 797 (45.6%) | 10 (0.6%) | 466 (26.7%) | 466 (26.7%) | 214 (12.2%) | |
| Cont N = 2936 | 1326 (45.2%) | 18 (0.6%) | 776 (26.4%) | 776 (26.4%) | 17 (0.6%) | ||
Seventeen genes with the strong evidence for association with Crohn's Disease risk in WTCCC and one or both of Non-Jewish and Jewish dbGap GWASs, with chromosomal locations, numbers of SNPs, approximate p-values, and Bayes factors.
| GWAS Study Name | WTCCC | Non-Jewish dbGap | Jewish dbGap | |||||||
| Sample Size (Cases/Controls) | (1748/2936) | (498/498) | (291/429) | |||||||
| Gene Name | Chromosome | #SNPs | p-value | C.BF | #SNPs | p-value | C.BF | #SNPs | p-value | C.BF |
| IL23R | 1p31 | 11 | <3.8×10−6 | 13.6 | 14 | <3.8×10−6 | 9.3 | 18 | 1.4×10−3 | 3.8 |
| NOD2 | 16q21 | 5 | <3.8×10−6 | 14.6 | 4 | 2.9×10−5 | 5.8 | 4 | 8.7×10−3 | 2.8 |
| TMEM183A | 1q32 | 10 | 7.6×10−5 | 5.3 | 5 | 6.0×10−4 | 4.3 | 5 | 1.9×10−3 | 3.7 |
| SLCO6A1 | 5q21 | 15 | 5.7×10−5 | 5.4 | 11 | 4.7×10−4 | 4.4 | 11 | 1.6×10−2 | 2.4 |
| PTGER4 | 5p13 | 46 | <3.8×10−6 | 13.5 | 50 | 1.6×10−4 | 4.9 | 53 | 1.6×10−1 | 0.9 |
| CYLD | 16q12 | 30 | <3.8×10−6 | 11.2 | 22 | 2.2×10−3 | 3.6 | 21 | 1.1×10−1 | 1.2 |
| SOCS6 | 18q22 | 111 | 3.4×10−5 | 5.6 | 147 | 4.7×10−2 | 1.8 | 145 | 1.5×10−2 | 2.5 |
| ACAD11 | 3q22 | 4 | <3.8×10−6 | 6.2 | 5 | 3.7×10−1 | 0.3 | 5 | 1.2×10−3 | 3.9 |
| CLSTN2 | 3q23 | 120 | <3.8×10−6 | 6.3 | 100 | 4.0×10−4 | 4.4 | 104 | 7.6×10−1 | −0.5 |
| SOX11 | 2p25 | 194 | <3.8×10−6 | 9.6 | 194 | 2.8×10−3 | 3.4 | 188 | 3.0×10−1 | 0.4 |
| CEBPB | 20q13 | 15 | 5.9×10−4 | 4.2 | 27 | 2.2×10−1 | 0.7 | 28 | 5.5×10−3 | 3.1 |
| C1orf141 | 1p31 | 10 | <3.8×10−6 | 10.3 | 6 | 4.1×10−3 | 3.2 | 7 | 3.5×10−1 | 0.3 |
| NEK2 | 1q32-q41 | 11 | 8.8×10−5 | 5.1 | 13 | 4.0×10−1 | 0.2 | 12 | 4.9×10−3 | 3.1 |
| NKX2-3 | 10q24 | 14 | 7.6×10−6 | 6.0 | 7 | 8.4×10−3 | 2.8 | 7 | 2.9×10−1 | 0.5 |
| BSN | 3p21 | 4 | <3.8×10−6 | 7.0 | 3 | 1.2×10−2 | 2.6 | 3 | 7.0×10−1 | −0.4 |
| RBMS3 | 3p24-p23 | 157 | 2.1×10−4 | 4.7 | 177 | 8.8×10−1 | −0.8 | 175 | 6.9×10−3 | 2.9 |
| C10orf57 | 10q22 | 6 | 6.3×10−4 | 4.2 | 9 | 8.1×10−1 | −0.6 | 9 | 1.4×10−2 | 2.5 |
indicates genes in the chromosomal locations where the WTCCC single-SNP analysis showed strong evidence.
indicates chromosomal locations are those with three or more genes in the 195 genes (see Table S1) showing strong evidence in our WTCCC logic-regression-based analysis, but without strong or moderate evidence in the single-SNP analysis of WTCCC.