| Literature DB >> 21383967 |
Alexandra Zhernakova1, Eli A Stahl, Gosia Trynka, Soumya Raychaudhuri, Eleanora A Festen, Lude Franke, Harm-Jan Westra, Rudolf S N Fehrmann, Fina A S Kurreeman, Brian Thomson, Namrata Gupta, Jihane Romanos, Ross McManus, Anthony W Ryan, Graham Turner, Elisabeth Brouwer, Marcel D Posthumus, Elaine F Remmers, Francesca Tucci, Rene Toes, Elvira Grandone, Maria Cristina Mazzilli, Anna Rybak, Bozena Cukrowska, Marieke J H Coenen, Timothy R D J Radstake, Piet L C M van Riel, Yonghong Li, Paul I W de Bakker, Peter K Gregersen, Jane Worthington, Katherine A Siminovitch, Lars Klareskog, Tom W J Huizinga, Cisca Wijmenga, Robert M Plenge.
Abstract
Epidemiology and candidate gene studies indicate a shared genetic basis for celiac disease (CD) and rheumatoid arthritis (RA), but the extent of this sharing has not been systematically explored. Previous studies demonstrate that 6 of the established non-HLA CD and RA risk loci (out of 26 loci for each disease) are shared between both diseases. We hypothesized that there are additional shared risk alleles and that combining genome-wide association study (GWAS) data from each disease would increase power to identify these shared risk alleles. We performed a meta-analysis of two published GWAS on CD (4,533 cases and 10,750 controls) and RA (5,539 cases and 17,231 controls). After genotyping the top associated SNPs in 2,169 CD cases and 2,255 controls, and 2,845 RA cases and 4,944 controls, 8 additional SNPs demonstrated P<5 × 10(-8) in a combined analysis of all 50,266 samples, including four SNPs that have not been previously confirmed in either disease: rs10892279 near the DDX6 gene (P(combined) = 1.2 × 10(-12)), rs864537 near CD247 (P(combined) = 2.2 × 10(-11)), rs2298428 near UBE2L3 (P(combined) = 2.5 × 10(-10)), and rs11203203 near UBASH3A (P(combined) = 1.1 × 10(-8)). We also confirmed that 4 gene loci previously established in either CD or RA are associated with the other autoimmune disease at combined P<5 × 10(-8) (SH2B3, 8q24, STAT4, and TRAF1-C5). From the 14 shared gene loci, 7 SNPs showed a genome-wide significant effect on expression of one or more transcripts in the linkage disequilibrium (LD) block around the SNP. These associations implicate antigen presentation and T-cell activation as a shared mechanism of disease pathogenesis and underscore the utility of cross-disease meta-analysis for identification of genetic risk factors with pleiotropic effects between two clinically distinct diseases.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21383967 PMCID: PMC3044685 DOI: 10.1371/journal.pgen.1002004
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Comparison of CD and RA risk alleles at seven shared risk loci.
| Locus | Locus name | Top CD SNP | Top RA SNP | LD between CD and RA SNPs (D' and r2) | Comment | |
| 1p36.3 | rs3890745 | rs3748816 | 1 | 0.93 | Same allele, same direction | |
| 2p16.1 | rs13003464 | rs13031237 | 0.11 | 0.01 | Different alleles | |
| 2q33.2 | rs4675374 | rs3087243 | 0.80 | 0.12 | Incomplete LD | |
| 4q27 | rs13151961 | rs6822844 | 1 | 0.90 | Same allele, same direction | |
| 6p21 | rs2187668 (DQA1*0501-DQB1*0201 tag) | rs6910071 (DRB1*0401 tag) | 1 | 0.04 | Different alleles | |
| 6q23.3 | rs2327832 | rs6920220 | 1 | 1 | Same allele, same direction | |
| 6q25.3 | rs1738074 | rs212389 (Chen et al, submitted) | 0.56 | 0.27 | Different alleles | |
Columns Top SNP CD and Top SNP RA – best reported SNP in the locus, as indicated in the reference paper. Association is indicated to the same allele if the r2 between CD and RA SNP is above 0.9.
Figure 1Established CD and RA SNPs and their association across diseases.
(A) Known CD SNPs in RA. The figure represents OR and CI for the established CD SNPs (p<5×10−8, one SNP per locus) in RA meta-analysis (5,539 auto-anbitody positive cases and 17,231 controls). (B) Known RA SNPs in CD. The figure represents OR and CI for the established RA SNPs (p<5×10−8, one SNP per locus) in CD meta-analysis (4,533 cases and 10,750 controls). For the six shared loci established in both diseases, figure 1A includes the top CD SNP and figure 1B the top RA SNP. From six shared loci, three (TNFRSF14, IL2/IL21 and TNFAIP3) are associated with same SNP or a good proxy (r2>0.9) in both diseases; in other three loci – CTLA4, REL and TAGAP – the most associated SNPs in CD and RA are not in strong LD with each other (r2<0.3), which is reflected in moderate association (CTLA4) or no association (REL) of these SNPs in the second disease. The TAGAP SNPs show association to opposite alleles in CD and RA.
Figure 2QQ plot of CD associated SNPs in RA and RA associated SNPs in CD.
QQ plot of CD associated SNPs (p<0.001) in RA (green) and RA associated SNPs (p<0.001) in CD (black). The most strongly associated SNPs (after removing known risk loci) in one disease were further filtered for P<0.001, and the resulting LD-pruned SNP sets were then tested for their distribution of association in the other disease. The QQ-plots indicate excess sharing of moderately associated SNPs across CD and RA.
Figure 3QQ plot of CD-RA meta-analysis by directional method and opposite allelic effect.
CD-RA inverse variance weighted meta-analysis assuming allelic effects in the same direction in the two diseases (panel A) and opposite allelic effects (panel B). Black – all loci except the MHC region (chr. 6: 20–40 Mb). Green – all loci except MHC and established CD and RA regions (1 MB around previously validated SNPs excluded).
CD-RA meta-analysis and replication, directional analysis.
| CD_GWAS | CD_Repl | CD_GWAS + Repl | RA_GWAS | RA_Repl | RA_GWAS + Repl | CD-RA GWAS+Repl | |||||||||||||
| ChrCHR | SNP | Min All | Locus | Status was | CD-RA P GWAS | CD P GWAS | CD OR GWAS | CD Repl* | CD OR Repl | CD P Meta | CD OR Meta | RA P GWAS | RA OR GWAS | RA Repl* | RA OR Repl | RA P Meta | RA OR Meta | CD-RA | Status now |
| 12q24.1 | rs653178 | C | CD | 1.3E-11 | 8.2E-13 | 1.21 | 1.18 | 1.19 | 8.8E-03 | 1.07 | 1.09 | 1.08 | CDRA | ||||||
| 8q24.2 | rs975730 | A | CD | 4.5E-06 | 1.6E-04 | 0.9 | 0.93 | 0.91 | 5.3E-03 | 0.93 | 0.91 | 0.92 | CDRA | ||||||
| 18p11.2 | rs1893217 | G | CD | 4.3E-10 | 1.8E-06 | 1.18 | 5.2E-05A | 1.17 | 7.9E-10 | 1.17 | 4.4E-05 | 1.14 | 0.075C | 1.05 | 1.2E-04 | 1.09 | 4.6E-12 | CD | |
| 16p13,1 | rs243323 | G | CD | 6.6E-06 | 4.1E-04 | 0.9 | 0.013 B | 0.90 | 3.0E-05 | 0.9 | 3.9E-03 | 0.93 | 0.36D | 1.01 | 0.032 | 0.95 | 1.4E-05 | CD | |
| 9q33.2 | rs1953126 | T | RA | 2.8E-06 | 9.5E-03 | 1.08 | 1.10 | 1.08 | 7.2E-05 | 1.1 | 1.17 | 1.13 | CDRA | ||||||
| 2q32.3 | rs7574865 | T | RA | 4.0E-08 | 7.7E-03 | 1.09 | 1.10 | 1.09 | 5.1E-07 | 1.16 | 1.10 | 1.14 | CDRA | ||||||
| 5q11.2 | rs1020388 | G | RA | 7.7E-09 | 1.5E-04 | 0.9 | 0.32 A | 0.99 | 2.1E-03 | 0.94 | 1.3E-05 | 0.9 | NA | NA | NA | NA | 3.1E-07 | RA | |
| 11q23.3 | rs10892279 | A | none | 2.0E-07 | 5.3E-03 | 0.91 | 0.83 | 0.89 | 7.4E-06 | 0.87 | 0.86 | 0.87 | CDRA | ||||||
| 1q24.2 | rs864537 | G | none | 2.6E-10 | 3.8E-07 | 0.87 | 0.95 | 0.91 | 8.9E-05 | 0.9 | 0.92 | 0.91 | CDRA | ||||||
| 22q11.2 | rs2298428 | T | none | 6.8E-09 | 8.7E-07 | 1.18 | 1.08 | 1.13 | 8.2E-04 | 1.11 | 1.07 | 1.09 | CDRA | ||||||
| 21q22.3 | rs11203203 | A | none | 3.1E-07 | 2.0E-03 | 1.1 | 1.08 | 1.10 | 4.2E-05 | 1.11 | 1.08 | 1.10 | CDRA | ||||||
| 7p14.1 | rs11984075 | G | none | 3.1E-07 | 2.0E-03 | 1.15 | 0.052B | 1.12 | 5.2E-04 | 1.14 | 3.9E-05 | 1.19 | 0.07F | 1.09 | 2.7E-05 | 1.15 | 5.2E-08 | none | |
| 12q14.1 | rs10876993 | C | none | 9.9E-06 | 1.3E-03 | 0.91 | 0.062B | 0.93 | 4.0E-04 | 0.92 | 2.3E-03 | 0.92 | 0.19F | 0.97 | 2.4E-03 | 0.94 | 3.7E-06 | none | |
| 10q26.1 | rs6585827 | A | none | 6.4E-07 | 9.9E-04 | 1.09 | 0.20B | 1.04 | 1.2E-03 | 1.08 | 1.90E-04 | 1.09 | 0.31D | 0.98 | 6.0E-03 | 1.06 | 2.7E-05 | none | |
Directional meta-analysis of CD and RA. CD-RA GWAS meta-analysis, GWAS results for CD and RA, replication and combined meta-analysis results for loci that achieve P<1×10−5 in CD-RA meta-analysis and P<0.01 in GWAS in CD and RA risk (one SNP per locus). Listed are the rs ID for each SNP, the chromosome, one candidate gene in the region (selected by GRAIL analysis, expression analysis or manually based on immunological function; see text and Tables S8, S9, S10 and Table 4), minor alleles (column MinAll) (positive strand in HapMap release 22, major/minor based on frequency in CD GWAS controls). The association P-value and odds ratio (OR) with respect to minor allele are listed for CD and RA GWAS, CD-RA GWAS meta-analysis, replication analyses in each disease and meta-analysis of CD-RA GWAS and replication cohorts. Loci established in both diseases are not included in this table. A – replication for this SNP in CD was done in Dubois et al paper [2]; B – replication in current study with all available CD samples as described in Materials and Methods; C - replication for this SNP in RA was done in Stahl et al paper [3]; D - replication in current study with all available RA samples, as described in methods; E – replication in current study with RA replication cohort R1, R2, R3 and R5 (failed in R4); F – replication in current study with RA replication cohorts R1, R2, R3 and R4 (failed in R5) (see Table S11 for cohort description); G – ICOSLG (established CD locus) and PRKCQ (established RA locus) SNPs were not included to the replication and not mentioned in this table. H - The CD-RA P META MAX column shows the P-overall for SNP tested in replication in both CD and RA. For one SNP (ANKRD55) the replication results were available only in CD replication cohort, in this case the CD-RA P META MAX column shows the p-value for the meta-analysis of CD-RA GWAS and CD-replication. P-values are 2-tailed except for replication p-values in CD and RA, indicated by *.
Analysis of candidate genes within associated blocks.
| Shared SNP | Chr | Block_start | Block_end | GRAIL annotated genes in block | GRAIL selected genes, p<0.1 | Cis eQTL transcripts in LD block | Likely candidate gene for CDRA based on GRAIL and eQTL analysis | Evidence |
| rs3748816 | 1p36.3 | 2396198 | 2742203 | GRAIL, eQTL | ||||
| rs864537 | 1q24.2 | 165663466 | 165707466 | GRAIL, single gene | ||||
| rs6706689 | 2p16.1 | 60920853 | 61033853 | none | GRAIL | |||
| rs7574865 | 2q32.3 | 191300239 | 191681739 | none | GRAIL | |||
| rs231804 | 2q33.2 | 204397239 | 204524239 | none | GRAIL | |||
| rs13151961 | 4q27 | 123202845 | 123784345 | none | GRAIL | |||
| rs6933404 | 6q23.3 | 137750000 | 138315000 | none | GRAIL | |||
| rs212388 | 6q25.3 | 159391579 | 159435579 | GRAIL, eQTL | ||||
| rs975730 | 8q24.2 | 129129000 | 129812500 | none | none | single gene | ||
| rs1953126 | 9q33.3 | 122674767 | 123013267 | GRAIL, eQTL | ||||
| rs10892279 | 11q23.3 | 118080000 | 118252500 | none | none | single gene | ||
| rs653178 | 12q24.1 | 110322163 | 111513663 | GRAIL, eQTL | ||||
| rs11203203 | 21q22.3 | 42681500 | 42750500 | eQTL | ||||
| rs2298428 | 22q11.2 | 19786946 | 20317446 | none | eQTL |
All chromosomal positions are based on NCBI build-36 coordinates. Grail selected genes column included genes with p-value P<0.1 as annotate by GRAIL, see Table S8. Cis eQTLs indicate genes that showed correlation in expression with associated SNP, see Table S9 and Figure S2.
CD-RA meta-analysis GWAS and replication—opposite allelic effect method.
| CD GWAS | CD Repl | CD GWAS + Repl | RA GWAS | RA Repl | RA GWAS + Repl | CD+RA GWAS+Repl | |||||||||||||
| CHR | SNP | Min All | locus | STATUS WAS | CD-RA GWAS | CD P GWAS | CD OR GWAS | CD Repl* | CD OR Repl | CD P Meta | CD OR Meta | RA P GWAS | RA OR GWAS | RA Repl* | RA OR Repl | RA P Meta | RA OR Meta | CD-RA P Meta MAX F | STATUS NOW |
| 2p14 | rs1876518 | T | RA | 2.2E-08 | 4.2E-03 | 0.93 | 0.18A | 1.04 | 5.4E-02 | 0.96 | 7.7E-07 | 1.13 | 1.4E-03B | 1.11 | 8.5E-09 | 1.12 | 1.9E-08 | RA | |
| 2p23.1 | rs7579944 | T | none | 3.2E-08 | 6.5E-05 | 1.12 | 0.025 A | 1.09 | 9.7E-06 | 1.11 | 1.3E-04 | 0.90 | 0.13 B | 0.96 | 2.2E-04 | 0.92 | 1.1E-08 | none | |
| 1q23.1 | rs1772408 | A | none | 5.0E-06 | 2.0E-04 | 1.15 | 0.016 A | 1.12 | 2.0E-05 | 1.14 | 4.5E-03 | 0.91 | 0.20D | 0.96 | 4.6E-03 | 0.93 | 8.4E-07 | none | |
| 7q34 | rs4626515 | C | none | 4.6E-07 | 3.2E-03 | 0.88 | 0.12 A | 1.08 | 0.070 | 0.93 | 4.1E-05 | 1.17 | 0.17 B | 1.06 | 7.9E-05 | 1.13 | 2.8E-05 | none | |
| 11p15 | rs11043097 | C | none | 2.0E-07 | 6.1E-05 | 1.15 | 0.45 A | 0.99 | 8.7E-04 | 1.10 | 7.5E-04 | 0.90 | 0.23C | 1.04 | 0.010 | 0.93 | 3.3E-05 | none | |
| 14q13.2 | rs17103033 | C | none | 6.8E-06 | 2.8E-03 | 1.12 | 0.27 A | 0.97 | 0.032 | 1.07 | 7.6E-04 | 0.89 | 0.42 D | 0.99 | 3.8E-03 | 0.92 | 3.4E-04 | none | |
Opposite allelic effect meta-analysis of CD and RA. CD-RA GWAS meta-analysis, GWAS results for CD and RA, replication and combined meta-analysis results for loci that achieve P<1×10-5 in CD-RA meta-analysis and P<0.01 in GWAS in CD and RA risk (one SNP per loci). Listed are the rs ID for each SNP, the chromosomal location, one candidate gene in the region (selected by GRAIL analysis, expression analysis or manually based on immunological function; see text and Tables S8, S9, S10 and Table 4), minor alleles (column MinAll) (positive strand in HapMap release 22, major/minor based on frequency in CD GWAS controls). The association P-value and odds ratio (OR) with respect to minor allele are listed for CD and RA GWAS, CD-RA-meta analysis on GWAS, replication analyses in each disease and meta-analysis of CD-RA GWAS and replication cohorts. A – replication in current study with all available CD samples, as described in Materials and Methods; B - replication in current study with all available RA samples; C – replication in current study with RA replication cohort R1, R2 and R3 (failed in R4 and R5); D – replication in current study with RA replication cohorts R1, R2, R3 and R4 (failed in R5), see Table S11 for cohort description; E - The TGFBR3 SNP fit the replication criteria but was not included to the replication step for technical reasons. F - The CD-RA P META MAX column shows the P-overall for SNP included to the replication in both CD and RA. Rs19126261 in TGFBR3 locus was not included to the replication step; in this case the CD-RA P META MAX column shows the p-value for GWAS meta-analysis. P-values are 2-tailed except for replication p-values in CD and RA, indicated by *.