| Literature DB >> 19898481 |
Soumya Raychaudhuri1, Brian P Thomson, Elaine F Remmers, Stephen Eyre, Anne Hinks, Candace Guiducci, Joseph J Catanese, Gang Xie, Eli A Stahl, Robert Chen, Lars Alfredsson, Christopher I Amos, Kristin G Ardlie, Anne Barton, John Bowes, Noel P Burtt, Monica Chang, Jonathan Coblyn, Karen H Costenbader, Lindsey A Criswell, J Bart A Crusius, Jing Cui, Phillip L De Jager, Bo Ding, Paul Emery, Edward Flynn, Pille Harrison, Lynne J Hocking, Tom W J Huizinga, Daniel L Kastner, Xiayi Ke, Fina A S Kurreeman, Annette T Lee, Xiangdong Liu, Yonghong Li, Paul Martin, Ann W Morgan, Leonid Padyukov, David M Reid, Mark Seielstad, Michael F Seldin, Nancy A Shadick, Sophia Steer, Paul P Tak, Wendy Thomson, Annette H M van der Helm-van Mil, Irene E van der Horst-Bruinsma, Michael E Weinblatt, Anthony G Wilson, Gert Jan Wolbink, Paul Wordsworth, David Altshuler, Elizabeth W Karlson, Rene E M Toes, Niek de Vries, Ann B Begovich, Katherine A Siminovitch, Jane Worthington, Lars Klareskog, Peter K Gregersen, Mark J Daly, Robert M Plenge.
Abstract
To discover new rheumatoid arthritis (RA) risk loci, we systematically examined 370 SNPs from 179 independent loci with P < 0.001 in a published meta-analysis of RA genome-wide association studies (GWAS) of 3,393 cases and 12,462 controls. We used Gene Relationships Across Implicated Loci (GRAIL), a computational method that applies statistical text mining to PubMed abstracts, to score these 179 loci for functional relationships to genes in 16 established RA disease loci. We identified 22 loci with a significant degree of functional connectivity. We genotyped 22 representative SNPs in an independent set of 7,957 cases and 11,958 matched controls. Three were convincingly validated: CD2-CD58 (rs11586238, P = 1 x 10(-6) replication, P = 1 x 10(-9) overall), CD28 (rs1980422, P = 5 x 10(-6) replication, P = 1 x 10(-9) overall) and PRDM1 (rs548234, P = 1 x 10(-5) replication, P = 2 x 10(-8) overall). An additional four were replicated (P < 0.0023): TAGAP (rs394581, P = 0.0002 replication, P = 4 x 10(-7) overall), PTPRC (rs10919563, P = 0.0003 replication, P = 7 x 10(-7) overall), TRAF6-RAG1 (rs540386, P = 0.0008 replication, P = 4 x 10(-6) overall) and FCGR2A (rs12746613, P = 0.0022 replication, P = 2 x 10(-5) overall). Many of these loci are also associated to other immunologic diseases.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19898481 PMCID: PMC3142887 DOI: 10.1038/ng.479
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Validated RA loci used in functional analyses
We list each of the 16 established RA loci (column 1), and representative SNPs (column 2). Also we list all of the genes in LD with the SNP (column 3); for each SNP the gene in bolded font is the one that GRAIL selected as the most functionally connected gene when that locus was scored against the 15 other validated risk loci.
| Validated RA | Representative | Genes within Associated Regions |
|---|---|---|
| rs2476601 |
| |
| rs2240340 |
| |
| 1p36.32 | rs3890745 |
|
| 1q23.3 | rs6682654 |
|
| rs3087243 |
| |
| 2q32.3 | rs7574865 |
|
| 4q27 | rs6822844 |
|
| 6q23.3 | rs10499194, |
|
| rs6457620, | ||
| 7q21.2 | rs42041 |
|
| 9q33.2 | rs3761847 |
|
| 9p13.3 | rs2812378 |
|
| 10p15.1 | rs4750316 |
|
| 12q13.3 | rs1678542 |
|
| 20q13.12 | rs4810485 |
|
| 22q12.3 | rs3218253 |
|
Loci discovered prior to December 2006.
Figure 1Using Gene Relationships Across Implicated Loci (GRAIL) to prioritize candidate RA SNPs
We select a set of candidate SNPs to pursue in an independent genotyping experiment by starting with all SNPs that obtain p<0.001 in an independent GWAS meta-analysis. Then for each candidate SNP, GRAIL identifies the genomic region in LD, and identifies overlapping genes. It then checks to see how many other loci, already known to be associated with disease, contain functionally related genes. SNPs representing those candidate loci with significantly related genes are forwarded for genotyping in large numbers of independent case-control samples.
Figure 2GRAIL identifies inter-connectivity among genes in RA loci
We place the known RA associated SNP along the outer ring; the internal ring represents the genes near each SNP (as listed in Table 1) with a box. We illustrate the literature-based functional connectivity between these genes with lines drawn between them - the redder and thicker the lines are the stronger the connectivity between the genes is. RA SNPs implicate a small number of highly connected genes – those genes are indicated by labeled boxes.
Figure 3A. GRAIL identifies 22 SNPs among the 179 candidate SNPs with p<0.001 in a GWAS meta-analysis. We plot a histogram of the 179 SNPs as a function of their GWAS meta-analysis p-value. Gray bars represent the 157 SNPs that were not selected, while colored bars represent the 22 SNPs that were selected; purple indicating SNPs that replicated convincingly in follow-up genotyping (p<0.0023), orange indicating nominally associated SNPs in follow-up genotyping (p<0.05), and yellow indicating genotyped SNPs without any independent evidence of association. 3B. Enrichment of SNPs with z-scores >2 in replication samples. For each of the 22 SNPs tested, we calculated a one-sided CMH z-score statistic from our two-staged replication data. A z-score of 0 corresponds to a p=0.5; a z-score of 1.65 corresponds to a p=0.05; and a z-score of 2.83 corresponds to p=0.0023. For a random collection of unassociated SNPs, this histogram should approximate a normal distribution (dotted line).
SNPs tested for RA susceptibility
The first 6 columns list SNP characteristics. The next four columns list GWA meta-analysis results including allele frequencies, a two-tailed p-value for SNP association, and an odds ratio (OR) with respect to the minor allele. The next four columns list similar results for replication genotyping; significance is reported based on stratified one-tailed CMH statistic. The next three columns summarize joint (overall) analysis results. Significance is reported based on stratified two-tailed CMH statistic across all fourteen patient collections. The final column lists the Breslow-Day Test for heterogeneity of odds ratios across all fourteen collections (3 from the meta-analysis and 11 from the replication study).
| SNP | Meta-Analysis | Replication | Joint | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ID | Chr | Pos (HG18) | Gene(s) | Allele | p | OR | Minor Allele | p | OR | Minor Allele | p | OR | Breslow- | |||
| Major | Minor | Control | Case | Control | Case | p | ||||||||||
|
| ||||||||||||||||
|
| 117,064,661 |
| C | G | 2.0E-04 | 1.14 | 0.237 | 0.260 | 1.4E-06 | 1.12 | 0.228 | 0.254 | 1.0E-09 | 1.13 | 0.29 | |
|
| 204,318,641 |
| T | C | 4.2E-05 | 1.16 | 0.230 | 0.255 | 4.7E-06 | 1.11 | 0.237 | 0.255 | 1.3E-09 | 1.13 | 0.81 | |
|
| 106,674,727 |
| T | C | 3.4E-04 | 1.12 | 0.328 | 0.351 | 1.2E-05 | 1.10 | 0.323 | 0.343 | 2.1E-08 | 1.11 | 0.66 | |
|
| 159,402,509 |
| T | C | 5.6E-04 | 0.89 | 0.302 | 0.269 | 1.5E-04 | 0.92 | 0.286 | 0.270 | 3.8E-07 | 0.91 | 0.63 | |
|
| 196,967,065 |
| G | A | 3.8E-04 | 0.84 | 0.128 | 0.108 | 2.6E-04 | 0.90 | 0.132 | 0.117 | 6.7E-07 | 0.88 | 0.64 | |
| rs540386 |
| 36,481,869 | C | T | 6.1E-04 | 0.86 | 0.142 | 0.130 | 8.3E-04 | 0.91 | 0.145 | 0.130 | 3.9E-06 | 0.89 | 0.08 | |
|
| 159,733,666 |
| C | T | 9.1E-04 | 1.16 | 0.120 | 0.133 | 0.0022 | 1.10 | 0.124 | 0.130 | 1.5E-05 | 1.12 | 0.25 | |
|
| ||||||||||||||||
|
| 12,867,060 |
| A | G | 1.9E-04 | 1.16 | 0.158 | 0.179 | 0.013 | 1.06 | 0.164 | 0.172 | 4.4E-05 | 1.10 | 0.61 | |
| rs4535211 |
| 17,048,001 |
| G | A | 4.4E-04 | 0.90 | 0.489 | 0.457 | 0.015 | 0.96 | 0.474 | 0.461 | 8.9E-05 | 0.94 | 0.524 |
| rs1773560 |
| 165,688,387 |
| A | G | 4.4E-04 | 0.90 | 0.421 | 0.385 | 0.021 | 0.96 | 0.414 | 0.401 | 1.5E-04 | 0.94 | 0.74 |
| rs892188 |
| 10,270,793 | C | T | 4.6E-05 | 1.13 | 0.378 | 0.409 | 0.041 | 1.05 | 0.393 | 0.401 | 4.3E-05 | 1.08 | 0.21 | |
| rs4272626 |
| 116,149,950 |
| C | T | 3.5E-04 | 1.12 | 0.359 | 0.388 | 0.042 | 1.04 | 0.354 | 0.362 | 1.9E-04 | 1.07 | 0.07 |
| rs231707 |
| 2,664,183 |
| G | A | 6.0E-04 | 1.14 | 0.178 | 0.195 | 0.048 | 1.05 | 0.172 | 0.184 | 5.3E-04 | 1.08 | 0.23 |
|
| ||||||||||||||||
| rs2276418 |
| 117,735,474 |
| A | T | 4.0E-04 | 1.16 | 0.142 | 0.161 | 0.077 | 1.04 | 0.155 | 0.155 | 9.5E-04 | 1.08 | 0.11 |
| rs3176767 |
| 10,310,751 |
| T | G | 1.0E-04 | 1.15 | 0.224 | 0.245 | 0.09 | 1.03 | 0.229 | 0.233 | 6.9E-04 | 1.07 | 0.60 |
| rs10282458 |
| 149,676,235 |
| G | A | 9.1E-04 | 1.12 | 0.259 | 0.282 | 0.23 | 1.02 | 0.260 | 0.266 | 4.4E-03 | 1.06 | 0.045 |
| rs7041422 |
| 21,034,021 |
| T | G | 4.7E-04 | 1.12 | 0.300 | 0.331 | 0.24 | 1.02 | 0.297 | 0.301 | 4.4E-03 | 1.06 | 0.86 |
| rs9564915 |
| 72,223,143 |
| A | G | 4.3E-04 | 1.12 | 0.319 | 0.341 | 0.27 | 1.01 | 0.317 | 0.315 | 0.008 | 1.05 | 0.14 |
| rs13393256 |
| 47,140,263 |
| C | A | 6.9E-04 | 1.13 | 0.210 | 0.227 | 0.44 | 1.00 | 0.221 | 0.221 | 0.014 | 1.06 | 0.14 |
| rs7579737 |
| 102,353,793 |
| A | G | 8.2E-04 | 0.89 | 0.307 | 0.274 | 0.93 | 1.04 | 0.295 | 0.308 | 0.483 | 0.99 | 0.023 |
| rs2614394 |
| 42,568,433 |
| G | A | 9.8E-05 | 0.81 | 0.098 | 0.082 | 0.94 | 1.08 | 0.099 | 0.105 | 0.06 | 0.94 | 0.002 |
| rs9359049 |
| 74,758,649 |
| T | A | 2.7E-05 | 1.27 | 0.068 | 0.081 | 0.94 | 0.94 | 0.079 | 0.071 | 0.14 | 1.05 | 0.0155 |
These SNPs are close to other loci already associated to autoimmune disease.
Tested SNPs near other alleles associated with autoimmune diseases
Seven of the 22 SNPs tested are near loci already associated with autoimmune diseases. In the first three columns we list the SNPs, cytogenetic location, and the likely candidate gene. In the next three columns we list the published SNP, the attributed gene, and the disease associations. In the final three columns we list the physical distance and measures of LD. For PTPRC the published SNP is rare, and LD cannot be accurately assessed.
| SNP | Published SNP | Proximity | ||||||
|---|---|---|---|---|---|---|---|---|
| ID | Chr | Gene | ID | Gene | Disease | Distance | r2 | D’ |
| rs12746613 |
|
| rs1801274 |
| Systemic Lupus | 12.7 | 0.19 | 1.00 |
| rs394581 |
|
| rs1738074 |
| Celiac Disease, | 16.5 | 0.32 | 0.73 |
| rs10919563 |
|
| rs17612648 |
| Multiple Sclerosis | 34.5 | - | - |
| rs7234029 |
|
| rs478582 |
| Type I Diabetes | 41.1 | 0.14 | 1.00 |
| rs1980422 |
|
| rs3087243 |
| Type I Diabetes, | 128.5 | 0.04 | 0.40 |
| rs548234 |
|
| rs7746082 |
| Crohn’s Disease | 132.8 | 0.01 | 0.08 |
| rs11586238 |
|
| rs2300747 |
| Multiple Sclerosis | 158.9 | 0.01 | 0.29 |