| Literature DB >> 22961000 |
Jimmy Z Liu1, Mohamed A Almarri, Daniel J Gaffney, George F Mells, Luke Jostins, Heather J Cordell, Samantha J Ducker, Darren B Day, Michael A Heneghan, James M Neuberger, Peter T Donaldson, Andrew J Bathgate, Andrew Burroughs, Mervyn H Davies, David E Jones, Graeme J Alexander, Jeffrey C Barrett, Richard N Sandford, Carl A Anderson.
Abstract
We genotyped 2,861 cases of primary biliary cirrhosis (PBC) from the UK PBC Consortium and 8,514 UK population controls across 196,524 variants within 186 known autoimmune risk loci. We identified 3 loci newly associated with PBC (at P<5×10(-8)), increasing the number of known susceptibility loci to 25. The most associated variant at 19p12 is a low-frequency nonsynonymous SNP in TYK2, further implicating JAK-STAT and cytokine signaling in disease pathogenesis. An additional five loci contained nonsynonymous variants in high linkage disequilibrium (LD; r2>0.8) with the most associated variant at the locus. We found multiple independent common, low-frequency and rare variant association signals at five loci. Of the 26 independent non-human leukocyte antigen (HLA) signals tagged on the Immunochip, 15 have SNPs in B-lymphoblastoid open chromatin regions in high LD (r2>0.8) with the most associated variant. This study shows how data from dense fine-mapping arrays coupled with functional genomic data can be used to identify candidate causal variants for functional follow-up.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22961000 PMCID: PMC3459817 DOI: 10.1038/ng.2395
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
6]. At 12 of the genome-wide significant loci, the most associated SNP was different to that previously reported (Supplementary Table 3). There was little difference in the effect-size estimates between the GWAS tagging SNP and the most strongly associated Immunochip SNP (Supplementary Figure 7), although this may be due to a large proportion of overlapping samples between the two studies (Online Methods).
Figure 1Manhattan plot and list of genome-wide significant PBC risk loci across Immunochip
Novel risk loci are highlighted in blue. Loci with more than one independent signal are highlighted in red. The vertical red line indicates the genome-wide significance threshold of P=5×10−8. The peak on chromosome 6 is the HLA region.
PBC risk loci at genome-wide significance
| Chr | SNP | RA | RAF |
| OR | LD region | Nearby gene(s) | Functional |
|---|---|---|---|---|---|---|---|---|
| (95% CI) | (size) | |||||||
| 1p31 | rs72678531 | G | 0.17 | 2.47×10−38 | 1.61 | 67,560,940-67,592,782 |
| OC |
| (1.49-1.73) | (31,842) | |||||||
| 1q31 | rs2488393 | A | 0.21 | 4.29×10−12 | 1.28 | 195,609,003-196,047,821 |
| |
| (1.19-1.37) | (438,818) | |||||||
| 2q32 | rs3024921 | A | 0.06 | 2.59×10−18 | 1.62 | 191,651,517-191,651,517 |
| |
| (1.45-1.80) | (0) | |||||||
| 2q32 | Second signal | A | 0.22 | 1.38×10−13 | 1.31 | 191,651,987-191,681,279 |
| |
| rs7574865 | (1.22-1.40) | (29,292) | ||||||
| 3q13 | rs2293370 | G | 0.8 | 6.84×10−16 | 1.39 | 120,598,840-120,734,898 |
| NS |
| (1.29-1.52) | (136,058) | |||||||
| 3q25 | rs2366643 | A | 0.57 | 3.92×10−22 | 1.35 | 161,202,965-161,219,770 |
| OC |
| (1.27-1.44) | (16,805) | |||||||
| 3q25 | Second signal | G | 0.15 | 5.74×10−17 | 1.41 | 161,122,353-161,174,976 |
| OC |
| rs62270414 | (1.30-1.53) | (52,623) | ||||||
| 3q25 | Third signal | G | 0.43 | 4.73×10−9 | 1.26 | 161,192,695-161,198,245 |
| OC |
| rs668998 | (1.17-1.36) | (5,550) | ||||||
| 3q25 | Fourth signal | A | 0.004 | 2.64×10−11 | 3.44 | 161,108,087-161,176,747 |
| |
| rs80014155 | (2.39-4.94) | (68,660) | ||||||
| 4q24 | rs7665090 | G | 0.52 | 8.48×10−14 | 1.26 | 103,770,651-103,770,651 |
| NS,eQTL |
| (1.19-1.34) | (0) | |||||||
| 5p13 | rs6871748 | A | 0.72 | 2.26×10−13 | 1.3 | 35,885,906-35,921,739 |
| NS |
| (1.21-1.4) | (35,833) | |||||||
| 7q32 | rs35188261 | A | 0.17 | 6.52×10−22 | 1.52 | 128,372,852-128,499,110 |
| OC |
| (1.39-1.63) | (126,258) | |||||||
| 7q32 | Second signal | G | 0.47 | 4.12×10−9 | 1.22 | 128,361,203-128,367,916 |
| OC,eQTL |
| rs3807307 | (1.14-1.30) | (6,713) | ||||||
| 11q23 | rs80065107 | A | 0.79 | 7.20×10−16 | 1.39 | 118,115,759-118,248,982 |
| OC |
| (1.28-1.5) | (133,223) | |||||||
| 12p13 | rs1800693 | G | 0.4 | 1.18×10−14 | 1.27 | 6,310,270-6,323,072 |
| OC |
| (1.19-1.34) | (12,802) | |||||||
| 12p13 | Second signal | A | 0.25 | 1.69×10−9 | 1.23 | 6,362,910-6,362,910 |
| OC |
| rs11064157 | (1.15-1.32) | (0) | ||||||
| 12q24 |
| A | 0.44 | 2.87×10−9 | 1.2 | 110,368,991-111,095,097 |
| NS |
| (1.13-1.27) | (726,106) | |||||||
| 14q24 | rs911263 | A | 0.71 | 9.95×10−11 | 1.26 | 67,823,346-67,823,346 |
| |
| (1.17-1.35) | (0) | |||||||
| 16p13 | rs1646019 | G | 0.71 | 6.72×10−15 | 1.31 | 11,254,549-11,273,001 |
| OC,eQTL |
| (1.23-1.41) | (18,452) | |||||||
| 16p13 | Second signal | C | 0.68 | 2.19×10−13 | 1.29 | 10,999,820-11,117,948 |
| OC,eQTL |
| rs12708715 | (1.21-1.38) | (118,128) | ||||||
| 16p13 | Third signal | A | 0.004 | 2.69×10−8 | 2.96 | 11,281,298-11,281,298 |
| OC |
| rs80073729 | (2.02-4.33) | (0) | ||||||
| 16q24 | rs11117433 | G | 0.77 | 1.41×10−9 | 1.26 | 84,577,017-84,577,017 |
| OC |
| (1.17-1.36) | (0) | |||||||
| 17q12 | rs8067378 | G | 0.52 | 6.05×10−14 | 1.26 | 35,158,633-35,336,333 |
| NS,OC,eQTL |
| (1.19-1.34) | (177,700) | |||||||
| 17q21 |
| G | 0.24 | 2.15×10−9 | 1.25 | 41,047,160-42,211,804 |
| NS,OC,eQTL |
| (1.16-1.35) | (1,164,644) | |||||||
| 19p12 |
| G | 0.95 | 1.23×10−12 | 1.91 | 10,324,118-10,324,118 |
| NS |
| (1.59-2.28) | (0) | |||||||
| 22q13 | rs2267407 | A | 0.23 | 1.29×10−13 | 1.29 | 38,076,996-38,086,596 |
| OC,eQTL |
| (1.21-1.38) | (9,600) |
Non HLA PBC risk loci meeting genome-wide significance (P < 5 × 10−8) are shown.
Most significant SNP in the locus. Novel associations are highlighted in bold.
Risk allele.
Risk allele frequency in controls.
For primary signals, p-values were obtained from Cochran-Armitage tests for trend. For second, third and fourth association signals, p-values were obtained using logistic regression conditioning on the previous independent SNPs.
Regions of high linkage disequilibrium defined by SNPs with r2 > 0.8. See Supplementary Figure 10 for regional locus plots.
RefSeq UCSC hg18 track.
Denotes if there are SNPs with r2 > 0.8 with the hit SNP that lie within OC (open chromatin peaks), are non-synonymous (NS) or are expression quantitative trait locus (eQTL). Full list of SNPs is given in Supplementary Table 6-8.
Figure 2Enrichment of DNase-seq peaks among PBC risk loci in Gm12878 compared to other ENCODE cell lines
The relative enrichment (E) of SNPs within DNase-seq peaks was calculated across the 35 most associated loci. There is suggestive, though non-significant, evidence that genome-wide significant loci (P<5×10−8 - vertical blue line) are more likely to lie within DNase-seq peaks in B-lymphoblastoid cell lines (solid red line) than they are to lie within the union of all other annotated cell lines (solid black line) (P=0.068). Dotted grey lines denote E for other annotated cell lines. The shaded grey area represents the 95% confidence interval of E for Gm12878 from 1000 permutations. Cell types: Gm12878: B-lymphoblastoid, H1hesc: embryonic stem cells, H9es: embryonic stem cells, Helas3: cervical carcinoma, Hepg2: liver carcinoma, Hsmm: skeletal muscle myoblasts, Huvec: umbilical vein endothelial cells, K562: leukemia, Lhsr: prostate epithelial cells, Mcf7: mammary gland adenocarcinoma, Medullo: medulloblastoma, Melano: epidermal melanocytes, Myometr: Myometrial cells, Nhbe: bronchial epithelial cells, Nhek: epidermal keratinocytes, Panislets: pancreatic islets, Progfib: fibroblasts.