| Literature DB >> 25743184 |
Hui Guo1, Mary D Fortune2, Oliver S Burren2, Ellen Schofield2, John A Todd2, Chris Wallace3.
Abstract
The genes and cells that mediate genetic associations identified through genome-wide association studies (GWAS) are only partially understood. Several studies that have investigated the genetic regulation of gene expression have shown that disease-associated variants are over-represented amongst expression quantitative trait loci (eQTL) variants. Evidence for colocalisation of eQTL and disease causal variants can suggest causal genes and cells for these genetic associations. Here, we used colocalisation analysis to investigate whether 595 genetic associations to ten immune-mediated diseases are consistent with a causal variant that regulates, in cis, gene expression in resting B cells, and in resting and stimulated monocytes. Previously published candidate causal genes were over-represented amongst genes exhibiting colocalisation (odds ratio > 1.5), and we identified evidence for colocalisation (posterior odds > 5) between cis eQTLs in at least one cell type and at least one disease for six genes: ADAM15, RGS1, CARD9, LTBR, CTSH and SYNGR1. We identified cell-specific effects, such as for CTSH, the expression of which in monocytes, but not in B cells, may mediate type 1 diabetes and narcolepsy associations in the chromosome 15q25.1 region. Our results demonstrate the utility of integrating genetic studies of disease and gene expression for highlighting causal genes and cell types.Entities:
Mesh:
Year: 2015 PMID: 25743184 PMCID: PMC4498151 DOI: 10.1093/hmg/ddv077
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Number of case–control subjects and disease-associated regions for pairwise colocalisation investigation
| Disease | Cases | Controls | Fairfax studies | Meta-analysis | Disease-eQTL. | ||||
|---|---|---|---|---|---|---|---|---|---|
| Regions | Pairs | Regions | Pairs | Overlaps | Suggestive | Convincing | |||
| Autoimmune thyroid disease | 2747 | 9364 | 9 | 125 | 8 | 65 | 6 | 2 | 0 |
| Celiac disease | 12 041 | 12 228 | 28 | 293 | 21 | 132 | 16 | 2 | 1 |
| Crohn's disease | 14 763 | 15 977 | 83 | 1110 | 77 | 619 | 61 | 5 | 2 |
| Multiple sclerosis | 9772 | 17 376 | 60 | 883 | 51 | 484 | 44 | 11 | 1 |
| Narcolepsy | 1886 | 10 421 | 2 | 19 | 2 | 6 | 1 | 1 | 1 |
| Primary biliary cirrhosis | 2861 | 8514 | 17 | 240 | 13 | 122 | 18 | 3 | 2 |
| Psoriasis | 10 588 | 22 806 | 24 | 423 | 22 | 253 | 15 | 1 | 0 |
| Rheumatoid arthritis | 13 838 | 33 742 | 60 | 905 | 58 | 536 | 11 | 0 | 0 |
| Type 1 diabetes | 6693 | 12 420 | 41 | 625 | 37 | 320 | 23 | 6 | 1 |
| Ulcerative colitis | 10 920 | 15 977 | 55 | 787 | 51 | 422 | 35 | 4 | 1 |
| At least one disease | 125 | 28 | 6 | ||||||
| Total | 379 | 5410 | 340 | 2959 | |||||
Probes within ±200 kb of the regions were included. ‘Regions’ gives the number of regions considered for each disease and ‘Pairs’ the number of probe/disease pairwise analyses performed. Given the different chips used for expression analyses, the number of regions differed between the Fairfax (17,19) and Cardiogenics (18) studies. The Disease-eQTL columns summarize the colocalisation analyses, giving the number of genes with a convincing eQTL overlapping a disease region (‘Overlaps’, PP3 + PP4 > 0.99), the number of those with suggestive evidence of colocalisation with the disease signal (‘Suggestive’; PP4/PP3 > 1) and the number of those with convincing evidence for colocalisation (‘Convincing’; PP4/PP3 > 5). The ‘Total’ row gives the total number of analyses performed (a simple sum of the numbers above). The ‘At least one disease’ row gives the number of genes showing eQTLs that overlap with or show suggestive or convincing colocalisation with at least one disease.
Figure 1.Overall results of colocalisation analyses. No evidence for association with either trait is found in most cases. Where association is observed, it is mostly with a single trait. Convincing evidence for colocalisation exists in a small proportion of genes. Shown are three posterior densities (PP0: causal variant for neither trait—black; PP1 + PP2: causal variant for one trait—blue; PP3 + PP4: causal variant(s) for both traits—red) of all pairwise comparisons performed between the diseases and gene expression (unstimulated monocytes and B cells, stimulated monocytes after 24-h IFN, 2-h LPS and 24-h LPS).
Colocalisation is over-represented amongst probe/disease pairs in published candidate causal genes across all five cell types (odds ratio > 1)
| Cell type | In candidate genes | Not in candidate genes | Odds ratio | |||
|---|---|---|---|---|---|---|
| Pairs | Odds | Pairs | Odds | |||
| Resting cells | ||||||
| B cells | 63 | 0.167 | 94 | 0.106 | 1.575 | 0.514 |
| Monocytes | 92 | 0.195 | 267 | 0.077 | 2.532 | 0.017 |
| Stimulated monocytes | ||||||
| IFN, 24 h | 99 | 0.165 | 146 | 0.074 | 2.230 | 0.096 |
| LPS, 2 h | 46 | 0.179 | 127 | 0.085 | 2.106 | 0.252 |
| LPS, 24 h | 50 | 0.163 | 133 | 0.099 | 1.646 | 0.477 |
‘Pairs’ gives the number of probe/disease pairs with evidence of association with both traits (PP3 + PP4 > 0.8) and ‘Odds’ the odds that a pairwise comparison gives greater posterior support to a shared causal variant (PP4 > PP3). Published candidate causal genes are as curated in ImmunoBase.
Six genes showing evidence for colocalisation between gene expression and disease with PP3 + PP4 ≥ 0.99 and PP4/PP3 ≥ 5 and three genes with weaker evidence (PP3 + PP4 ≥ 0.90 and PP4/PP3 ≥ 3)
| Region | Disease | Gene | Probe | B cells | Monocytes | Mono + 24-h IFN | Mono + 2-h LPS | Mono + 24-h LPS | Dir. | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Assoc | Coloc | Assoc | Coloc | Assoc | Coloc | Assoc | Coloc | Assoc | Coloc | |||||
| 1q22 | CRO | ILMN_1751500 | 0.054 | 0.218 | † | |||||||||
| * 1q31.2 | CEL | ILMN_1656011 | 0.312 | 3.168 | 0.013 | 0.839 | 0.476 | 4.672 | − | |||||
| * 1q31.2 | MS | LMN_1656011 | 0.330 | 2.806 | 0.015 | 0.693 | 0.482 | 4.021 | − | |||||
| 9q34.3 | CRO | ILMN_1712532 | 0.062 | 0.125 | + | |||||||||
| 9q34.3 | UC | ILMN_1712532 | 0.067 | 0.128 | + | |||||||||
| 12p13.31 | PBC | ILMN_1667476 | 0.010 | 1.426 | + | |||||||||
| 15q25.1 | NAR | ILMN_2390853 | − | |||||||||||
| 15q25.1 | T1D | ILMN_2390853 | − | |||||||||||
| 22q13.1 | PBC | ILMN_1727805 | 0.020 | 0.350 | 0.024 | 0.606 | 0.060 | 0.392 | ±$ | |||||
| 22q13.1 | PBC | ILMN_1810875 | 0.681 | 0.007 | 0.193 | 0.095 | 0.831 | 0.002 | + | |||||
| 8p23.1 | RA | ILMN_1668277 | 0.030 | 0.212 | 0.100 | 2.752 | 0.032 | 0.633 | − | |||||
| 8q21.12 | MS | ILMN_1789558 | − | |||||||||||
| 8q21.12 | MS | ILMN_2057981 | 0.896 | 2.343 | − | |||||||||
| 12q14.1 | MS | ILMN_1723846 | − | |||||||||||
Assoc is the posterior probability that both the disease is associated with the region and that an eQTL exists in the indicated cell type. Coloc is the posterior odds of colocalisation, given that both traits are associated. When data for a given probe were available from both studies of monocyte expression, we present the results using a meta-analysis of the two. When data were only available from one study (17), we used just that study, indicated by ‘*’. Evidence of colocalising effects is indicated by bold font, evidence of distinct effects by italic font. Allele includes risk (left) and protective alleles (right). Dir. shows whether disease risk across the associated SNPs in a region correlates with increased (+) or decreased (−) mRNA expression of the gene. † is a secondary disease signal, and direction of effect was not published for Crohn's disease. $For 22q13.1/SYNGR1, the PBC risk allele correlated with increase expression in B cells (B) but decreased expression in monocytes (M).
Figure 2.In the gene SYNGR1, expression of ILMN_1810875 is captured by almost all isoforms but that of ILMN_1727805 is captured by only one isoform. Shown is gene context plot of SYNGR1 depicting the positions of three probes.
Figure 3.Effect of prior probability p12 on the prior and posterior support for colocalisation in regions where both disease and eQTL show evidence for association (PP3 + PP4 > 0.8). We repeated the colocalisation analysis with all prior probabilities fixed as specified in Materials and Methods, except p12 which varied between 10−7 and 10−5. For each pairwise analysis in which PP3 + PP4 > 0.8, we plotted the number of SNPs in that region against the relative posterior support for H4, defined by PP4/(PP3 + PP4) (green), and similarly the relative prior support (orange).