| Literature DB >> 27861577 |
Paul Martin1, Amanda McGovern1, Jonathan Massey1, Stefan Schoenfelder2, Kate Duffus1, Annie Yarwood1, Anne Barton1,3, Jane Worthington1,3, Peter Fraser2, Stephen Eyre1, Gisela Orozco1.
Abstract
BACKGROUND: The chromosomal region 6q23 has been found to be associated with multiple sclerosis (MS) predisposition through genome wide association studies (GWAS). There are four independent single nucleotide polymorphisms (SNPs) associated with MS in this region, which spans around 2.5 Mb. Most GWAS variants associated with complex traits, including these four MS associated SNPs, are non-coding and their function is currently unknown. However, GWAS variants have been found to be enriched in enhancers and there is evidence that they may be involved in transcriptional regulation of their distant target genes through long range chromatin looping. AIM: The aim of this work is to identify causal disease genes in the 6q23 locus by studying long range chromatin interactions, using the recently developed Capture Hi-C method in human T and B-cell lines. Interactions involving four independent associations unique to MS, tagged by rs11154801, rs17066096, rs7769192 and rs67297943 were analysed using Capture Hi-C Analysis of Genomic Organisation (CHiCAGO).Entities:
Mesh:
Year: 2016 PMID: 27861577 PMCID: PMC5115837 DOI: 10.1371/journal.pone.0166923
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
MS 6q23 Immunochip associated regions.
| Region Co-ordinates (GRCh37) | Index SNP | Reported Gene | MAF | P | OR | ||
|---|---|---|---|---|---|---|---|
| Chr. | Start | End | |||||
| 6 | 134946991 | 135498875 | rs11154801 | 0.37 | 1.80 x 10−20 | 1.12 | |
| 6 | 136685539 | 136875216 | rs17066096 | 0.23 | 1.60 x 10−23 | 1.14 | |
| 6 | 137231416 | 137660037 | rs7769192 | 0.45 | 3.30 x 10−09 | 1.08 | |
| rs67297943 | 0.22 | 5.50 x 10−13 | 1.11 | ||||
All P values are from the joint analysis by Beecham et al.[6]. Chr., chromosome; MAF, minor allele frequency; OR, odds ratio.
a, P values and ORs shown after conditioning on rs67297943.
Associated SNP genotypes for GM12878 (B) and Jurkat (T) cell lines.
| SNP | Risk Allele | GM12878 | Jurkat | ||
|---|---|---|---|---|---|
| Genotype | Number of risk alleles | Genotype | Number of risk alleles | ||
| rs11154801 | A | CA | 1 | CC | 0 |
| rs17066096 | G | AA | 0 | AG | 1 |
| rs7769192 | A | AA | 2 | unknown | |
| rs67297943 | C | TC | 1 | unknown | |
*Neither directly genotyped nor suitable proxies available on array
Refined MS SNPs based on bioinformatics and Capture Hi-C data.
| LD SNP | Index SNP | r2 | RegulomeDB Score | Promoter histone marks | Enhancer histone marks | DNAse | Proteins bound | Motifs changed | GRASP QTL hits | Selected eQTL hits | GENCODE genes | Gene Interactions |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs11154801 | rs11154801 | 1 | 1d | BLD | BLD | BLD | Nkx2 | 20 hits | 46 hits | AHI1 | ||
| rs7759971 | rs11154801 | 0.99 | 1f | BLD | BLD, STRM, SKIN | 1 hit | 46 hits | AHI1 | ||||
| rs13197384 | rs11154801 | 0.92 | 1a | 24 tissues | 53 tissues | 42 bound proteins | 13 altered motifs | 1 hit | 44 hits | AHI1 | ||
| rs7750586 | rs11154801 | 0.98 | 1f | GI, BLD | 10 tissues | 22 tissues | CTCF, RAD21, SMC3 | 2 hits | 53 hits | LINC00271 | ||
| rs9399148 | rs11154801 | 0.95 | 1f | 8 tissues | LNG, GI, BLD | KAP1 | GR | 2 hits | 52 hits | LINC00271 | ||
| rs5880258 | rs11154801 | 0.95 | 4 tissues | MUS, MUS | Pbx-1 | 21 hits | LINC00271 | |||||
| rs1322553 | rs17066096 | 0.96 | 5 | BLD | IPSC, BLD | BLD | 4 altered motifs | 44kb 3' of | ||||
| rs17066063 | rs17066096 | 0.96 | 3a | BLD | BRST, BLD, SKIN | 7 tissues | 6 bound proteins | Pou2f2, TCF4 | 1 hit | 41kb 3' of | ||
| rs12214014 | rs17066096 | 0.97 | 5 | BLD | GR | 36kb 3' of | ||||||
| rs9321623 | rs7769192 | 0.97 | 4 | FAT, SKIN, MUS | 15 tissues | 7 tissues | Rad21 | 36kb 5' of RP11-95M15.1 | ||||
| rs7769192 | rs7769192 | 1 | 4 | 6 tissues | 18 tissues | CTCF | HNF1, Irf, Pax-6 | 32kb 5' of RP11-95M15.1 | ||||
| rs11758213 | rs7769192 | 1 | 2a | GI, BLD, LIV | 39 tissues | CTCF, RAD21, SMC3 | 4 altered motifs | 12kb 5' of RP11-95M15.1 | ||||
| rs10607561 | rs7769192 | 0.93 | 3a | BLD | NFKB, SETDB1 | Foxp1, RREB-1 | 4.2kb 3' of RP11-95M15.1 |
r2 has been rounded to 2 decimal places.
*—Only protein coding genes are shown for clarity.
†rs7750586 showed strong evidence of regulatory activity but was not involved in any interactions.
‡rs11758123 only interacts with lincRNAs. BLD, blood; BRST, breast; FAT, adipose tissue; GI, smooth muscle; IPSC, induced pluripotent stem cells; LIV, liver; LNG, lung; MUS, muscle; SKIN, skin; STRM, stromal connective.