| Literature DB >> 20106866 |
Natalie J Prescott1, Katherine M Dominy, Michiaki Kubo, Cathryn M Lewis, Sheila A Fisher, Richard Redon, Ni Huang, Barbara E Stranger, Katarzyna Blaszczyk, Barry Hudspith, Gareth Parkes, Naoya Hosono, Keiko Yamazaki, Clive M Onnie, Alastair Forbes, Emmanouil T Dermitzakis, Yusuke Nakamura, John C Mansfield, Jeremy Sanderson, Matthew E Hurles, Roland G Roberts, Christopher G Mathew.
Abstract
DNA polymorphisms in a region on chromosome 5q33.1 which contains two genes, immunity related GTPase related family, M (IRGM) and zinc finger protein 300 (ZNF300), are associated with Crohn's disease (CD). The deleted allele of a 20 kb copy number variation (CNV) upstream of IRGM was recently shown to be in strong linkage disequilibrium (LD) with the CD-associated single nucleotide polymorphisms and is itself associated with CD (P < 0.01). The deletion was correlated with increased or reduced expression of IRGM in transformed cells in a cell line-dependent manner, and has been proposed as a likely causal variant. We report here that small insertion/deletion polymorphisms in the promoter and 5' untranslated region of IRGM are, together with the CNV, strongly associated with CD (P = 1.37 x 10(-5) to 1.40 x 10(-9)), and that the CNV and the 5'-untranslated region variant -308(GTTT)(5) contribute independently to CD susceptibility (P = 2.6 x 10(-7) and P = 2 x 10(-5), respectively). We also show that the CD risk haplotype is associated with a significant decrease in IRGM expression (P < 10(-12)) in untransformed lymphocytes from CD patients. Further analysis of these variants in a Japanese CD case-control sample and of IRGM expression in HapMap populations revealed that neither the IRGM insertion/deletion polymorphisms nor the CNV was associated with CD or with altered IRGM expression in the Asian population. This suggests that the involvement of the IRGM risk haplotype in the pathogenesis of CD requires gene-gene or gene-environment interactions which are absent in Asian populations, or that none of the variants analysed are causal, and that the true causal variants arose after the European-Asian split.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20106866 PMCID: PMC2850616 DOI: 10.1093/hmg/ddq041
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1.Association of 35 SNPs at the IRGM-ZNF300 locus with CD (data from Ref. 16) indicated by –log of P-value. The physical location of the SNPs on chromosome 5 in Mbp is given on the x-axis with the relative position of the genes and CNV underneath.
Figure 2.The structure of IRGM showing the location of the upstream CNV on chromosome 5. Horizontal arrows represent the relative positions of the three primers for the CNV PCR assay which uses the common left primer (black), and either the insertion right primer (grey) or deletion right (black). The position of the two risk SNPs are indicated by vertical arrows. The IRGM region consisting of the single coding exon 2 and upstream un-translated exon 1 containing the promoter (P) and U3/U5 repeats of the ERV9LTR is expanded above to indicate the relative positions of the two insertion/deletion polymorphisms (boxed). Potential TF binding sites adjacent to the −1644ins are underlined. The coding sequence (CDS) of IRGM is shaded in dark grey.
Association of IRGM variants with Crohn's disease analysed in 1848 CD cases and 2025 population controls
| Variant | Risk allele | Risk allele frequency | OR (95% CI) | |||
|---|---|---|---|---|---|---|
| Cases | Controls | |||||
| CNV | Del | 0.115 | 0.073 | 1.40 × 10−9 | 1.9 × 10−9 | 1.66 (1.41–1.95) |
| rs13361189 | C | 0.115 | 0.075 | 3.73 × 10−9 | 5.10 × 10−9 | 1.65 (1.40–1.95) |
| −1644 | Ins | 0.108 | 0.073 | 4.20 × 10−7 | 4.20 × 10−7 | 1.53 (1.38–1.80) |
| −308 | (GTTT)5 | 0.149 | 0.111 | 1.37 × 10−5 | 2.77 × 10−6 | 1.37 (1.19–1.58) |
| (GTTT)4 | 0.001 | 0.000 | 0.0048a | n/a | n/ab | |
| rs10065172 | T | 0.107 | 0.071 | 1.04 × 10−7 | 1.01 × 10−7 | 1.56 (1.33–1.85) |
Results show P-values for allele specific association tests and the genotype trend test which can also account for multi-allelic genotypes.
aFishers exact test.
bNot possible due to absence in controls.
Figure 3.Linkage disequilibrium of IRGM variants in UK CD cases and controls.
Conditional logistic regression and haplotype analysis of IRGM variants in CD cases/controls, all haplotypes
| Conditional locus | ||||||
|---|---|---|---|---|---|---|
| CNV | rs13361189 | −1644 | −308 | rs10065172 | ||
| Test locus | CNV | 1.0 | 0.221 | 1.6 × 10−5 | 2.6 × 10−7 | 0.251 |
| rs13361189 | 0.361 | 1.0 | 0.005 | 3.88 × 10−6 | 0.388 | |
| −1644 | 3.1 × 10−4 | 0.142 | 1.0 | 9.3 × 10−6 | 0.047 | |
| −308 | 9.4 × 10−6 | 2.38 × 10−4 | 5.8 × 10−5 | 1.0 | 1.50 × 10−5 | |
| rs10065172 | 0.392 | 0.49 | 1.58 × 10−3 | 1.12 × 10−7 | 1.0 | |
This is a test of multiple haplotypes (see Materials and Methods).
Haplotypes analysis (all haplotypes)
| Haplotype | CNV | rs13361189 | −1644 | −308 | rs10065172 | Counta | Frequency | OR(95%CI) | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Case | Control | Case | Control | ||||||||
| 1 | INS | T | DEL | (GTTT)2 | C | 1668 | 2762 | 0.831 | 0.884 | ref | ref |
| 2 | 191.5 | 207.8 | 0.095 | 0.066 | 1.53 (1.24–1.88) | 6.18 × 10−5 | |||||
| 3 | INS | T | DEL | C | 104.5 | 130.3 | 0.052 | 0.042 | 1.33 (1.02–1.73) | 0.038 | |
| 4 | INS | DEL | (GTTT)2 | C | 6.003 | 6.006 | 0.003 | 0.002 | 1.66 (0.53–5.14) | 0.386 | |
| 5 | (GTTT)2 | 28.46 | 5.214 | 0.014 | 0.002 | 9.04 (3.47–23.53) | 4.76 × 10−8 | ||||
| 6 | C | 0 | 4.906 | 0 | 0.002 | n/a | 0.032 | ||||
| 7 | T | 1.003 | 3.006 | 0.0005 | 0.001 | 0.55 (0.06–5.32) | 0.591 | ||||
| 8 | INS | T | DEL | (GTTT)2 | 0 | 3.005 | 0 | 0.001 | n/a | 0.092 | |
| 9 | (GTTT)2 | C | 0 | 2.099 | 0 | 0.0007 | n/a | 0.161 | |||
| 10 | T | DEL | (GTTT)2 | C | 3 | 0 | 0.0015 | 0 | n/ab | 5.20 × 10−3 | |
| 11 | DEL | 4 | 0 | 0.002 | 0 | n/ab | 1.55 × 10−2 | ||||
Potential risk alleles are italicized.
aNon-integer haplotype counts are due to maximum likelihood estimation.
bOR over-inflated due to lack of controls with this haplotype (OR ≫ 4 × 106).
Conditional logistic regression and haplotype analysis of IRGM variants in CD cases/controls but excluding rare haplotypes (f < 0.005)
| Conditional locus | ||||||
|---|---|---|---|---|---|---|
| CNV | rs1331189 | −1644 | −308 | rs10065172 | ||
| Test locus | CNV | 1 | 1 | 1 | 2.6 × 10−7 | 1 |
| rs13361189 | 1 | 1 | 1 | 3.88 × 10−6 | 1 | |
| −1644 | 1 | 1 | 1 | 9.3 × 10−6 | 1 | |
| −308 | 2.0 × 10−5 | 9.76 × 10−4 | 3.9 × 10−4 | 1 | 4.24 × 10−5 | |
| rs10065172 | 1 | 1 | 1 | 1.12 × 10−7 | 1 | |
This is a test of multiple haplotypes (see Materials and Methods).
Figure 4.Frequency of IRGM variants in five HapMap populations from Asia and Africa (YRI 120, MKK 145, LWK 145, JPT 88, CHB 88) and 2025 white population controls from the 1958 MRC British birth cohort (BC1958).
Association analysis of IRGM variants in 484 Japanese CD cases and 933 Japanese controls
| Variant | Risk allele | Risk allele frequency | OR (95%CI) | ||
|---|---|---|---|---|---|
| Cases | Controls | ||||
| CNV | Del | 0.370 | 0.380 | 0.76 | 0.98 (0.83–1.14) |
| −1644 | Ins | 0.370 | 0.380 | 0.62 | 0.95 (0.81–1.12) |
| −308 | (GTTT)5a | 0.510 | 0.553 | 0.03 | 0.84 (0.72–0.98) |
| rs13361189 | C | 0.393 | 0.379 | 0.43 | 1.07 (0.91–1.26) |
| rs10065172 | T | 0.366 | 0.379 | 0.52 | 0.94 (0.81–1.11) |
aNo (GTTT)4 alleles were observed in this population.
Figure 5.Quantitative analysis of IRGM expression by real-time PCR in lymphocytes from CD patients of defined genotype for CNV (2, 1 and 0 copies of the CNV where 1 and 0 are the heterozygous and homozygous risk genotypes, respectively) −1644 (D = del allele, I = risk insertion allele) and −308 (where 2× GTTT repeats represent the non-risk allele and 5× GTTT repeats represent the risk allele. Relative expression is measured by, and inversely proportional to, mean ΔCt (see Materials and Methods).
Quantitative trait analysis of IRGM risk variants and expression in HapMap populations
| Locus | All African | YRI | MKK | LWK | CHB | JPT |
|---|---|---|---|---|---|---|
| CNV | 8.15 × 10−6 | 5.41 × 10−5 | 0.240 | 0.010 | 0.307 | 0.264 |
| rs13361189 | 3.74 × 10−7 | 2.29 × 10−6 | 0.033 | 0.016 | 0.797 | 0.349 |
| −1644 | 0.024 | 0.087 | 0.461 | 0.094 | 0.371 | 0.294 |
| −308 | 5.66 × 10−5 | 4.38 × 10−6 | 0.257 | 0.040 | 0.262 | 0.578 |
Figure 6.Relative expression of IRGM in HapMap populations from Asia (JPT & CHB), Africa (YRI, LWK & MKK) and Europe (CEU) by microarray analysis (26).