| Literature DB >> 25671699 |
Natalie J Prescott1, Benjamin Lehne2, Kristina Stone1, James C Lee3, Kirstin Taylor1, Jo Knight4, Efterpi Papouli5, Muddassar M Mirza5, Michael A Simpson1, Sarah L Spain1, Grace Lu6, Franca Fraternali6, Suzannah J Bumpstead7, Emma Gray7, Ariella Amar1, Hannah Bye1, Peter Green1, Guy Chung-Faye8, Bu'Hussain Hayee8, Richard Pollok9, Jack Satsangi10, Miles Parkes3, Jeffrey C Barrett7, John C Mansfield11, Jeremy Sanderson12, Cathryn M Lewis1, Michael E Weale1, Thomas Schlitt1, Christopher G Mathew13.
Abstract
The contribution of rare coding sequence variants to genetic susceptibility in complex disorders is an important but unresolved question. Most studies thus far have investigated a limited number of genes from regions which contain common disease associated variants. Here we investigate this in inflammatory bowel disease by sequencing the exons and proximal promoters of 531 genes selected from both genome-wide association studies and pathway analysis in pooled DNA panels from 474 cases of Crohn's disease and 480 controls. 80 variants with evidence of association in the sequencing experiment or with potential functional significance were selected for follow up genotyping in 6,507 IBD cases and 3,064 population controls. The top 5 disease associated variants were genotyped in an extension panel of 3,662 IBD cases and 3,639 controls, and tested for association in a combined analysis of 10,147 IBD cases and 7,008 controls. A rare coding variant p.G454C in the BTNL2 gene within the major histocompatibility complex was significantly associated with increased risk for IBD (p = 9.65x10-10, OR = 2.3[95% CI = 1.75-3.04]), but was independent of the known common associated CD and UC variants at this locus. Rare (<1%) and low frequency (1-5%) variants in 3 additional genes showed suggestive association (p<0.005) with either an increased risk (ARIH2 c.338-6C>T) or decreased risk (IL12B p.V298F, and NICN p.H191R) of IBD. These results provide additional insights into the involvement of the inhibition of T cell activation in the development of both sub-phenotypes of inflammatory bowel disease. We suggest that although rare coding variants may make a modest overall contribution to complex disease susceptibility, they can inform our understanding of the molecular pathways that contribute to pathogenesis.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25671699 PMCID: PMC4335459 DOI: 10.1371/journal.pgen.1004955
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 6.020
Figure 1Summary of strategy for detecting rare variants associated with IBD.
Overview of our rare variant screening strategy in IBD using DNA pools. We detected 3442 high quality variants in phase I based on stringent filtering criteria. We were able to validate 1252 of these variants using a) previously generated genotyping data for 153 SNPs in 634 of the individuals who were sequenced; b) case-control association p values for 1099 SNPs from CD Immunochip study [4]. We then performed validation genotyping of 80 variants in phase II in 6335 IBD cases and 2974 controls and extended the analysis of the top 5 SNVs to a further 3662 IBD cases and 3639 controls (phase III) to allow a final combined analysis of 10,147 IBD cases and 7,008 controls.
Candidate gene selection strategies.
| Gene selection criteria | Count |
|---|---|
| Genes in Crohn’s disease GWAS hit regions | 75 |
| Genes in GWAS hits from other auto-immune diseases | 50 |
| Genes identified by pathway analysis | 74 |
| Genes identified by literature search | 214 |
| Genes identified by network analyses | 300 |
| Total Genes on array | 531 |
| Total exons | 6290 |
| Total base pairs | 1,569,003 |
Selection criteria are non-exclusive so that many genes were selected by more than one criterion.
Figure 2Minor allele frequencies WTCCC vs pooled NGS (24 pools combined).
MAFs for 153 SNPs were compared between allele frequency estimates based on pooled NGS and genotyping data from the WTCCC [47] for 634 individuals. MAFs are strongly correlated between both datasets (Spearman rank correlation coefficient R = 0.976), with only two SNPs showing substantial differences.
Figure 3Quantile-quantile plot of chi-squared statistic.
Data for case-control comparison of allele frequencies of 3442 variants, detected in pooled sequencing experiment for 42 case and 40 control pools. Overall genomic inflation (lambda) of 1.3 was observed.A genomic control correction was therefore applied for downstream association analysis (Materials and Methods).
Combined case-control association analysis of 5 sequence variants from the phase III extension study in 10,147 IBD cases and 7,008 controls from phases I–III.
| Gene, variant | dbSNP | Chr:bp | Control | Crohn’s disease | Ulcerative colitis | Inflammatory bowel disease | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MAF | MAF | P | OR (95%CI) | MAF | P | OR (95%CI) | MAF | P | OR (95%CI) | |||
|
| rs200140527 | 3:49004552 | 0.09% | 0.22% | 0.01086 | 2.34 | 0.22% | 0.01504 | 2.32 | 0.22% | 0.004407 | 2.39 |
| c.338-6C>T | (1.19–4.61) | (1.15–4.67) | (1.29–4.42) | |||||||||
|
| rs61729946 | 3:49462458 | 1.30% | 0.90% | 0.003094 | 0.69 | 0.99% | 0.03548 | 0.76 | 0.94% | 0.001868 | 0.72 |
| p.H191R | (0.54–0.88) | (0.59–0.98) | (0.59–0.89) | |||||||||
|
| rs3213119 | 5:158743788 | 3.24% | 2.54% | 0.001242 | 0.78 | 2.80% | 0.06153 | 0.86 | 2.66% | 0.001825 | 0.82 |
| p.V298F | (0.67–0.91) | (0.74–1.01) | (0.72–0.93) | |||||||||
|
| rs28362675 | 6:32362521 | 0.47% | 0.90% | 3.63x10−05 | 1.91 | 1.31% | 3.48x10−12 | 2.79 | 1.08% | 9.65x10−10 | 2.31 |
| p.G454C | (1.40–2.61) | (2.06–3.76) | (1.75–3.04) | |||||||||
|
| rs41444548 | 17:45811354 | 7.70% | 7.19% | 0.1279 | 0.93 | 6.97% | 0.03644 | 0.9 | 7.09% | 0.03358 | 0.91 |
| c.491+43C>G | (0.84–1.02) | (0.81–0.99) | (0.84–0.99) | |||||||||