| Literature DB >> 30849075 |
Hugues Aschard1,2, Vincent Laville1, Eric Tchetgen Tchetgen3, Dan Knights4,5,6,7, Floris Imhann8, Philippe Seksik9, Noah Zaitlen10, Mark S Silverberg11, Jacques Cosnes9,12, Rinse K Weersma8, Ramnik Xavier5,6,13, Laurent Beaugerie9,12, David Skurnik14,15,16,17, Harry Sokol9,12,18,19.
Abstract
Several bacteria in the gut microbiota have been shown to be associated with inflammatory bowel disease (IBD), and dozens of IBD genetic variants have been identified in genome-wide association studies. However, the role of the microbiota in the etiology of IBD in terms of host genetic susceptibility remains unclear. Here, we studied the association between four major genetic variants associated with an increased risk of IBD and bacterial taxa in up to 633 IBD cases. We performed systematic screening for associations, identifying and replicating associations between NOD2 variants and two taxa: the Roseburia genus and the Faecalibacterium prausnitzii species. By exploring the overall association patterns between genes and bacteria, we found that IBD risk alleles were significantly enriched for associations concordant with bacteria-IBD associations. To understand the significance of this pattern in terms of the study design and known effects from the literature, we used counterfactual principles to assess the fitness of a few parsimonious gene-bacteria-IBD causal models. Our analyses showed evidence that the disease risk of these genetic variants were likely to be partially mediated by the microbiome. We confirmed these results in extensive simulation studies and sensitivity analyses using the association between NOD2 and F. prausnitzii as a case study.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30849075 PMCID: PMC6426259 DOI: 10.1371/journal.pgen.1008018
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 3Hypothetical causal models.
Expected correlation between an IBD risk variant g and a bacteria B negatively correlated with IBD, across four potential underlying models. For illustrative purposes, g and B are assumed to be continuous and normally distributed and all effects are larger than those observed in real data. Top panels (a, b, c, and d) present the hypothetical causal diagrams and bottom panels (e, f, g, and h) present the corresponding scatterplots of B as a function of g in the population (dark grey points, and trend in black), and in cases only (light grey points and trend in red). In model a) the effect of g on IBD is mediated by B; in cases the effect of g on B is underestimated because of the oversampling of participants carrying risk alleles (e). In model b) the genetic variant influences both IBD and B, inducing a correlation between IBD and B which is observed in both the whole sample and cases only (f). In model c), g and B act independently on IBD and are therefore not associated in the population, however g and B are positively correlated in case-only samples because of biased selection (g). Finally, in model d) the effect of g on B is mediated by IBD; the indirect association between g and B observed in the general population is canceled when looking at cases only as the sample is stratified by the mediator (h).
Fig 4Correlation between SNP-taxa and IBD-taxa effect estimates.
The 168 bacterial taxa were tested for association with the variants from each of the four genes considered: (a) ATG16L1 (rs12994997), (b) CARD9 (rs10781499), (c) LRRK2 (rs11564258), and (d) NOD2 (rs2066844, rs2066845, and rs2066847). The histograms on the left panel show the distribution of IBD risk alleles-bacteria association (i.e. of , the regression coefficients) and the enrichment for negative effects (in blue, p-values equal 7.1x10-5, 3.3x10-8, 1.3x10-8, and 5.3x10-3, respectively). Panel (e) shows a similar histogram while merging the per-risk allele change in bacteria level of the four genes (i.e. summing for each bacteria the of the four genes). Panel (f) shows the distribution of bacteria-IBD association derived in a IBD cases-controls dataset () for each bin from panel (e). Together, panels (e) and (f) show the strong concordance of the gene-bacteria and bacteria-IBD effects, in agreement with a mediation effect of the risk allele on IBD through the microbiome. In particular, bacteria displaying lower level in carrier of IBD risk alleles are more likely to be negatively associated with the risk of IBD.
Sample characteristics.
| Female | Male | Total | |||
|---|---|---|---|---|---|
| CD | UC | CD | UC | ||
| 67 | 38 | 48 | 29 | 182 | |
| 40.3% | 55.3% | 50.0% | 41.4% | 46.2% | |
| 40.3 | 38.4 | 39.9 | 45.5 | 40.6 | |
| 25.4% | 7.9% | 31.3% | 13.8% | 21.4% | |
| 20.9% | 55.3% | 35.4% | 62.1% | 38.5% | |
| 11.9% | 23.7% | 16.7% | 31.0% | 18.7% | |
| 56.7% | 44.7% | 54.2% | 31.0% | 49.5% | |
| 43.3% | 31.6% | 29.2% | 51.7% | 38.5% | |
Abbreviation: MTX, methotrexate.
Bacteria-genetic variant associations.
| Outcome | Gene | Disease | Discovery | Replication | Meta-analysis | |||
|---|---|---|---|---|---|---|---|---|
| β | β | β | ||||||
| IBD | -0.38 | -0.08 | -0.25 | |||||
| CD | -0.43 | -0.05 | -0.30 | |||||
| CDil | -0.44 | -0.08 | -0.31 | |||||
| CDni | -0.46 | 0.85 | 0.12 | |||||
| UC | -0.25 | 0.07 | -0.10 | |||||
| IBD | -0.62 | -0.05 | -0.20 | |||||
| CD | -0.57 | -0.17 | -0.30 | |||||
| CDil | -0.65 | -0.16 | -0.31 | |||||
| CDni | - | -0.42 | -0.42 | |||||
| UC | -0.50 | 0.24 | 0.11 | |||||
| IBD | -0.58 | -0.22 | -0.36 | |||||
| CD | -0.46 | -0.26 | -0.35 | |||||
| CDil | -0.30 | -0.34 | -0.32 | |||||
| CDni | - | -0.27 | -0.27 | |||||
| UC | -0.76 | 0.14 | -0.25 | |||||
| IBD | -0.56 | -0.30 | -0.40 | |||||
| CD | -0.48 | -0.20 | -0.32 | |||||
| CDil | -0.51 | -0.31 | -0.39 | |||||
| CDni | - | -0.08 | -0.08 | |||||
| UC | -0.32 | -0.14 | -0.22 | |||||
Abbreviation: CD, Crohn’s disease; UC, ulcerative colitis; CDil, CD ileal; CDni, CD non-ileal
*Beta coefficients were derived when using the allele associated with an increased risk of IBD as the coded allele. Outcomes were normalized. All outcomes had a variance of 1, whereas the genetic variants were not transformed; thus, the beta coefficients corresponded to the estimated change in the outcome mean per additional risk allele.