| Literature DB >> 30166421 |
Jiun-Sheng Chen1,2, Fulan Hu1,3, Subra Kugathasan4, Lynn B Jorde5, David Nix6, Ann Rutherford7, Lee Denson8, W Scott Watkins5, Sampath Prahalad4, Chad Huff9,2, Stephen L Guthery10.
Abstract
Crohn's disease is a complex genetic trait characterized by chronic relapsing intestinal inflammation. Genome wide association studies (GWAS) have identified more than 170 loci associated with the disease, accounting for ∼14% of the disease variance. We hypothesized that rare genetic variation in GWAS positional candidates also contribute to disease pathogenesis. We performed targeted, massively-parallel sequencing of 101 genes in 205 children with Crohn's disease, including 179 parent-child trios and 200 controls, both of European ancestry. We used the gene burden test implemented in VAAST and estimated effect sizes using logistic regression and meta-analyses. We identified three genes with nominally significant p-values: NOD2, RTKN2, and MGAT3 Only NOD2 was significant after correcting for multiple comparisons. We identified eight novel rare variants in NOD2 that are likely disease-associated. Incorporation of rare variation and compound heterozygosity nominally increased the proportion of variance explained from 0.074 to 0.089. We estimated the population attributable risk and total heritability of variation in NOD2 to be 32.9% and 3.4%, respectively, with 3.7% and 0.25% accounted for by rare putatively functional variants. Sequencing probands (as opposed to genotyping) to identify rare variants and incorporating phase by sequencing parents can recover a portion of the missing heritability of Crohn's disease.Entities:
Keywords: Crohn’s disease; NOD2; case-control study; de novo mutation; rare variant
Mesh:
Substances:
Year: 2018 PMID: 30166421 PMCID: PMC6118318 DOI: 10.1534/g3.118.200404
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
VAAST results for genes with nominal p-value less than 0.05
| Gene | VAAST p-value | VAAST p-value after Bonferroni correction |
|---|---|---|
| 1.53x10−5 | 0.0009639 | |
| 0.00165 | 0.10395 | |
| 0.0147 | 0.9261 |
Rare coding variants identified in NOD2, RTKN2, and MGAT3
| PolyPhen2 annotation | PolyPhen2 value | SIFT annotation | SIFT value | CADD | Allele frequencyin cases | Allele frequency in ExAC and controls | p-value | |
|---|---|---|---|---|---|---|---|---|
| R702W | benign | 0.008 | deleterious | 0 | 24.1 | 10.05% | 3.502% | 2.99E-09*** |
| G908R | Probably damaging | 1 | deleterious | 0 | 31 | 4.90% | 1.407% | 2.40E-06*** |
| L1007fs | NA | NA | NA | NA | 35 | 9.07% | 2.020% | 9.27E-14*** |
| S47L | benign | 0.98 | deleterious | 0.1 | 21.3 | 0.24% | 0.12% | 0.0886 |
| E54K | probably damaging | 0.995 | deleterious | 0.01 | 33 | 0.24% | 0.003% | 0.01825 |
| D291G | probably damaging | 0.873 | deleterious | 0.01 | 23 | 0.25% | 0.004% | 0.02418 |
| R393C | probably damaging | 0.996 | deleterious | 0 | 23.7 | 0.25% | 0.004% | 0.02396 |
| R420H | benign | 0.082 | tolerated | 0.21 | 13.99 | 0.25% | 0.001% | 0.01209 |
| S431L | probably damaging | 0.166 | deleterious | 0.03 | 21.4 | 0.24% | 0.002% | 0.401 |
| P668fs | NA | NA | NA | NA | 22.8 | 0.25% | 0.002% | 0.01220 |
| R791Q | benign | 0.006 | tolerated | 0.22 | 0.074 | 0.49% | 0.004% | 0.2586 |
| R830L | benign | 0.006 | tolerated | 0.13 | 23.3 | 0.25% | 0.001% | 0.01212 |
| C842R | probably damaging | 0.646 | deleterious | 0 | 23.4 | 0.25% | 0.004% | 0.02406 |
| C973R | benign | 0.086 | tolerated | 0.36 | 16.96 | 0.25% | 0.000% | 0.00616**a |
| A328fs | NA | NA | NA | NA | 22.5 | 12.01% | 16.399% | 0.01561 |
| G573S | benign | 0.001 | tolerated | 1 | 0.002 | 0.49% | 0.106% | 0.07243 |
| D499H | benign | 0.409 | tolerated | 0.15 | 18.52 | 0.73% | 0.670% | 0.7559 |
| R525W | benign | 0.116 | deleterious | 0.01 | 22.4 | 0.25% | 0.051% | 0.194 |
| R162Q | benign | 0.004 | tolerated | 0.54 | 9.76 | 0.25% | 0.024% | 0.1319 |
P < 0.05, **P < 0.01, ***P < 0.001; aThe p-value was estimated using the total NFE sample number at the nearby variant (chr16:50759433).
Figure 1Location of risk variants in the NOD2 protein. Green dots indicate the well-established Crohn disease variants; Red dots indicate the potential risk rare variants found in the VAAST result.
VARPRISM de novo mutation results
| Transcript ID | Gene name | p-value | Number mutations | Genome Coordinate | Reference base | Alternative base |
|---|---|---|---|---|---|---|
| NM_022162 | 0.015682657 | 1 | chr16:50744999 | C | T | |
| NM_020193 | 0.017465753 | 1 | chr11:76248871 | C | G |
Logistic regression for three NOD2 established variants
| Variable | Case | Control | OR | Adjusted OR1 | Adjusted p |
|---|---|---|---|---|---|
| L1007fs (heterozygous) | 29 (14.2%) | 10 (5%) | 3.1 (1.5-6.6) | 3.1 (1.4-6.5) | 0.004** |
| R702W (heterozygous) | 29 (14.2%) | 19 (9.5%) | 1.6 (0.8-2.9) | 1.6 (0.9-3.0) | 0.14 |
| G908R (heterozygous) | 16 (7.8%) | 2 (1.0%) | 8.4 (1.9-36.9) | 7.2 (1.8-28.7) | 0.006** |
| Any homozygote | 12 (5.9%) | 2 (1.0%) | 6.2 (1.4-27.9) | 6.5 (1.6-27.1) | 0.01 |
| L1007fs (heterozygous) | 21 (10.2%) | 8 (4%) | 2.7 (1.2-6.3) | 3.4 (1.5-8.0) | 0.004** |
| G908R (heterozygous) | 11 (5.4%) | 4 (1%) | 5.6 (1.2-25.7) | 6.3 (1.5-26.4) | 0.01 |
| R702W (heterozygous) | 20 (9.8%) | 17 (8.5%) | 1.2 (0.6-2.3) | 1.6 (0.8-3.2) | 0.18 |
| Common homozygote | 12 (5.9%) | 2 (1.0%) | 6.2 (1.4-27.9) | 6.8 (1.6-28.3) | 0.008** |
| Compound heterozygote | 12 (5.9%) | 2 (1.0%) | 6.2 (1.4-27.9) | 6.8 (1.6-28.3) | 0.008** |
| Rare heterozygote | 5 (2.4%) | 0 (0%) | 11 (0.46-264)a | 14.9 (0.6-360.5) | 0.095 |
P < 0.05, **P < 0.01, ***P < 0.001; a unadjusted OR for rare heterozygote and adjusted OR for all covariates are using the method of Firth (Firth 1993)