| Literature DB >> 29593342 |
Subra Kugathasan1, Michael E Zwick2, Kelly A Shaw3, David J Cutler3, David Okou1, Anne Dodd1, Bruce J Aronow4, Yael Haberman5, Christine Stevens6, Thomas D Walters7, Anne Griffiths7, Robert N Baldassano8, Joshua D Noe9, Jeffrey S Hyams10, Wallace V Crandall11, Barbara S Kirschner12, Melvin B Heyman13, Scott Snapper14, Stephen Guthery15, Marla C Dubinsky16, Jason M Shapiro17, Anthony R Otley18, Mark Daly6, Lee A Denson5.
Abstract
In the United States, approximately 5% of individuals with inflammatory bowel disease (IBD) are younger than 20 years old. Studies of pediatric cohorts can provide unique insights into genetic architecture of IBD, which includes Crohn's disease (CD) and ulcerative colitis (UC). Large genome-wide association studies have found more than 200 IBD-associated loci but explain a minority of disease variance for CD and UC. We sought to characterize the contribution of rare variants to disease development, comparing exome sequencing of 368 pediatric IBD patients to publicly available exome sequencing (dbGaP) and aggregate frequency data (ExAC). Using dbGaP data, we performed logistic regression for common variants and optimal unified association tests (SKAT-O) for rare, likely-deleterious variants. We further compared rare variants to ExAC counts with Fisher's exact tests. We did pathway enrichment analysis on the most significant genes from each comparison. Many variants overlapped with known IBD-associated genes (e.g. NOD2). Rare variants were enriched in CD-associated loci (p = 0.009) and showed suggestive enrichment in neutrophil function genes (p = 0.05). Pathway enrichment implicated immune-related pathways, especially cell killing and apoptosis. Variants in extracellular matrix genes also emerged as an important theme in our analysis.Entities:
Mesh:
Year: 2018 PMID: 29593342 PMCID: PMC6162182 DOI: 10.1038/s41435-018-0015-2
Source DB: PubMed Journal: Genes Immun ISSN: 1466-4879 Impact factor: 2.676
Clinical and demographic characteristics of samples with exome sequencing data used in analysis
| IBD cases of European ancestry | ARRA controls of European ancestry | Epi4k controls of European ancestry | ||
|---|---|---|---|---|
| Age at participation | ||||
| Range | 0–17 | 18–84 | Ages not provided, but controls were parents of children with epilepsy | |
| Median | 8 | 51 | ||
| Mean | 7.3 | 52 | ||
| Gender | ||||
| Female | 152 (41%) | 118 (56%) | 223 (53%) | |
| Male | 216 (59%) | 91 (44%) | 199 (47%) | |
| Diagnosis | ||||
| CD | 281 (76%) | — | — | |
| UC | 61 (17%) | — | — | |
| IBD-other | 26 (7%) | — | — |
Dashes indicate not applicable
Fig. 1Manhattan plot of p-values from logistic regression (with significant principal components and sex as covariates) comparing frequency of exome sequencing common variants in pediatric IBD cases to controls from dbGaP
Top 20 most significant loci found in our common variant logistic regression
| Chrom | Position | ID | Alt | Type | OR | Gene | Assoc. Diagnosis, Study | Neut. gene list | |
|---|---|---|---|---|---|---|---|---|---|
| chr17 | 41227070 | chr17_41227070 | A | DEL | 0.1111 | KRTAP9-2 | 6.21E-06 | — | — |
| chr16 | 50711288 | rs2066843 | T | SNP | 1.57 | NOD2 | 9.60E-06 | CD, Jostins | Yes |
| chr16 | 50710713 | rs2066842 | T | SNP | 1.569 | NOD2 | 1.02E-05 | CD, Jostins | Yes |
| chr9 | 136371953 | rs10781499 | A | SNP | 1.524 | CARD9 | 2.32E-05 | IBD, Jostins | Yes |
| chr19 | 35488794 | rs10410228 | T | SNP | 1.653 | KRTDAP | 2.59E-05 |
| — |
| chr20 | 62346665 | rs6143036 | A | SNP | 1.607 | LAMA5 | 3.27E-05 | — | — |
| chr1 | 119895261 | rs2641348 | G | SNP | 0.4588 | ADAM30 | 3.37E-05 | CD, Jostins | — |
| chr1 | 119915381 | rs6685892 | T | SNP | 0.4627 | NOTCH2 | 4.14E-05 | CD, Jostins | — |
| chr9 | 136372044 | rs4077515 | T | SNP | 1.499 | CARD9 | 4.49E-05 | IBD, Jostins | Yes |
| chr16 | 50675812 | rs6596 | A | SNP | 1.613 | SNX20 | 4.97E-05 | CD, Jostins | — |
| chr9 | 136395373 | rs4266763 | G | SNP | 1.496 | SNAPC4 | 5.02E-05 | IBD, Jostins | — |
| chr9 | 136380752 | rs3812570 | C | SNP | 1.49 | SNAPC4 | 6.04E-05 | IBD, Jostins | — |
| chr9 | 136380842 | rs3812571 | C | SNP | 1.486 | SNAPC4 | 6.77E-05 | IBD, Jostins | — |
| chr9 | 136384721 | rs10781510 | A | SNP | 1.484 | SNAPC4 | 7.74E-05 | IBD, Jostins | — |
| chr9 | 136404141 | rs1051957 | G | SNP | 1.451 | SDCCAG3 | 0.00016 | IBD, Jostins | — |
| chr9 | 136477334 | rs6560632 | C | SNP | 1.427 | SEC16A | 0.00025 | IBD, Jostins | — |
| chr9 | 136432987 | rs10781542 | G | SNP | 1.423 | INPP5E | 0.00029 | IBD, Jostins | — |
| chr21 | 46246830 | rs17183220 | T | SNP | 0.4371 | MCM3AP-AS1 | 0.00029 | — | — |
| chr5 | 78885600 | rs1071598 | T | SNP | 1.558 | ARSB | 0.00046 | — | — |
| chr12 | 128899303 | chr12_128899303 | G | DEL | 0.4497 | GLT1D1 | 0.00047 | — | — |
Dashes indicate not applicable or no.
Significantly enriched pathways in the top 200 most significant genes in our common variant (dbGaP) analysis
| GO ID | GO term | % pathway covered | Corrected | Associated genes found |
|---|---|---|---|---|
| GO:2000427 | Positive regulation of apoptotic cell clearance | 33 | 9E-05 | [C2, C3, CCL2] |
| GO:0001910 | Regulation of leukocyte-mediated cytotoxicity | 9.4 | 2E-04 | [CCL2, HLA-A, LILRB1, RASGRP1, SERPINB4] |
| GO:0002699 | Positive regulation of immune effector process | 4.5 | 5E-04 | [C3, CCL2, GPI, HLA-A, IL2, LILRB1, NOD2, RASGRP1] |
| GO:0002703 | Regulation of leukocyte-mediated immunity | 4.4 | 0.001 | [C3, HLA-A, IL2, LILRB1, NOD2, RASGRP1, SERPINB4] |
| GO:0055001 | Muscle cell development | 4.3 | 0.002 | [ANK2, FHOD3, GPX1, IGSF22, MYPN, RYR1, XK] |
| GO:0001578 | Microtubule bundle formation | 5.8 | 0.002 | [CCDC40, DNAH5, MAP1B, RP1L1, SPAG16] |
| GO:0002705 | Positive regulation of leukocyte-mediated immunity | 5.5 | 0.003 | [C3, HLA-A, IL2, NOD2, RASGRP1] |
| GO:0048747 | Muscle fiber development | 7.1 | 0.003 | [GPX1, MYPN, RYR1, XK] |
| GO:0014032 | Neural crest cell development | 6.9 | 0.003 | [ERBB4, JAG1, LAMA5, RET] |
| GO:0010927 | Cellular component assembly involved in morphogenesis | 5.2 | 0.003 | [ANK2, DAG1, FHOD3, IGSF22, MYPN] |
| GO:0046487 | Glyoxylate metabolic process | 9.7 | 0.004 | [AMT, LDHD, LIAS] |
| GO:0006081 | Cellular aldehyde metabolic process | 4.8 | 0.005 | [AMT, GPI, H6PD, LDHD, LIAS] |
| GO:0032680 | Regulation of tumor necrosis factor production | 4.7 | 0.005 | [CARD9, CCL2, LILRB1, NOD2, RASGRP1] |
| GO:0010837 | Regulation of keratinocyte proliferation | 8.8 | 0.005 | [EPPK1, STXBP4, TGM1] |
| GO:0001912 | Positive regulation of leukocyte-mediated cytotoxicity | 8.6 | 0.006 | [CCL2, HLA-A, RASGRP1] |
| GO:0030449 | Regulation of complement activation | 8.1 | 0.007 | [C2, C3, CFB] |
| GO:0002700 | Regulation of production of molecular mediator of immune response | 4.1 | 0.009 | [GPI, HLA-A, IL2, LILRB1, NOD2] |
| GO:0045214 | Sarcomere organization | 6.8 | 0.01 | [FHOD3, IGSF22, MYPN] |
| GO:0072676 | Lymphocyte migration | 4.8 | 0.01 | [AIRE, CCL2, GCSAML, RET] |
| GO:2000106 | Regulation of leukocyte apoptotic process | 4.3 | 0.02 | [IL2, LILRB1, NOD2, TP53BP1] |
| GO:0032649 | Regulation of interferon-gamma production | 4.2 | 0.02 | [HLA-A, IL2, LILRB1, RASGRP1] |
| GO:0046579 | Positive regulation of Ras protein signal transduction | 5.6 | 0.02 | [ARRB1, NOTCH2, RASGRP1] |
| GO:0002718 | Regulation of cytokine production involved in immune response | 4.2 | 0.04 | [HLA-A, LILRB1, NOD2] |
Fig. 2Pathway enrichment of the genes annotated to the top 200 most significant common genes tested in our logistic regression
Top 15 results from SKAT-O analysis of enrichment of rare, likely-pathogenic (CADD > 10) variants in genes with five or more variants
| SetID | Number of variants included in gene | |
|---|---|---|
| NOD2 | 8.4E-12 | 15 |
| VWA2 | 0.0006 | 7 |
| HAPLN3 | 0.0008 | 5 |
| LMF1 | 0.002 | 5 |
| SOS1 | 0.002 | 5 |
| MAGI2 | 0.002 | 7 |
| SRRM2 | 0.002 | 13 |
| RGS12 | 0.003 | 10 |
| SCAF4 | 0.003 | 5 |
| STARD13 | 0.004 | 8 |
| RHPN2 | 0.005 | 6 |
| D2HGDH | 0.005 | 6 |
| G6PC2 | 0.005 | 6 |
| NR4A1 | 0.005 | 5 |
| EFEMP2 | 0.006 | 5 |
SKAT-O analysis for enrichment of rare variants with CADD scores >10 in loci associated with Crohn’s disease (CD), inflammatory bowel disease (IBD), or ulcerative colitis (UC)
| SetID | Number of variants included in SetID | |
|---|---|---|
| CD | 0.009 | 522 |
| IBD | 0.9 | 1849 |
| UC | 0.7 | 445 |
SKAT-O analysis for enrichment of rare, conserved variants in neutrophil function genes (NEUT)
| SetID | Number of variants included in SetID | |
|---|---|---|
| NEUT | 0.05 | 413 |
Fig. 3Manhattan plot of p-values from comparing frequency of exome sequencing rare variants in pediatric IBD cases to ExAC after filtering out sites most significantly different between ExAC and our control data set
Top 20 most significant sites in our rare variant Fisher’s exact tests
| Chrom | Position | ID | Alt | Type | OR | Gene | Assoc Diagnosis, Study | Neut. gene list | |
|---|---|---|---|---|---|---|---|---|---|
| chr16 | 50729867 | rs796661546 | GC | INS | 4.42151 | NOD2 | 9.19E-12 | CD, Jostins | Yes |
| chr8 | 100712766 | chr8_100712766 | CA | INS | 33.764 | PABPC1 | 9.19E-12 | — | — |
| chr11 | 294540 | chr11_294540 | GC | INS | 122.892 | ATHL1 | 9.19E-12 | — | — |
| chr13 | 24447181 | chr13_24447181 | T | DEL | 700.496 | PARP4 | 8.90E-09 | — | — |
| chr9 | 101390469 | chr9_101390469 | GTA | INS | 172.978 | MRPL50 | 9.39E-08 | — | — |
| chr1 | 248273809 | chr1_248273809 | C | DEL | 279.84 | OR2T33 | 5.58E-07 | — | — |
| chr21 | 44573789 | rs9977039 | G | SNP | 5.75281 | TSPEAR | 7.43E-07 | — | — |
| chr10 | 29462394 | chr10_29462394 | AT | INS | Inf | SVIL-AS1 | 1.59E-06 | — | — |
| chr16 | 50722629 | rs2066845 | C | MULTIALLELIC | 3.4408 | NOD2 | 4.52E-06 | CD, Jostins | Yes |
| chr4 | 56964497 | rs17087307 | C | SNP | 0.34222 | NOA1 | 5.34E-06 | — | — |
| chr7 | 72713798 | rs146095374 | A | SNP | 0.25988 | TYW1B | 7.50E-06 | — | — |
| chr5 | 140822334 | rs61730632 | A | SNP | 2.79249 | PCDHA1 | 1.03E-05 | — | — |
| chr14 | 73953419 | rs778985097 | AT | INS | 10.2459 | COQ6 | 1.50E-05 | — | — |
| chr5 | 140875534 | rs114654172 | G | SNP | 2.70029 | PCDHA1 | 2.28E-05 | — | — |
| chr11 | 5544676 | rs7934354 | G | SNP | 0.17691 | OR52H1 | 2.76E-05 | — | — |
| chr6 | 31960262 | rs11541400 | G | SNP | 5.1923 | SKIV2L | 4.06E-05 | — | — |
| chr6 | 31728544 | rs139006870 | A | SNP | 5.17682 | DDAH2 | 4.17E-05 | — | — |
| chr15 | 49588022 | chr15_49588022 | CT | INS | Inf | FAM227B | 4.47E-05 | — | — |
| chr3 | 51995472 | rs371570896 | A | SNP | 77.1658 | RPL29 | 6.15E-05 | — | — |
| chr2 | 241767780 | rs143940595 | A | SNP | 0 | D2HGDH | 8.10E-05 | CD, Liu | — |
Dashes indicate not applicable or no
Significantly enriched pathways using the list of the top 200 most significant genes in our ExAC rare variant analysis
| GO ID | GO Term | % Pathway covered | Corrected | Associated genes found |
|---|---|---|---|---|
| GO:0043921 | Modulation by host of viral transcription | 11 | 0.006 | [HMGA2, POU2F3, PSG1] |
| GO:0030517 | Negative regulation of axon extension | 12 | 0.008 | [BCL11A, RTN4R, SEMA5A] |
| GO:0051851 | Modification by host of symbiont morphology or physiology | 5.8 | 0.01 | [HMGA2, POU2F3, PSG1, SMC3] |
| GO:0000288 | Nuclear-transcribed mRNA catabolic process, deadenylation-dependent decay | 5.1 | 0.01 | [EIF4A1, EIF4B, PABPC1, SKIV2L] |
| GO:0015698 | Inorganic anion transport | 4.1 | 0.01 | [ABCB11, ANKH, CLCN6, CLCNKB, SLC12A6, SLC26A2, SLC5A5] |
| GO:0046426 | Negative regulation of JAK-STAT cascade | 5.7 | 0.02 | [HMGA2, RTN4R, RTN4RL2] |
| GO:1902476 | Chloride transmembrane transport | 4.3 | 0.02 | [CLCN6, CLCNKB, SLC12A6, SLC26A2] |
Fig. 4Pathway enrichment of the genes annotated to the top 200 most significant rare variants tested in our rare variant analysis