| Literature DB >> 35172000 |
Suresh Venkateswaran1, Hari K Somineni1,2, Varun Kilaru3, Seyma Katrinli3, Jarod Prince1, David T Okou1, Jeffrey S Hyams4, Lee A Denson5, Richard Kellermayer6, Greg Gibson7, David J Cutler8, Alicia K Smith3,9, Subra Kugathasan1,8, Karen N Conneely8.
Abstract
Recently, we identified 1,189 CpG sites whose DNA methylation level in blood associated with Crohn's disease. Here, we examined associations between DNA methylation and genetic variants to identify methylation quantitative trait loci across disease states in (1) 402 blood samples from 164 newly diagnosed pediatric Crohn's disease patients taken at 2 time points (diagnosis and follow-up), and 74 non-inflammatory bowel disease controls, (2) 780 blood samples from a non-Crohn's disease adult population, and (3) 40 ileal biopsies (17 Crohn's disease cases and 23 non-inflammatory bowel disease controls) from group (1). Genome-wide DNAm profiling and genotyping were performed using the Illumina MethylationEPIC and Illumina Multi-Ethnic arrays. SNP-CpG associations were identified via linear models adjusted for age, sex, disease status, disease subtype, estimated cell proportions, and genotype-based principal components. In total, we observed 535,448 SNP-CpG associations between 287,881 SNPs and 12,843 CpG sites (P < 8.21 × 10-14). Associations were highly consistent across different ages, races, disease states, and tissue types, suggesting that the majority of these methylation quantitative trait loci participate in common gene regulation. However, genes near CpGs associated with inflammatory bowel disease SNPs were enriched for 18 KEGG pathways relevant to inflammatory bowel disease-linked immune function and inflammatory responses. We observed suggestive evidence for a small number of tissue-specific associations and disease-specific associations in ileum, though larger studies will be needed to confirm these results. Our study concludes that the vast majority of blood-derived methylation quantitative trait loci are common across individuals, though a subset may be involved in processes related to Crohn's disease. Independent cohort studies will be required to validate these findings.Entities:
Keywords: Crohn’s disease (CD); DNA methylation (DNAm); inflammatory bowel disease (IBD); methylation quantitative trait loci (mQTL)
Mesh:
Year: 2022 PMID: 35172000 PMCID: PMC8982416 DOI: 10.1093/g3journal/jkac041
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Fig. 1.Genetic associations with DNAm genome-wide in discovery cohort. a) In total 535,448 SNP-CpG pairs are plotted, including both cis- and trans-associations. The x-axis indicates genomic variant position, whereas the y-axis indicates CpG site position. The color of each point represents absolute differences in DNAm per allele relative to the reference allele. Point size represents the significance of the association. A clear positive diagonal line indicates that most of them are cis-associations. b) In total, 14,979 cis-associations are plotted by position of CpG relative to SNP. The strongest associations are seen for proximal pairs of SNPs and CpG sites.
Fig. 2.Replication of mQTL associations in replication cohort. Effect sizes for 425,288 SNP-CpG pairs are compared between discovery and replication cohorts. Associations significant in both discovery cohort (P < 8.21 × 10−14) and replication cohort (P < 5 × 10−8) are marked in maroon. Gray dots indicate associations significant only in discovery cohort.
Fig. 3.Disease-, disease course-, or tissue-specific associations in discovery cohort. For 498,261 cis-SNP-CpG pairs significant in the discovery cohort of 238 blood samples, Figure 3 shows effect sizes estimated in the following subgroups: a) 74 healthy controls vs 164 CD cases at diagnosis, (b) 164 CD cases at diagnosis v. their 3-year follow-up, (c) 74 healthy controls at baseline vs 164 CD cases at follow-up, and (d) all 238 blood samples vs 40 ileal biopsies. Maroon color indicates the associations that are significant (P < 5 × 10−8) in both subgroups. Gray indicates the associations that are significant in 1 subgroup but not the other.
Summary of blood mQTLs identified in discovery and replication cohorts.
| Cohorts | Number of mQTL associations in total |
|
| |
|---|---|---|---|---|
| Discovery cohort ( |
| |||
| SNP-CpG associations | 498,261 | 69 | 565 | |
| Unique SNPs | 287,881 | 37 | 565 | |
| Unique CpGs | 12,843 | 69 | 22 | |
|
| ||||
| SNP-CpG associations | 37,187 | 1 | 0 | |
| Unique SNPs | 27,664 | 1 | 0 | |
| Unique CpGs | 1,118 | 1 | 0 | |
| Replication cohort ( |
| |||
| SNP-CpG associations | 3,140,805 | 253 | 9,587 | |
| Unique SNPs | 1,429,468 | 87 | 8,848 | |
| Unique CpGs | 62,456 | 252 | 261 | |
|
| ||||
| SNP-CpG associations | 175,588 | 4 | 184 | |
| Unique SNPs | 106,900 | 4 | 85 | |
| Unique CpGs | 7901 | 4 | 11 | |
Fig. 4.Comparing CpG-specific associations with IBD SNPs in blood vs ileum. a) Frequency of CpG sites that are associated with previously identified IBD SNPs in discovery cohort at FDR < 0.05. In total, 454 associations are considered. b) Effect sizes of 454 SNP-CpG associations compared in CD cases vs controls in blood. x-axis shows the effect sizes in CD cases (n = 164) and y-axis shows the effect sizes for the same associations in controls (n = 74) (c) Effect sizes of 370 SNP-CpG associations found in blood are compared to effect sizes observed in ileum. The nominally significant associations in ileum (P < 0.05) are marked in maroon. (D) Effect sizes of 269 SNP-CpG associations compared across the disease groups in ileum. The x-xis shows mQTL effect sizes in CD cases (n = 17) and y-axis shows effect sizes for the same associations in controls (n = 23). Maroon points indicate nominally significant (P < 0.05) interactions between SNP and disease group, suggesting differences in strength and/or direction of SNP-CpG association for cases vs controls.
Fig. 5.Testing the effects of CD-associated CpG sites in blood and ileum. a) Frequency of SNPs that are associated with CD-associated CpG sites in discovery cohort at FDR < 0.05 are plotted. In total, 7,819 associations with 289 CD-associated CpG sites are considered. b) Effect sizes of 7,819 SNP-CpG associations compared in cases vs controls in blood. The x-axis shows the effect sizes in CD cases (n = 164) and y-axis shows the effect sizes for the same associations in controls (n = 74) (c) Effect sizes of 6,221 SNP-CpG associations from blood are compared to effect sizes observed in ileum. Associations nominally significant in ileum (P < 0.05) are marked in maroon. d) Effect sizes of 4,142 SNP-CpG associations compared in cases vs controls in ileum. x-axis shows effect sizes in CD cases (n = 17) and y-axis shows effect sizes for the same associations in controls (n = 23). Maroon points indicate nominally significant (P < 0.05) interactions between SNP and disease group, suggesting differences in strength and/or direction of SNP-CpG association for cases vs controls.