| Literature DB >> 29325019 |
Shaza B Zaghlool1,2, Dennis O Mook-Kanamori3, Sara Kader1, Nisha Stephan1, Anna Halama1, Rudolf Engelke4, Hina Sarwath4, Eman K Al-Dous5, Yasmin A Mohamoud5, Werner Roemisch-Margl6, Jerzy Adamski7,8, Gabi Kastenmüller6,8, Nele Friedrich9, Alessia Visconti10, Pei-Chien Tsai10, Tim Spector10, Jordana T Bell10, Mario Falchi10, Annika Wahl11,12, Melanie Waldenberger11,12, Annette Peters11,12,13, Christian Gieger11,12, Marija Pezer14, Gordan Lauc14, Johannes Graumann4,15, Joel A Malek5, Karsten Suhre1.
Abstract
Epigenetic regulation of cellular function provides a mechanism for rapid organismal adaptation to changes in health, lifestyle and environment. Associations of cytosine-guanine di-nucleotide (CpG) methylation with clinical endpoints that overlap with metabolic phenotypes suggest a regulatory role for these CpG sites in the body's response to disease or environmental stress. We previously identified 20 CpG sites in an epigenome-wide association study (EWAS) with metabolomics that were also associated in recent EWASs with diabetes-, obesity-, and smoking-related endpoints. To elucidate the molecular pathways that connect these potentially regulatory CpG sites to the associated disease or lifestyle factors, we conducted a multi-omics association study including 2474 mass-spectrometry-based metabolites in plasma, urine and saliva, 225 NMR-based lipid and metabolite measures in blood, 1124 blood-circulating proteins using aptamer technology, 113 plasma protein N-glycans and 60 IgG-glyans, using 359 samples from the multi-ethnic Qatar Metabolomics Study on Diabetes (QMDiab). We report 138 multi-omics associations at these CpG sites, including diabetes biomarkers at the diabetes-associated TXNIP locus, and smoking-specific metabolites and proteins at multiple smoking-associated loci, including AHRR. Mendelian randomization suggests a causal effect of metabolite levels on methylation of obesity-associated CpG sites, i.e. of glycerophospholipid PC(O-36: 5), glycine and a very low-density lipoprotein (VLDL-A) on the methylation of the obesity-associated CpG loci DHCR24, MYO5C and CPT1A, respectively. Taken together, our study suggests that multi-omics-associated CpG methylation can provide functional read-outs for the underlying regulatory response mechanisms to disease or environmental insults.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29325019 PMCID: PMC5886112 DOI: 10.1093/hmg/ddy006
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Summary of CpG–intermediate trait–complex trait associations for the CpG sites from the Petersen et al. (25) study
| Intermediate traits in QMDiab | Complex traits | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Locus name CpG Chr: Pos | Replication of Petersen study | Metabolite (blood) | Metabolite (urine) | Metabolite (saliva) | Lipid | Protein | Glycan | T2D | BMI | Blood pressure | Liver functions | Smoking | Reference | |
| chr4: 69514031 | ✓✓ | ✓ | ||||||||||||
| chr1: 145441552 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||
| chr1: 55353706 | ✓✓ | ✓ | ✓ | ( | ||||||||||
| chr15: 52554171 | ✓✓ | ✓ | ✓ | ( | ||||||||||
| chr21: 43656587 | ✓✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||||
| chr11: 798350 | ||||||||||||||
| chr11: 68607622 | ✓✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||||
| chr4: 139162808 | ✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||||
| chr1: 120255992 | ✓ | ✓ | ( | |||||||||||
| LOC100132354—cg18120259 | ||||||||||||||
| chr6: 43894639 | ✓ | ✓ | ✓ | ( | ||||||||||
| chr19: 47287778 | ✓ | ✓ | ✓ | ( | ||||||||||
| cg13526915 | ✓✓ | |||||||||||||
| chr14: 24164078 | ||||||||||||||
| chr5: 373378 | ✓✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||||
| chr2: 233284661 | ✓✓ | ✓ | ✓ | ✓ | ✓ | ( | ||||||||
| chr19: 17000585 | ✓ | ✓ | ✓ | ✓ | ( | |||||||||
| cg06126421 | ✓✓ | ✓ | ✓ | ✓ | ( | |||||||||
| chr6: 30720080 | ||||||||||||||
| chr17: 38476024 | ✓✓ | ✓ | ✓ | ( | ||||||||||
| ✓ | ✓ | ( | ||||||||||||
| chr1: 92947588 | ✓ | ✓ | ||||||||||||
| chr15: 63349192 | ||||||||||||||
| cg23079012 | ✓ | ✓ | ✓ | ✓ | ( | |||||||||
| chr2: 8343710 | ||||||||||||||
Entries marked with asterisks indicate that these associations are genome-wide significant in QMDiab as well.
For the Replication of the Petersen study, 2 ticks indicate Bonferroni significance, and 1 tick indicates nominal significance.
These associations include a replication in KORA.
These associations include a replication in TwinsUK.
Figure 1.Hypothesis tested in this study. Exposure to physiological challenges, such as an increased BMI, smoking or dysregulated glycemic control leads physiological changes that translate into changes in intermediate molecular phenotypes, such as metabolite levels that are detectable in different body fluids, blood circulating lipids, proteins and protein glycosylation. These then further induce changes in DNA methylation at specific regulatory sites of genes that are required to counter this insult. Note that this view does not exclude that changes in the expression of certain genes may not also result in further changes in molecular phenotypes. Hence, despite the fact that we found here three cases of causality from metabolite to CpG, cases with reverse directionality are also likely to exist.
General characteristics of the QMDiab study participants
| Age (years) | 46.8±12.8 (mean ± s.d.) |
| Sex | 177 (49.3%) female |
| 182 (50.7%) male | |
| Body mass index (kg/m2) | 29.6±6.0 (mean ± s.d.) |
| Ethnicity | 189 (52.6%) Arab |
| 106 (29.5%) South Asian | |
| 34 (9.5%) Filipino | |
| 13 (3.6%) other/mixed | |
| 17 (4.7%) missing | |
| T2D status | 182 (50.7%) having diabetes |
| 176 (49.0%) no diabetes | |
| 1 (0.03%) missing | |
| Smoking status | 62 (17.3%) smokers |
| 280 (78.0%) non-smokers | |
| 17 (4.7%) missing |
The QMDiab study has been described previously and comprises 388 study participants from Arab and Asian ethnicities (29). The statistics here are reported for the 359 samples with methylation data overlapping with at least one type of proteomics, lipidomics, glycomics, or metabolomics measurement.
Arab ethnicity includes participants from Bahrain, Egypt, Iraq, Jordan, Kuwait, Lebanon, Morocco, Oman, Palestine, Qatar, Saudi Arabia, Somalia, Sudan, Syria, Tunisia, United Arab Emirates and Yemen. South Asian ethnicity includes participants from Bangladesh, India, Nepal, Pakistan, and Sri Lanka.
Smoking status was determined based on the detection of cotinine in blood at the time of blood collection.
Figure 2.Multi-omics dataset and study design. A total of 388 individuals participated in the initial QMDiab study. A total of 359 samples had DNA methylation data and at least one other deep-molecular trait.
Multi-omics associations with CpG methylation in QMDiab
| Locus | Group | Trait | Trend | |
|---|---|---|---|---|
UGT2B15 cg09189601 chr4: 69514031 Other | Metabolites | X-19141 [plasma] | ↓ | 6.21 ×10−23 |
TXNIP cg19693031 chr1: 145441552 Diabetes | Metabolites | 1, 5-anhydroglucitol (1, 5-AG) [plasma] | ↑ | 7.56 × 10−21 |
| Glucose [NMR] | ↓ | 1.17 × 10−14 | ||
| 2-hydroxybutyrate (AHB) [urine] | ↓ | 2.52 × 10−14 | ||
| 3-hydroxybutyrate (BHBA) [urine] | ↓ | 5.87 × 10−13 | ||
| glucose [plasma] | ↓ | 3.15 × 10−12 | ||
| … (list truncated) | ||||
| Lipids | L-VLDL-CE_% | ↑ | 1.01 × 10−8 | |
| XL-VLDL-CE_% | ↑ | 2.05 × 10−8 | ||
| M-VLDL-CE_% | ↑ | 4.42 × 10−6 | ||
| XL-VLDL-C_% | ↑ | 5.66 × 10−6 | ||
| … (list truncated) | ||||
| Proteins | Transmembrane glycoprotein NMB (GPNMB) | ↓ | 1.30 × 10−8 | |
| Aminoacylase-1 (ACY1) | ↓ | 2.59 × 10−7 | ||
| Sex hormone-binding globulin (SHBG) | ↑ | 4.65 × 10−7 | ||
| Melanoma-derived growth regulatory protein (MIA) | ↑ | 6.88 × 10−7 | ||
| Glycans | PGP23 | ↓ | 2.75 × 10−8 | |
| PGP31 | ↓ | 9.31 × 10−8 | ||
| PGP29 | ↓ | 7.61 × 10−6 | ||
| PGP28 | ↓ | 8.42 × 10−6 | ||
DHCR24 cg17901584 chr1: 55353706 Obesity | Lipids | M-VLDL-C_% | ↑ | 6.62 × 10−9 |
| M-VLDL-TG_% | ↓ | 7.14 × 10−9 | ||
| M-VLDL-CE_% | ↑ | 1.87 × 10−8 | ||
| S-VLDL-TG_% | ↓ | 1.36 × 10−6 | ||
| … (list truncated) | ||||
MYO5C cg06192883 chr15: 52554171 Obesity | Glycans | PGP58 | ↑ | 8.31 × 10−9 |
| PGP70 | ↑ | 6.91 × 10−8 | ||
| PGP1 | ↑ | 3.67 × 10−7 | ||
| PGP17 | ↓ | 8.34 × 10−7 | ||
| PGP99 | ↑ | 1.23 × 10−6 | ||
| … (list truncated) | ||||
| IgG1_G0F | ↑ | 9.46 × 10−7 | ||
| IgG4_G2FN | ↓ | 1.79 × 10−5 | ||
ABCG1 cg06500161 chr21: 43656587 Diabetes and obesity | Metabolites | Myo-inositol [urine] | ↑ | 7.21 × 10−7 |
| Lipids | L-VLDL-CE_% | ↓ | 1.03 × 10−8 | |
| M-VLDL-CE_% | ↓ | 2.19 × 10−7 | ||
| M-VLDL-C_% | ↓ | 2.26 × 10−7 | ||
| XXL-VLDL-CE_% | ↓ | 8.73 × 10−7 | ||
| … (list truncated) | ||||
CPT1A cg00574958 chr11: 68607622 Diabetes and obesity | Proteins | Tumor necrosis factor ligand superfamily member 4 (TNFSF4) | ↑ | 1.61 × 10−6 |
SLC7A11 cg06690548 chr4: 139162808 Obesity | Metabolites | Serine [plasma] | ↑ | 3.05 × 10−7 |
AHRR cg05575921 chr5: 373378 Smoking | Metabolites | o-cresol sulfate [urine] | ↓ | 2.66 × 10−27 |
| 3-ethylphenylsulfate* [urine] | ↓ | 1.08 × 10−17 | ||
| X-17185 [urine] | ↓ ↓ | 2.42 × 10−16, 1.52 × 10−7 | ||
| X-12161 [urine] | ↓ | 5.17 × 10−13 | ||
| X-17398 [urine] | ↓ | 1.36 × 10−12 | ||
| … (list truncated) | ||||
| Proteins | Polymeric immunoglobulin receptor (PIGR) | ↓ | 2.03 × 10−11 | |
ALPPL2 cg21566642 chr2: 233284661 Smoking | Metabolites | o-Cresol sulfate [urine] | ↓ | 7.43 × 10−16 |
| 3-ethylphenylsulfate* [urine] | ↓ | 4.45 × 10−9 | ||
| X-17185 [plasma] | ↓ ↓ | 6.32 × 10−8, 1.15 × 10−6 | ||
| X-17398 [urine] | ↓ | 6.35 × 10−8 | ||
| 2-ethylphenylsulfate [urine] | ↓ | 4.54 × 10−7 | ||
F2RL3 cg03636183 chr19: 17000585 Smoking | Metabolites | o-Cresol sulfate [urine] | ↓ | 4.93 × 10−13 |
| 3-ethylphenylsulfate* [urine] | ↓ | 1.09 × 10−9 | ||
| X-17398 [urine] | ↓ | 1.41 × 10−8 | ||
| X-17185 [urine] | ↓ | 9.18 × 10−7 | ||
| Proteins | Polymeric immunoglobulin receptor (PIGR) | ↓ | 9.02 × 10−7 | |
cg06126421 chr6: 30720080 Smoking | Metabolites | X-17185 [urine] | ↓ ↓ | 2.99 × 10−10, 6.23 × 10−7 |
| o-Cresol sulfate [urine] | ↓ | 2.23 × 10−9 | ||
| X-17398 [urine] | ↓ | 2.49 × 10−7 | ||
| 3-ethylphenylsulfate* [urine] | ↓ | 5.37 × 10−7 | ||
| X-17320 [urine] | ↓ | 5.47 × 10−7 | ||
| 3-methyl catechol sulfate 1 [urine] | ↓ | 5.53 × 10−7 | ||
| Proteins | Polymeric immunoglobulin receptor (PIGR) | ↓ | 3.36 × 10−7 | |
RARA cg19572487 chr17: 38476024 Smoking | Metabolites | o-Cresol sulfate [urine] | ↓ | 3.29 × 10−7 |
| Proteins | Gelsolin (GSN) | ↑ | 1.89 × 10−6 | |
GFI1 cg09935388 chr1: 92947588 Smoking | Metabolites | o-Cresol sulfate [urine] | ↓ | 2.90 × 10−7 |
cg23079012 chr2: 8343710 Smoking | Metabolites | o-Cresol sulfate [urine] | ↓ | 4.51 × 10−8 |
| Proteins | X-linked interleukin-1 receptor accessory protein-like 2 (IL1RAPL2) | ↓ | 4.35 × 10−9 | |
| Vascular endothelial growth factor A (VEGFA) | ↓ | 1.42 × 10−6 | ||
| NudC domain-containing protein 3 (NUDCD3) | ↓ | 1.55 × 10−6 |
Association data for 14 of the 20 CpG loci reported by Petersen et al. (25). P-values are for the reported phenotypes in linear regression models with the respective covariates (Fig. 2). Associations were required to reach a Bonferroni level of significance of pmetabolite < 1.01 × 10−6, plipid < 1.11 × 10−5, pprotein < 2.22 × 10−6, and pglycan < 2.21 × 10−5 for metabolites, lipids, proteins, and glycan traits, respectively. Genomic coordinates are based on human genome build 37. A positive association with methylation levels is indicated by (↑), while a negative is indicated by (↓). Full summary statistics are in Supplementary Material, Table S3.
This metabolite was already reported in Petersen et al.
This metabolite was measured on different platforms or in different fluids in QMDiab (indicated in square brackets).
Note: We only present the five most significant associations for each category in this table. For a more comprehensive list, see Supplementary Material, Table S3.
Replication of novel proteome-methylation associations in the KORA study
| QMDiab | KORA | ||||
|---|---|---|---|---|---|
| Locus | Protein | Beta | Beta | ||
| TXNIP | Transmembrane glycoprotein NMB | 1.30 × 10−8 | −0.006 | 0.841 | −0.002 |
| cg19693031 | − | − | |||
| Melanoma-derived growth regulatory protein | 6.88 × 10−7 | 0.006 | 0.106 | 0.021 | |
| CPT1A cg00574958 | Tumor necrosis factor ligand superfamily member 4 | 1.61 × 10−6 | 0.002 | 0.204 | 0.003 |
| AHRR cg05575921 | − | − | |||
| F2RL3 cg03636183 | − | − | |||
| cg06126421 | − | − | |||
| RARA cg19572487 | |||||
| cg23079012 | X-linked interleukin-1 receptor accessory protein-like 2 | 4.35 × 10−9 | −0.001 | 0.756 | −0.001 |
| Vascular endothelial growth factor A | 1.42 × 10−6 | −0.002 | 0.600 | −0.004 | |
| NudC domain-containing protein 3 | 1.55 × 10−6 | −0.001 | 0.338 | 0.003 | |
Six out of twelve protein-methylation associations were replicated in KORA (N = 997) at Bonferroni significance p < 0.0041 (0.05/12). All but one association showed concordant directions in the two studies.
Replication of novel N-glycan-methylation associations in the TwinsUK study
| QMDiab | TwinsUK | ||||
|---|---|---|---|---|---|
| Locus | Glycan | Beta | Beta | ||
| PGP23 | 2.75 × 10−8 | −0.023 | 0.115 | −0.007 | |
| cg19693031 | PGP31 | 9.31 × 10−8 | −0.025 | 0.035 | −0.009 |
| PGP29 | 7.61 × 10−6 | −0.019 | 0.135 | −0.007 | |
| − | − | ||||
| PGP58 | 8.31 × 10−9 | 0.013 | 0.064 | 0.009 | |
| cg06192883 | PGP70 | 6.91 × 10−8 | 0.013 | 0.060 | 0.010 |
| PGP1 | 3.67 × 10−7 | 0.011 | 0.016 | 0.011 | |
| PGP17 | 8.34 × 10−7 | −0.011 | 0.175 | −0.006 | |
| PGP99 | 1.23 × 10−6 | 0.010 | 0.012 | 0.012 | |
| PGP77 | 4.00 × 10−6 | 0.009 | 0.073 | 0.008 | |
| PGP81 | 4.00 × 10−6 | −0.009 | 0.073 | −0.008 | |
| PGP64 | 6.88 × 10−6 | −0.009 | 0.266 | −0.005 | |
| PGP73 | 1.61 × 10−5 | 0.009 | 0.132 | 0.006 | |
| PGP72 | 1.72 × 10−5 | −0.010 | 0.030 | −0.013 | |
Four of the glycan-methylation associations displayed nominal significance P < 0.05 in the TwinsUK study (N = 165) and one was replicated at Bonferroni significance P < 0.0035 (0.05/14). All associations had the same direction of effect as in QMDiab. Glycan annotations are provided in Supplementary Material, Table S1.
Figure 3.Evidence supporting the hypothesis that genetically induced changes in metabolite levels are causal to the associated changes in methylation levels. The instrumental variables here were identified using the BIOS server (75) and SNiPA (76). The three-way associations were evaluated using the KORA dataset (N∼1800). The P-values (PIVW) shown are associated with the estimate (Wald test). In all three cases presented here (see Table 6 for details), the associations between SNP and CpG methylation can be fully explained via the metabolite. This suggests that the metabolic trait is causal to the association between metabolite and CpG.
Causality analysis using Mendelian randomization
| Triangle associations | MR (IVW method) | |||
|---|---|---|---|---|
| Metabolite∼SNP (instrument) | CpG∼SNP | CpG∼Metabolite (observed) | CpG∼Metabolite (predicted) | |
| (cg00574958) | ||||
| APO-cluster | ||||
| (rs964184) | SE = 0.0427 | SE = 0.0450 | SE = 0.0246 | SE = 0.177 |
| VLDL-A | CI95 = [0.170, 0.338] | CI95 = [−0.212, −0.0358] PIV = 0.080 | CI95 = [−0.234, −0.138] | CI95 = [−0.837,-0.141] |
| (cg17901584) | ||||
| (rs174547) | SE = 0.0339 | SE = 0.0345 | SE = 0.023 | SE = 0.100 |
| PC.ae.C36.5 | CI95 = [−0.411, −0.277] | CI95 = [−0.156, −0.0209] | CI95 = [0.157, 0.248] | CI95 = [0.061, 0.454] |
| (cg06192883) | ||||
| (rs715) | SE = 0.0325 | SE = 0.0361 | SE = 0.0254 | SE = 0.079 |
| glycine | CI95 = [0.391, 0.519] | CI95 = [−0.178, 0.00359] | CI95 = [−0.249,-0.149] | CI95 = [−0.390,-0.079] |
KORA data (N∼1800) was used for MR analysis using the inverse-variance weighted method. All three MR analyses suggest that changes in metabolites are causal for the observed changes in CpG methylation with Bonferroni significance PMR < 0.017 (0.05/3).
Abbreviations: = effect size (units: s.d./s.d. or s.d./minor allele copy), SE = standard error, P = P-value, CI95 = 95% confidence intervals, PIV = the P-value for the association of the CpG to the metabolite conditioned on the SNP; this association must not be significant for a valid instrument.