| Literature DB >> 32484362 |
Arce Domingo-Relloso1,2,3, Angela L Riffo-Campos4, Karin Haack5, Pilar Rentero-Garrido6,7, Christine Ladd-Acosta8, Daniele M Fallin7,8, Wan Yee Tang9, Miguel Herreros-Martinez10, Juan R Gonzalez11,12,13, Anne K Bozack1, Shelley A Cole5, Ana Navas-Acien1, Maria Tellez-Plaza9,14,2.
Abstract
BACKGROUND: The epigenetic effects of individual environmental toxicants in tobacco remain largely unexplored. Cadmium (Cd) has been associated with smoking-related health effects, and its concentration in tobacco smoke is higher in comparison with other metals.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32484362 PMCID: PMC7265996 DOI: 10.1289/EHP6345
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Figure 1.Summary of results from the epigenome-wide association study of cadmium (Cd) and smoking with DNA methylation (DNAm) and subsequent bioinformatics analysis. After considering a false discovery rate (FDR)-corrected p-value < 0.05, we obtained 6 Cd-differentially methylated positions (DMPs) and 312 smoking DMPs in a genomic exploration of CpG sites with the Infinium MethylationEPIC BeadChip. We excluded nonprotein-coding genomic regions and duplicate genes and conducted gene set enrichment analysis. We subsequently evaluated protein interaction networks among protein-encoding genes from the STRING database, resulting in a protein interaction network of 271 nodes and 1,802 interactions. We observed highly connected nodes, which were directly or indirectly related to DNAm metabolism key proteins.
Median (IQR) of urine cadmium () and accumulated smoking dose (pack-years) levels and percentage of former and current smoking status by participant’s characteristics.
| Cadmium ( | Former smoking (%) | Current smoking (%) | Cumulative smoking | ||
|---|---|---|---|---|---|
| Overall | 2,325 | 0.97 (0.62, 1.50) | 32.2 | 38.4 | 4 (0, 18) |
| Age (y) | |||||
| | 669 | 0.89 (0.56, 1.31) | 47.3 | 56.9 | 4 (0, 17) |
| 50–64 | 1,246 | 1.00 (0.64, 1.55) | 32.6 | 30.7 | 3 (0, 18) |
| | 410 | 1.03 (0.64, 1.59) | 20.1 | 12.4 | 2 (0, 20) |
| Sex | |||||
| Men | 964 | 0.71 (0.47, 1.10) | 51.1 | 53.4 | 9 (0, 27) |
| Women | 1,361 | 1.16 (0.77, 1.78) | 48.9 | 46.6 | 1 (0, 12) |
| Smoking status | |||||
| Never | 684 | 0.86 (0.54, 1.36) | 0.0 | 0.0 | 0 (0, 0) |
| Former | 748 | 0.81 (0.55, 1.25) | 100.0 | 0.0 | 7 (2, 20) |
| Current | 893 | 1.17 (0.77, 1.81) | 0.0 | 100.0 | 14 (5, 30) |
| Center | |||||
| Arizona | 312 | 0.76 (0.53, 1.19) | 15.5 | 6.3 | 0 (0, 5) |
| Oklahoma | 981 | 0.86 (0.55, 1.33) | 45.5 | 37.3 | 2 (0, 16) |
| Dakota | 1,032 | 1.11 (0.75, 1.78) | 39.0 | 56.4 | 7 (0, 23) |
| Obesity | |||||
| No | 1,238 | 1.05 (0.66, 1.63) | 44.8 | 62 | 6 (0, 23) |
| Yes | 1,087 | 0.87 (0.57, 1.35) | 55.2 | 38 | 2 (0, 13) |
| eGFR ( | |||||
| | 76 | 0.91 (0.53, 1.51) | 32.89 | 32.89 | 7 (2, 25) |
| | 2,249 | 0.97 (0.62, 1.50) | 32.15 | 38.59 | 11 (3, 26) |
Note: IQR, interquartile range.
Top differentially methylated positions (DMPs) associated to log-transformed urine cadmium (Cd) levels (i.e., Cd DMPs).
| CpG | Chr | Position | Gene | Function | Overall | Never smokers | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Odds ratio (95% CI) | Nominal | FDR | Odds ratio (95% CI) | Nominal | FDR | |||||
| cg14391737 | chr11 | 86513429 | PRSS23 | Encodes Serine protease 23. May be an important ovarian protease | 0.92 (0.9, 0.94) | 0.94 (0.89, 0.98) | 0.003 | 0.006 | ||
| cg21566642 | chr2 | 233284661 | 2q37.1 | Uncharacterized | 0.92 (0.9, 0.94) | 0.97 (0.92, 1.01) | 0.1 | 0.2 | ||
| cg05575921 | chr5 | 373378 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.84 (0.8, 0.88) | 0.85 (0.77, 0.94) | 0.002 | 0.006 | ||
| cg01940273 | chr2 | 233284934 | 2q37.1 | Uncharacterized | 0.96 (0.94, 0.97) | 0.004 | 0.98 (0.95, 1.02) | 0.3 | 0.3 | |
| cg03636183 | chr19 | 17000585 | F2RL3 | Blood coagulation, inflammation, and response to pain | 0.96 (0.94, 0.97) | 0.02 | 0.96 (0.93, 1.00) | 0.03 | 0.04 | |
| cg17739917 | chr17 | 38477572 | RARA | Development, differentiation, apoptosis, granulopoeisis, clock genes transcription | 0.95 (0.93, 0.97) | 0.03 | 0.93 (0.88, 0.97) | 0.002 | 0.006 | |
Note: BMI, body mass index; CI, confidence interval; FDR, false discovery rate.
Linear regression models were fitted using logit-transformed DNAm proportions as dependent variables separately for each CpG and were adjusted for smoking status (never, former, current), age (years), sex (male/female), BMI (), Houseman cell proportions (CD8T, CD4T, NK, B cells, monocytes and granulocytes), study center (Arizona, Oklahoma or North and South Dakota), five genetic principal components and estimated glomerular filtration rate ().
Not available CpGs in the Illumina HumanMethylation450 BeadChip array.
Top 25 differentially methylated positions (DMPs) comparing current smoking status with never smoking status (i.e., current smoking-DMPs).
| CpG | Chr | Position | Gene | Function | Odds ratio (95% CI) | Nominal | FDR | FDR | Direction of association in meta-analysis ( |
|---|---|---|---|---|---|---|---|---|---|
| cg05575921 | chr5 | 373378 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.38 (0.35, 0.42) | Inverse | |||
| cg21566642 | chr2 | 233284661 | 2q37.1 | Uncharacterized | 0.71 (0.69, 0.74) | Inverse | |||
| cg01940273 | chr2 | 233284934 | 2q37.1 | Uncharacterized | 0.8 (0.78, 0.83) | Inverse | |||
| cg17739917 | chr17 | 38477572 | RARA | Development, differentiation, apoptosis, granulopoeisis, clock genes transcription | 0.75 (0.73, 0.78) | Not available | — | ||
| cg03636183 | chr19 | 17000585 | F2RL3 | Blood coagulation, inflammation, and response to pain | 0.8 (0.78, 0.83) | Inverse | |||
| cg26703534 | chr5 | 377358 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.83 (0.81, 0.86) | Inverse | |||
| cg21161138 | chr5 | 399360 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.86 (0.84, 0.88) | Inverse | |||
| cg25648203 | chr5 | 395444 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.84 (0.81, 0.87) | Inverse | |||
| cg09935388 | chr1 | 92947588 | GFI1 | Transcriptional repressor. Plays a role in hematopoiesis and oncogenesis | 0.75 (0.71, 0.79) | Inverse | |||
| cg14391737 | chr11 | 86513429 | PRSS23 | Encodes Serine protease 23. May be an important ovarian protease | 0.81 (0.78, 0.84) | Not available | — | ||
| cg19859270 | chr3 | 98251294 | GPR15 | Chemokine receptor for human immunodeficiency virus type 1 and 2 | 0.84 (0.81, 0.87) | Inverse | |||
| cg17287155 | chr5 | 393347 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 0.78 (0.74, 0.82) | Inverse | |||
| cg14753356 | chr6 | 30720108 | Intergenic | Uncharacterized | 0.86 (0.83, 0.89) | Inverse | |||
| cg16841366 | chr2 | 233286192 | 2q37.1 | Uncharacterized | 0.83 (0.8, 0.87) | Not available | — | ||
| cg18110140 | chr15 | 75350380 | Intergenic | Uncharacterized | 0.87 (0.84, 0.9) | Not available | — | ||
| cg21911711 | chr19 | 16998668 | F2RL3 | Blood coagulation, inflammation, and response to pain | 0.85 (0.82, 0.88) | Not available | — | ||
| cg25189904 | chr1 | 68299493 | GNG12 | Modulator in transmembrane signaling systems | 0.81 (0.78, 0.85) | Inverse | |||
| cg22675726 | chr18 | 3179889 | MYOM1 | Structural constituent of muscle. Associated with hypertrophic cardiomyopathy | 0.85 (0.82, 0.88) | Not available | — | ||
| cg00045592 | chr1 | 160714299 | SLAMF7 | Modulator on the activation and differentiation of immune cells | 0.89 (0.86, 0.91) | Not available | — | ||
| cg00475490 | chr11 | 86517110 | PRSS23 | Encodes Serine protease 23. May be an important ovarian protease | 0.78 (0.74, 0.83) | Not available | — | ||
| cg16522223 | chr22 | 43400443 | PACSIN2 | Intracellular vesicle-mediated transport | 0.93 (0.91, 0.94) | Not available | — | ||
| cg09338374 | chr22 | 39888390 | Intergenic | Uncharacterized | 1.1 (1.07, 1.12) | Not available | — | ||
| cg15159987 | chr19 | 17003890 | CPAMD8 | Innate and acquired immunity | 0.93 (0.91, 0.95) | Inverse | |||
| cg21322436 | chr7 | 145812842 | CNTNAP2 | Cell adhesion. Related to autism | 0.87 (0.84, 0.9) | Inverse | |||
| cg26768182 | chr9 | 134272679 | Intergenic | Uncharacterized | 0.88 (0.85, 0.9) | Not available | — |
Note: —, no data; BMI, body mass index; CI, confidence interval; DNAm, DNA methylation; FDR, false discovery rate.
Linear regression models were fitted using logit-transformed DNAm proportions as dependent variables separately for each CpG and were adjusted, age (y), sex (male/female), BMI (), Houseman cell proportions (CD8T, CD4T, NK, B cells, monocytes and granulocytes), five genetic principal components and study center (Arizona, Oklahoma, or North and South Dakota).
Not available CpGs in the metanalysis by (Joehanes et al. 2016) reflect that the metanalysis was based in the Illumina Infinium HumanMethylation450 BeadChip array.
Changes in DNA methylation percentages comparing current with never smokers attributable to changes in urinary Cd concentrations (“mediated effect”).
| CpG | Chr | Position | Gene | Function | Outcome model | Mediated (i.e., indirect) effects | Total effects of current vs. never smoking in DNA methylation % | ||
|---|---|---|---|---|---|---|---|---|---|
| Absolute change in DNAm % per log-Cd unit increase (95% CI) | Absolute change in DNAm % comparing current vs. never smokers (“Direct effect”) (95% CI) | Difference in change attributable to Cd (95% CI) | Percentage of change attributable to Cd (95% CI) | Sum of direct and indirect effects (95% CI) | |||||
| cg14391737 | chr11 | 86513429 | PRSS23 | Encodes Serine protease 23. May be an important ovarian protease | 13.05 (9.4, 17.18) | ||||
| cg05575921 | chr5 | 373378 | AHRR | Mediates dioxin toxicity. Involved in cell growth and differentiation | 6.43 (5.23, 7.72) | ||||
| cg21566642 | chr2 | 233284661 | 2q37.1 | Uncharacterized | 7.82 (5.4, 10.6) | ||||
| cg03636183 | chr19 | 17000585 | F2RL3 | Blood coagulation, inflammation and response to pain | 11.45 (8.12, 15.33) | ||||
| cg17739917 | chr17 | 38477572 | RARA | Development, differentiation, apoptosis, granulopoeisis, clock genes transcription | 8.06 (4.97, 11.62) | ||||
| cg01940273 | chr2 | 233284934 | 2q37.1 | Uncharacterized | 7.61 (4.86, 10.73) | ||||
Note: BMI, body mass index; Cd, cadmium; CI, confidence interval; DNAm, DNA methylation.
Absolute changes (mean differences) in methylation percent comparing current with never smokers (i.e., “Direct effect”) and absolute changes in DNAm % associated with one-unit increase in log-transformed urine Cd were both estimated as the regression coefficients from beta models with an identity link (i.e., “Outcome model”) which included the following independent variables: current smoking status (“Exposure”), log-transformed Cd (“Mediator”), age (years), sex (men/women), BMI, Houseman cell proportions (CD8T, CD4T, NK, B cells, monocytes and granulocytes), five genetic principal components, and study center (Arizona, Oklahoma, or North and South Dakota).
Effects mediated through Cd (i.e., “Indirect effects”) were calculated using the “product of coefficients” method that multiplies the coefficient for the mean difference in log-transformed cadmium concentrations comparing current with never smokers (i.e., 0.38 [95% CI: 0.32, 0.44] corresponding to coefficient [95% CI] for current smoking status from the linear regression “mediator model” where the outcome is log-transformed Cd adjusted for the same variables as the outcome model, data not shown) by the absolute change in DNAm % associated with one-unit increase in log-Cd concentrations indirect effects were expressed in absolute terms (difference in smoking-related DNAm % change) and relative to the “Total effect” [DNAm % changes (95% CIs) comparing current with never smokers] calculated as the sum of the direct and indirect effects. Note that if the mediation analysis assumptions hold, the estimated “Total effects” in the setting of the “product of coefficients” method is numerically identical to the regression coefficient associated with current smoker status in a beta regression as in the outcome model without Cd.
To illustrate the estimation of mediated effects in the setting of the product of coefficients methods, we provide an example: In absolute terms, the mediated effect of Cd in cg14391737 (column 8) was obtained as . The corresponding mediated effect in relative terms (column 9) was . The 95% CIs for the total and mediated effects were derived by simulation from the estimated model coefficients and covariance matrices.
Figure 2.Enrichment analysis of top significant cadmium (Cd) and smoking-related differentially methylated positions (DMPs) () for 15-chromatine states from the ROADMAP project. The area of the solid dots is directly proportional to the strength of the statistical evidence in favor of being annotated to a given chromatin state category comparing statistically significant vs. nonstatistically significant DMPs in our data.
Figure 3.Protein-interaction network between proteins attributed to cadmium (Cd)- and smoking-differentially methylated positions (DMPs) and key proteins in DNA methylation (DNAm) metabolism pathways from KEGG. The nodes correspond to proteins involved in DNAm-related pathways, and proteins encoded by genes associated to Cd-DMPs and smoking-DMPs. The size of the nodes is proportional to the number of connections. The STRING database provides a confidence score (from 0 to 1) to indicate the estimated likelihood that the annotated interaction between a given pair of proteins is biologically meaningful, specific, and reproducible, according to the evidence derived from in-house predictions, homology transfers, and the externally maintained databases (Szklarczyk et al. 2019). Increasingly darker solid edge lines indicate a protein interaction with increasingly higher confidence scores. The nodes included among the top 25 statistically significant Cd- and/or smoking-DMPs in our EWAS show thicker node lines.