Literature DB >> 27066266

A cross-study analysis of prenatal exposures to environmental contaminants and the epigenome: support for stress-responsive transcription factor occupancy as a mediator of gene-specific CpG methylation patterning.

Elizabeth M Martin1, Rebecca C Fry1.   

Abstract

A biological mechanism by which exposure to environmental contaminants results in gene-specific CpG methylation patterning is currently unknown. We hypothesize that gene-specific CpG methylation is related to environmentally perturbed transcription factor occupancy. To test this hypothesis, a database of 396 genes with altered CpG methylation either in cord blood leukocytes or placental tissue was compiled from 14 studies representing assessments of six environmental contaminants. Subsequently, an in silico approach was used to identify transcription factor binding sites enriched among the genes with altered CpG methylation in relationship to the suite of environmental contaminants. For each study, the sequences of the promoter regions (representing -1000 to +500 bp from the transcription start site) of all genes with altered CpG methylation were analyzed for enrichment of transcription factor binding sites. Binding sites for a total of 56 unique transcription factors were identified to be enriched within the promoter regions of the genes. Binding sites for the Kidney-Enriched Krupple-like Factor 15, a known responder to endogenous stress, were enriched (P < 0.001-0.041) among the genes with altered CpG methylation associated for five of the six environmental contaminants. These data support the transcription factor occupancy theory as a potential mechanism underlying environmentally-induced gene-specific CpG methylation.

Entities:  

Keywords:  CpG DNA methylation; KLF15; gene-specific methylation patterning; prenatal environmental exposure; transcription factor occupancy theory

Year:  2016        PMID: 27066266      PMCID: PMC4824001          DOI: 10.1093/eep/dvv011

Source DB:  PubMed          Journal:  Environ Epigenet        ISSN: 2058-5888


Introduction

Prenatal exposure to environmental contaminants has been associated with detrimental health outcomes early in life, as well as later in adulthood. Depending upon the contaminant, detrimental health effects in infants and children may include low birth weight, preterm birth, and increased incidence of infections in early childhood [ 1 , 2 ]. In addition to health effects apparent at birth and in early childhood, there is the potential for prenatal environmental exposures to influence adult health. For example, later life diseases associated with early life exposures include cancers in different tissues, diabetes, and suppressed immune function [ 3 , 4 ]. Still, specific genes and biological pathways that are causally related to these later life health effects of early life exposure remain largely unknown. One potential mechanism by which prenatal exposures may drive later life health outcomes is via epigenetic modification, specifically altered CpG methylation patterning. CpG methylation has the potential to alter transcription and subsequent translational regulation, ultimately impacting disease susceptibility [ 5 ]. During mammalian fetal development, there are two major events, one during preimplantation during which DNA is passively demethylated and one during midgestation during which DNA is actively demethylated, leaving the genome completely unmethylated and vulnerable to insult [ 6–8 ]. Additionally, passive demethylation takes place throughout pregnancy resulting in multiple timeframes during which the fetal epigenome is susceptible to epigenetic modification [ 3 , 4 , 8 ]. These CpG modifications can be influenced via maternal exposures during the prenatal period with the potential to remain stable over the course of a lifetime [ 9 ]. This stability is important as it could provide an explanation for the relationship between prenatal exposures with later life health outcomes. There have been several biological mechanisms proposed to underlie environmental-contaminant-driven changes in CpG methylation. For example, some environmental contaminants disrupt the process of DNA methylation by affecting DNA methyl transferase (DNMT) activity and/or modify the availability of the substrate used for methylation, namely S-adenosine methionine [ 10 ]. Environmental contaminants can adversely impact the ability of DNMT to methylate DNA either by affecting the efficiency of the enzyme and/or decreased expression levels of DNMT [ 11 ]. In other cases, contaminants may be detoxified by processes that utilize S-adenosine methionine, thus depleting the methyl donor molecule. Depletion of the methyl donor and decreased functionality of DNMT have been associated with decreased levels of global methylation [ 12 , 13 ]. Importantly, while these data provide general information on contaminant-DNA methylation relationships, they do not explain how exposure to environmental contaminants results in gene-specific modifications to CpG methylation. In prior work, we proposed that transcription factors, induced or repressed by environmental contaminants, may mediate the gene-specific patterns of DNA methylation related to in utero exposures. Specifically, we hypothesized that transcription factors respond to environmental contaminants and bind to specific target regions (binding elements) of DNA influencing access of the DNA methylation machinery to DNA [ 14–16 ]. More specifically, the “transcription factor occupancy theory” suggests that the binding of transcription factors to target regions of the genome may influence gene-specific patterns of CpG methylation by influencing the access of DNA methyltransferase (DNMT) to specific sites. Our theory is supported by in vitro studies demonstrating that genes in lowly methylated regions of the genome are often occupied by DNA binding proteins [ 17–20 ]. Specifically, these studies have established that transcription factor binding correlates with lowly methylated regions of the genome, suggesting that DNA-binding factors can influence patterns of DNA methylation. In addition to data derived from in vitro studies, in vivo studies have demonstrated that transcription factor binding could alter methylation patterns of key genes. Specifically, decreased maternal care in rats leads to DNA binding of the transcription factor NGFI-A, resulting in decreased DNA methylation observed at the glucocorticoid receptor (GR) gene promoter region [ 21 ]. Taken together, these data support that transcription factor binding to target genes can influence CpG methylation patterns. In this work, an in silico approach was used to determine whether evidence exists for the transcription factor occupancy theory related to prenatal environmental exposures. Gene sets derived from studies that examined the effects of prenatal environmental exposures on genome-wide gene-specific DNA methylation were examined including those that assessed arsenic [ 15 , 22–24 ], cadmium [ 16 , 25 ], lead [ 26 ], manganese [ 27 ], mercury [ 28 ], and tobacco smoke [ 29–33 ]. The data from the 14 studies were integrated, resulting in a cross-contaminant database used to identify common transcription factor binding sequences in differentially methylated genes associated with environmental contaminants.

Results

Generation of a Cross-Study Database of Gene-Specific CpG Methylation

Of the 14 studies identified for analysis, the majority ( n  = 11, 78.6%) examined CpG methylation in cord blood leukocytes, while the remaining three studies analyzed DNA methylation in placental tissue ( Table 1 ). A number of the published studies examined the relationship between CpG methylation and prenatal exposure to tobacco smoke ( n  = 5) or arsenic ( n  = 4). The other studies focused on cadmium ( n  = 2), lead ( n  = 1), manganese ( n  = 1), or mercury ( n  = 1). The majority ( n  = 12, 85.7%) of studies utilized the Illumina 450 k array for CpG methylation assessment, while one study used the Illumina 27 k array and one study used the Affymetrix MIRA methylation assay.
Table 1

Summary of studies included for analysis ( n  = 14)

Study (PMID)Environmental contaminant Subjects ( n ) Number of associated genesMeasure of exposureRange of exposuresTissue/biological matrixTechnology
Broberg et al. [ 22 ] (24965135) Arsenic (inorganic)12729Maternal urinary total arsenic20–457 μg/lCord bloodIllumina 450 k
Kile et al. [ 23 ] (24525453) Arsenic (inorganic)446Drinking water arsenic ≤LOD * -510 μg/l Cord bloodIllumina 450 k
Koestler et al. [ 24 ] (23757598) Arsenic (inorganic)13425Maternal urinary arsenic (quartiles)1.8–6.6 μg/lCord bloodIllumina 450 k
Rojas et al. [ 15 ] (25304211) Arsenic (inorganic)3816Maternal urinary total arsenic ≤LOD * –236 μg/l Cord bloodIllumina 450 k
Kipper et al. [ 26 ] (23644563) Cadmium12725Maternal urinary cadmium0.21–2.4 μg/lCord bloodIllumina 450 k
Sanders et al. [ 16 ] (24169490) Cadmium1791Maternal urinary cadmium0–1.05 μg/lCord bloodMethylated Island Recovery Assay
Sen et al. [ 26 ] (26046694) Lead4816Maternal blood lead levels0.59–721 μg/dlCord bloodIllumina 450 k
Maccani et al. [ 27 ] (25982381) Manganese615Maternal toenail manganese levels0.131 to 5.666 μg/gPlacental tissueIllumina 450 k
Maccani et al. [ 28 ] (25748564) Mercury41104Maternal toenail mercury levels (tertiles)≤LOD-0.38 μg/gPlacental tissueIllumina 450 k
Ivorra et al. [ 29 ] (25623364) Tobacco smoke2025Self-reported smoking status confirmed by cotinine levelsNACord bloodIllumina 450 k
Joubert et al. [ 30 ] (22851337) Tobacco smoke106210Cotinine-based categorization of smoking status≤0 ->388 nmol/lCord bloodIllumina 450 k
Lee et al. [ 31 ] (25325234) Tobacco smoke13212Self-reported smoking statusSmoking one cigarette per day during the second trimesterCord bloodIllumina 450 k
Richmond et al. [ 32 ] (25552657) Tobacco smoke80015Self-reported smoking statusSmoking in at least two trimesters during pregnancyCord bloodIllumina 450 k
Suter et al. [ 33 ] (21937876) Tobacco Smoke3633Self-reported smoking statusNAPlacental tissueIllumina 27 k

* Limit of detection (LOD)

Summary of studies included for analysis ( n  = 14) * Limit of detection (LOD)

CpG Methylation Trends across Environmental Contaminants

We first set out to determine whether there was a common set of genes with altered CpG methylation associated with prenatal exposure to environmental contaminants. Across the 14 studies, there were 396 unique genes identified to have altered CpG methylation in relationship to exposure to six different environmental contaminants ( Fig. 1 ). There was no overlap between the 396 genes within the compiled database and the 500 probes represented within the Houseman et al. [ 45 ]. Further detail regarding whether these genes were classified as hyper- or hypomethylated in relationship to environmental exposure is presented in Supplementary Table S1 . At the gene level, there was little overlap with only 14 (3.53%) genes being shared between two or more studies, regardless of environmental contaminant.
Figure 1

A heatmap of CpG methylation patterns of environmental contaminant-associated genes ( n  = 396) across 14 studies. The directionality of CpG methylation changes associated with environmental contaminants are as follows: red shading indicates hypermethylated genes, blue shading indicates hypomethylated genes, and gray indicates genes with no change in CpG methylation in relationship to the environmental contaminant.

A heatmap of CpG methylation patterns of environmental contaminant-associated genes ( n  = 396) across 14 studies. The directionality of CpG methylation changes associated with environmental contaminants are as follows: red shading indicates hypermethylated genes, blue shading indicates hypomethylated genes, and gray indicates genes with no change in CpG methylation in relationship to the environmental contaminant. Specifically, there was little evidence of gene-specific overlap between studies examining the same contaminant. For example, there were no common genes identified between the two studies that analyzed CpG methylation in relationship to prenatal cadmium exposure ( Fig. 1 , Supplementary Table S2 ) [ 16 , 25 ]. Similarly, there were no common genes identified among the four studies analyzing the relationship between prenatal arsenic exposure and fetal cord blood CpG methylation [ 15 , 22–24 ]. In contrast, there were four genes identified in three of the tobacco smoke studies, namely the Aryl-Hydrocarbon Receptor Repressor ( AHRR ), Contactin-associated protein-like 2 ( CNTNAP2 ), Cytochrome P450, family 1, member A1 ( CYP1A1 ), and Myosin IG ( MYO1G ) [ 30–32 ]. An additional two genes were common between two of the tobacco-smoke studies, namely ENSG00000225718 and Growth Factor Independent 1 Transcription Repressor ( GFI1 ) [ 30 , 32 ] ( Supplementary Table S1 ). In addition to the genes that were common to studies of a single contaminant, there were eight genes with altered CpG methylation common to multiple contaminants ( Fig. 1 , Supplementary Table S1 ). Increased levels of mercury and tobacco smoke were associated with hypermethylation of G Protein-Coupled Receptor 135 ( GPR135 ) and Krueppel-Related Zinc Finger Protein 1 ( HKR1 ). Increased lead and tobacco smoke exposure were both associated with hypermethylation of Dimethylargininase-2 ( DDAH2 ). Lastly, increased levels of arsenic and tobacco smoke were associated with hypermethylation of Evolutionarily Conserved G Patch Domain Containing ( GPATCH1 ). Of note, for these genes, the observed relationship between environmental contaminants and CpG methylation status were the same across studies with increasing levels of the environmental contaminants associated with increased CpG methylation. The final four genes were shared between studies of metals where cadmium and mercury exposure were associated with altered CpG methylation of Acid Trehalase-Like 1 ( ATHL1 ). Arsenic and mercury exposure were associated with altered CpG methylation of Cyclic AMP-Responsive Element-Binding Protein 5 ( CREB5 ) and Ribosomal Protein S6 Kinase, 90 kDa, Polypeptide 2 ( RPS6K42 ). Arsenic and cadmium exposure were associated with altered CpG methylation of Histone Deacetylase 4 ( HDAC4 ). In contrast to the prior findings, the observed associations between CpG methylation and the environmental contaminants were in opposite directions. ATHL1 showed cadmium-associated hypermethylation and mercury-associated hypomethylation. CREB5 displayed mercury-associated hypermethylation and arsenic-associated hypomethylation. HDAC3 showed arsenic-associated hypermethylation and cadmium-associated hypomethylation. Finally, RPS6KAS2 displayed mercury-associated hypermethylation and arsenic-associated hypomethylation ( Supplementary Table S1 ). Most of the summarized studies have previously characterized the associated biological functions of the genes with altered CpG methylation. In this study, we aimed to establish major functional categories of genes with altered CpG methylation to determine commonalities within or across contaminant-specific gene sets. Nearly all the contaminants ( n  = 4) including arsenic, cadmium, lead, and tobacco smoke were associated with CpG methylation in target genes with known roles in transcription or transcription-related processes ( n  = 54, P value range: 0.000749–0.043) ( Supplementary Table S3 ). Among these genes were HDAC4 , a histone deactylase with altered CpG methylation associated with arsenic and cadmium, as well as polymerase genes DNA-Directed RNA Polymerase II Subunit E, POLR2E , DNA-Directed RNA Polymerase II Subunit J2, POLR2J2 with altered CpG methylation in relationship to cadmium, and known transcription factor genes AHRR , Translocation-Associated Notch Protein TAN-1, NOTCH1 with altered CpG methylation in relationship to tobacco smoke exposure ( Supplementary Table S2 ). Additionally, genes that are involved with ATP binding had altered CpG methylation in relationship to exposure to arsenic ( n  = 11, P value = 0.019). Genes involved in DNA binding had altered CpG methylation in relationship to arsenic ( n  = 7, P value = 0.029), cadmium ( n  = 17, P value = 0.040), and tobacco smoke ( n  = 16, P value = 0.006). Metal binding-related genes had altered CpG methylation in relationship to prenatal exposure to arsenic ( n  = 22, P value = 0.001), mercury ( n  = 28, P value = 0.002), and tobacco smoke ( n  = 6, P value = 0.038) ( Supplementary Table S2 ).

Common Transcriptional Regulators with Enriched Binding Sites among Altered CpG Gene Sets Associated with Environmental Contaminants

The gene sets representing altered CpG methylation associated with prenatal environmental exposures were analyzed for transcription factor binding site enrichment within the promoter regions of the genes using an in silico analysis. The gene sets were separated into hypermethylated and hypomethylated subsets. By analyzing hypermethylated genes separately from hypomethylated genes, it was possible to make in silico predictions of the relationship between transcription factors and gene-specific DNA methylation patterns. Transcription factor binding within promoter regions, upstream of the transcription start site (TSS), may inhibit DNMT from accessing a particular location resulting in hypomethylation of the gene ( Fig. 2 A). In contrast, a lack of transcription factor binding may allow DNMT access to a particular genomic location resulting in hypermethylation of a gene ( Fig. 2 B).
Figure 2

( a ) Environmental contaminants are known to trigger the activation of transcription factors as a mechanism of cellular defense. The binding of the transcription factor within promoter regions, upstream of the TSS, of genes may inhibit DNMT from access for methylation of a particular gene resulting in gene-specific hypomethylation. ( b ) In contrast, environmental contaminants may also inhibit the activity of transcription factors. The lack of the transcription factor binding within promoter regions of genes may allow DNMT access to a particular genomic location resulting in gene-specific hypermethylation.

( a ) Environmental contaminants are known to trigger the activation of transcription factors as a mechanism of cellular defense. The binding of the transcription factor within promoter regions, upstream of the TSS, of genes may inhibit DNMT from access for methylation of a particular gene resulting in gene-specific hypomethylation. ( b ) In contrast, environmental contaminants may also inhibit the activity of transcription factors. The lack of the transcription factor binding within promoter regions of genes may allow DNMT access to a particular genomic location resulting in gene-specific hypermethylation. Based on the nature of the enrichment analysis, gene sets containing fewer than seven sequences were not suitable for transcription factor binding site enrichment analysis. This resulted in the exclusion of two studies [ 23 , 27 ]. Additionally, there were insufficient sequences to conduct enrichment analysis associated with hypomethylated genes in three studies [ 16 , 30 , 31 ] and hypermethylated genes in two studies [ 29 , 32 ]. As a result of these exclusions, 11 of 14 studies, representing 341 of the 396 genes and five of the six contaminants remained for transcription factor binding-site enrichment analysis including the following contaminants: arsenic, cadmium, lead, mercury, and tobacco smoke. Through the transcription factor binding site enrichment analysis, we identified a total of 56 transcription factors represented within the 341 genes ( Fig. 3 , Supplementary Tables S1 and S3 ). In contrast to the gene-level analysis, the transcription factor enrichment results displayed much more overlap within and across contaminants. A total of 32 (57.1%) transcription factors were shared by at least two studies. The remaining 24 (42.9%) transcription factors were unique and found to be enriched in only one study. In addition to a larger proportion of overlapping transcription factors as compared to differentially methylated genes (57.1% vs. 3.03%), there were more commonalities within studies examining the same environmental contaminant. For example, of the four studies that focused on CpG methylation in relationship to arsenic, no common genes were identified. However, when all arsenic-associated genes were analyzed together, there were four major transcription factors identified, namely Kidney-enriched Krupple-like factor 15 (KLF15), Kruppel-like zinc finger protein 219 (ZNF219), Myeloid zinc finger protein (MZF1), and Zinc Finger and BTB domain containing 7 A, pokemon (ZBTB7) ( Supplementary Table S3 ). Additionally, when arsenic-associated CpG methylation was analyzed by study, there were eight common transcription factors identified. These were KLF15, ZNF219, Myc-associated zinc finger protein (MAZ), MYC-associated zinc finger protein-related transcription factor (MAZR), MZF1, Pleomorphic adenoma gene 1 (PLAG1), ZBTB7, and Zinc Finger Protein 263 (ZNF263). Similarly, between the two cadmium-associated CpG methylation studies, there were no genes that were common, but they shared binding sites for a common transcription factor, MZF1. The five studies that focused on tobacco smoke-associated CpG methylation identified four genes that were common to at least two studies, while the genes with altered CpG methylation were enriched for 17 transcription factors. These included Krueppel-type Zinc Fingers (KLF6), E2F transcription Factor 1 (E2F1), Ets-family member ELF2 (NERF1a), MAZ, MAZR, MZF1, PLAG1, Ras-response element binding protein 1 (RREB1), Stimulating protein 1 (SP1), Wilms Tumor Suppressor (WT1), ZBTB7, ZNF263, Zinc finger protein insulinoma-associated 1 (IA-1) functions as a transcriptional repressor (INZM1), Zinc finger transcription factor ZBP-89 (ZBP89), and Zinc finger, BED-type containing 4 (ZBED4) ( Supplementary Table S3 ).
Figure 3

A heat map of environmental contaminant-associated transcription factors ( n  = 56) associated with 341 genes across 11 studies. Data were analyzed in a study-dependent manner for the 11 studies with sufficient genes for enrichment analysis or in a contaminant-grouped manner. The colors indicate whether the transcription factor was enriched among genes that were hypermethylated (red), hypomethyated (blue), or both hyper- and hypomethylated in relationship to a contaminant (purple).

A heat map of environmental contaminant-associated transcription factors ( n  = 56) associated with 341 genes across 11 studies. Data were analyzed in a study-dependent manner for the 11 studies with sufficient genes for enrichment analysis or in a contaminant-grouped manner. The colors indicate whether the transcription factor was enriched among genes that were hypermethylated (red), hypomethyated (blue), or both hyper- and hypomethylated in relationship to a contaminant (purple). Interestingly, there were some transcription factors that were represented among the majority of studies. Binding sites for Kidney-enriched Kruppel-like factor, KLF15, were enriched within the gene sets associated with all five contaminants analyzed (arsenic, cadmium, lead, mercury, and tobacco smoke). KLF15 was enriched ( P value range:<0.0001–0.0411) in gene sets from 8 of 11 studies analyzed ( Supplementary Table S3 ). In addition, transcription factor binding sites for Pleomorphic adenoma gene 1, PLAG1, were also enriched ( P value range: <0.0001–0.425) in gene sets associated with all five contaminants and eight studies. Only one other transcription factor, myeloid zinc finger (MZF1), had binding sites that were enriched ( P value range: <0.0001–0.0283) within gene sets of eight studies and was associated with four contaminants (arsenic, cadmium, lead and tobacco smoke) ( Supplementary Table S3 ). Interestingly, 38 (65.5%) of the genes encoding for transcription factors identified in this analysis, including MZF1 and PLAG1, have glucocorticoid-responsive-related sequence elements in their promoter regions ( Supplementary Table S3 ). Taken together, these results provide in silico epigenome-level evidence that these environmental contaminants may trigger glucocorticoid-receptor (GR)-related signaling, resulting in the alteration of specific transcription factors such as KLF15. In addition to the transcription factors potentially controlled by the GR, others are likely GR-independent. The subsequent binding of the transcription factors to response elements in target genes is supported by an enrichment of these sites among the tested gene sets ( Supplementary Table S3 , Fig. 4 ). The differential binding/occupancy of the transcription factors between environmentally exposed individuals versus those with no/lower exposure may result in specific patterns of CpG methylation at target gene sites. Thus, the differential patterns of CpG methylation associated with environmental contaminants may in fact be an indicator of an “epigenetic environmental stress response” or “epigenetic ESR.”
Figure 4

Exposures to environmental contaminants, including arsenic, cadmium, mercury, lead, and tobacco smoke, are known to alter the expression of the GR pathway. The GR, a known responder to endogenous stress and transcription factor, can activate and/or inhibit the expression of multiple downstream transcription factors. These transcription factors may alter access to CpG sites resulting in differential gene-specific methylation status. The transcription factor occupancy theory describes a mechanism whereby CpG sites may be blocked through occupancy or made accessible by the absence of transcription factors. This occupancy may influence the gene-specific CpG methylation patterns observed in fetal cord leukocytes or placental tissue.

Exposures to environmental contaminants, including arsenic, cadmium, mercury, lead, and tobacco smoke, are known to alter the expression of the GR pathway. The GR, a known responder to endogenous stress and transcription factor, can activate and/or inhibit the expression of multiple downstream transcription factors. These transcription factors may alter access to CpG sites resulting in differential gene-specific methylation status. The transcription factor occupancy theory describes a mechanism whereby CpG sites may be blocked through occupancy or made accessible by the absence of transcription factors. This occupancy may influence the gene-specific CpG methylation patterns observed in fetal cord leukocytes or placental tissue.

Discussion

Across studies examining gene-specific CpG methylation related to prenatal exposure to environmental contaminants, it is currently unknown whether there are commonalities at the gene level. Furthermore, a biological mechanism underlying the gene-specific CpG patterning is unknown. To address this, in this study, first we set out to generate a database of gene-specific CpG methylation compiled from studies examining the epigenetic effects of prenatal environmental exposures assessed in cord blood leukocytes or placental tissue. The contaminants included in this study were arsenic, cadmium, lead, manganese, mercury, and tobacco smoke. This database enabled the in silico examination of promoter sequences to identify enriched transcription factor binding sites among genes with altered CpG methylation to support or refute the hypothesis of “transcription factor occupancy” as a mediator of gene-specific methylation [ 14–16 ]. At the gene level, we observed minimal overlap for studies examining the epigenetic effects even for the same contaminant, as well as those examining diverse environmental contaminants. In support of the “transcription factor occupancy” theory, there was much more commonality at the level of transcription factors where gene sets showed enrichment of binding sites for common transcription factors. Interestingly, the majority of genes with altered CpG methylation contained binding sites for KLF15, a known responder to GR signaling. Taken together, these data provide in silico support for gene-specific methylation patterns potentially reflective of an epigenetic ESR to different environmental contaminants. There was minimal overlap of genes with altered CpG methylation across studies examining the same contaminant or across contaminants. Specifically, of the 396 genes in the database, only 14 (or 3.53%) were shared between two or more studies. The greatest overlap was observed for studies of tobacco smoke where four genes were independently identified to have altered CpG methylation across three studies analyzing tobacco smoke-associated methylation. While there was little overlap at the gene level, there are known associations between the genes with altered CpG methylation and negative health outcomes associated with the environmental contaminants. For example, the genes with altered CpG methylation in relationship to tobacco smoke exposure, CYP1A1 and GFI-1, are associated with differences in susceptibility to lung cancer and acute myeloid leukemia, respectively [ 31 , 34 ]. DDAH2 has altered CpG methylation in relationship to prenatal smoke and lead and is an epigenetic marker for neural stem cell differentiation [ 35 ]. Both prenatal tobacco smoke exposure and lead have been previously associated with neurodevelopmental outcomes [ 36 , 37 ]. These data highlight prenatal exposure to environmental toxicants is associated with altered CpG methylation of target genes known to play a role in contaminant-related diseases. In support of a biological mechanism for gene-specific patterns of CpG methylation, the data presented here suggest that transcription factor occupancy may play a role. In prior studies, we examined transcription factor binding site enrichment among genes with altered CpG methylation in relationship to prenatal cadmium [ 16 ], arsenic [ 15 ], and preeclampsia [ 14 ]. In the present research, we expanded upon this to examine five contaminants across 13 studies for sequence-based enrichment of transcription factor binding sites and found that 56 transcription factors have enriched binding sites within the promoter regions of 341 compiled genes. The majority ( n  = 33, 58.9%) of these transcription factors were represented in gene sets identified from at least two studies, with three transcription factors (5.36%) represented in the majority of studies, namely KLF15, MZF1, and PLAG1. MZF1 is a tumor suppressor protein and controls cell proliferation and tumorigenesis [ 38 ]. This is interesting as arsenic, cadmium, and tobacco smoke are known carcinogens and were associated with CpG methylation of target genes of MZF1. The other major transcription factor with binding sites identified to be enriched in this analysis was PLAG1, representing an imprinted and developmentally-regulated gene [ 39 ]. Decreased expression of PLAG1 has been associated with fetal growth restriction, which is a common birth outcome for fetuses exposed to environmental contaminants in utero [ 40 ]. It is also an oncogene and associated with numerous cancers [ 39 ]. Thus, transcription factor binding sites identified using sequence-based enrichment analysis may provide clues regarding proteins that regulate environmental contaminant-induced DNA methylation patterns that could, in part, underlie later life disease. The most significant of the identified transcription factors was KLF15, with binding sites enriched within genes with altered CpG methylation upon exposure to five contaminants. KLF15 expression is upregulated in response to stress-responsive GR activation [ 41 , 42 ]. GR is a transcription factor responsive to a suite of environmental contaminants including those analyzed here namely, arsenic, cadmium, lead, mercury, and tobacco smoke [ 43 ] and has previously been shown to be a potential regulator of genes and proteins altered by arsenic and cadmium [ 44 ]. While not observed in the target genes themselves, the majority of identified transcription factors include binding sites for the GR supporting that differential patterns of CpG methylation associated with environmental contaminants may be an indicator of an epigenetic ESR. This ESR may be mediated through GR signaling, with subsequent CpG methylation marks on downstream target genes. In summary, the data from this study provide support for the transcription factor occupancy theory as a driver of environmentally altered CpG methylation patterning. While there was little overlap at the gene level, genes with altered CpG methylation in relationship to prenatal environmental contaminants share common binding sites for a set of transcription factors. These identified transcription factors have known relationships both to the contaminants of interest and associated diseases. These findings are important as they provide support for a hypothesis for a specific biological mechanism by which prenatal exposure to environmental contaminants may impart gene-specific footprints on the epigenome in both fetal blood and placental tissue. As this research utilizes in silico strategies to examine patterns of CpG methylation at the level of DNA sequence, future investigations can leverage these results to use targeted in vitro or in vivo strategies to examine the precise roles of the identified transcription factors. Taken together, these data suggest that transcription factors perturbed by environmental contaminants may be key regulators of the fetal epigenome.

Methods

Compilation of Studies Assessing Gene-Specific Changes in CpG Methylation Related to Prenatal Exposure to Environmental Contaminants

To identify studies for inclusion in the analysis, a literature search was conducted using PubMed (US National Library of Medicine and National Institutes of Health). Studies were included if they were carried out using biological samples collected from human subjects and if maternal biomarker measures of environmental contaminants were provided. Gene-specific CpG methylation was required to be assessed using the Illumina 450 k array or similar technology (Methylated CpG Island Recovery Assay (MIRA), Illumina 27 K, Illumina Golden Gate, etc.), in a relevant tissue (cord blood or placenta) for comparability between studies. To obtain relevant studies, combinations of applicable terms were queried. The first set of terms was “prenatal,” “ in utero ,” and “transplacental.” The second set of terms included both the general term “environmental contaminant” and a second set of specific contaminants of interest: “air pollution,” “arsenic,” “bisphenol A,” “cadmium,” “dioxin,” “endocrine disrupters,” “flame retardants,” “lead,” “manganese,” “mercury,” “metals,” “pesticides,” “phthalates,” “polyaromatic hydrocarbons”, and “tobacco smoke.” The final set of terms pertained to CpG methylation: “methylation,” “5-methylcytosine,” and “CpG.” These yielded a total of 408 studies of interest with a total of 14 studies suitable for inclusion. Studies were excluded from analysis if they focused on non-environmental toxicant exposures (medication, alcohol), were case–control designs, or were reviews. They were additionally excluded if they did not assess gene-specific CpG methylation in cord blood or placental samples.

Identifying Trends in Gene-Specific CpG Methylation across Studies of Diverse Environmental Contaminants

To examine trends in gene-specific CpG methylation associated with prenatal exposure to environmental contaminants, a database of 396 genes was compiled. These genes were identified from a total of 14 total studies analyzing six contaminants: arsenic, cadmium, lead, manganese, mercury, and tobacco smoke. The genes compiled represent those determined to be statistically significant by the study authors. Most studies employed regression analysis requiring a P value < 0.05 and a false discovery rate correction [ 16 , 23 , 24 , 29 , 30 , 32 ]. Others used cluster-based analysis coupled with regression models with P value selection and false discovery rate correction [ 26–28 , 33 ]. Rojas et al. [ 15 ] applied a multiple regression model ( P value < 0.05, q  < 0.1) followed by integration with functional changes in gene expression assessed using correlation analysis [ 15 ]. Broberg et al. [ 22 ] used regression analysis presenting the most significant regression P values. In addition to collecting all statistically significant sites for the analysis, the compiled genes were compared to the 500 probes identified by Houseman et al. [ 45 ]. These probes contained within the Infinium HumanMethylation450 BeadChip are representative of CpG methylation changes that can be used as surrogate measurements of changes in the underlying white blood cell population mixture [ 45 ]. The probes compiled in the present study were compared against the cell population-related probes to test whether the contaminant-associated changes were related to potential shifts in white blood cell population. Finally, any probes found in intergenic regions were excluded from the analysis. After compilation of the genes from each study, each of the genes was classified as either hypermethylated or hypomethylated. Different studies used different metrics of change, with some utilizing a β-difference (Δβ) [ 15 , 24 , 28 , 29 , 33 , 46 ]. Others were expressed as correlation coefficients [ 22 , 25 ] or partial correlation (β) coefficients [ 16 , 23 , 26 , 30 , 31 ]. A gene’s methylation status (i.e. hyper- or hypomethylation) in relationship to the environmental exposure was assigned based upon the majority of statistically significant probes. Thus, for each of the 14 studies, genes were classified as hyper- or hypomethylated in relationship to the environmental exposure resulting in the generation of 28 gene sets.

Identifying Biological Pathways Enriched among Genes with Altered CpG Methylation Associated with Prenatal Environmental Contaminants

Once the database of 396 genes was compiled, Database for Annotation, Visualization and Integrated Discovery (DAVID) functional annotation software (david.ncifcrf.gov) was used to conduct biological category-based enrichment analyses. Most of the summarized studies have published characterization of biological functions of the genes with altered CpG methylation. For this reason, the functional annotation tool using keyword gene ontology was used to determine enrichment of major biological functions for each study ( n  = 14). The P value determined by DAVID represents an absolute enrichment determined by a modified Fisher’s exact test [ 47 ].

Identifying Transcription Factor Binding Site Enrichment among Genes with Altered CpG Methylation Associated with Prenatal Environmental Contaminants

The enrichment of transcription factor binding sites within the gene sets was carried out using Genomatix software ( http://www.genomatix.de ) [ 48 ]. In addition to the individual study-based analysis, genes were also grouped and analyzed by contaminant, resulting in six additional gene sets for analysis. Using the Gene2Promoter module, the promoter region sequences, designated as −1000 bp/+500 bp of TSS, all genes within each of the gene sets were collected. Gene sets containing seven or fewer sequences were excluded as meaningful enrichment analysis could not be conducted on these gene sets. This resulted in 24 gene sets that were analyzed using the Common Transcription Factor Analysis Module. Core similarities of 95% and the presence of the consensus sequence in more than 50% of the analyzed genes were required. Conserved “core” sequences were queried across the promoter regions of all genes. These data were compared to a series of weight matrices, which are a summary of DNA sequence patterns in transcription factor binding sites. P values, representing the likelihood of obtaining an equal or greater number of sequences with a match in a randomly drawn sample of the same size as the input sequence set, were determined [ 49 ]. Significance was set at <0.05.

Supplementary data

Supplementary data are available at EnvEpig online. Supplementary Data Click here for additional data file.
  49 in total

1.  DNA-binding factors shape the mouse methylome at distal regulatory regions.

Authors:  Michael B Stadler; Rabih Murr; Lukas Burger; Robert Ivanek; Florian Lienert; Anne Schöler; Erik van Nimwegen; Christiane Wirbelauer; Edward J Oakeley; Dimos Gaidatzis; Vijay K Tiwari; Dirk Schübeler
Journal:  Nature       Date:  2011-12-14       Impact factor: 49.962

2.  Identification of genetic elements that autonomously determine DNA methylation states.

Authors:  Florian Lienert; Christiane Wirbelauer; Indrani Som; Ann Dean; Fabio Mohn; Dirk Schübeler
Journal:  Nat Genet       Date:  2011-10-02       Impact factor: 38.330

Review 3.  Epigenetic reprogramming in mammals.

Authors:  Hugh D Morgan; Fátima Santos; Kelly Green; Wendy Dean; Wolf Reik
Journal:  Hum Mol Genet       Date:  2005-04-15       Impact factor: 6.150

4.  The stereochemistry of benzo[a]pyrene-2'-deoxyguanosine adducts affects DNA methylation by SssI and HhaI DNA methyltransferases.

Authors:  Oksana M Subach; Diana V Maltseva; Anant Shastry; Alexander Kolbanovskiy; Saulius Klimasauskas; Nicholas E Geacintov; Elizaveta S Gromova
Journal:  FEBS J       Date:  2007-03-27       Impact factor: 5.542

Review 5.  PLAG1, the prototype of the PLAG gene family: versatility in tumour development (review).

Authors:  Frederik Van Dyck; Jeroen Declercq; Caroline V Braem; Wim J M Van de Ven
Journal:  Int J Oncol       Date:  2007-04       Impact factor: 5.650

6.  Gene body methylation of the dimethylarginine dimethylamino-hydrolase 2 (Ddah2) gene is an epigenetic biomarker for neural stem cell differentiation.

Authors:  Liselotte Bäckdahl; Marlis Herberth; Gareth Wilson; Peri Tate; Lia S Campos; Rene Cortese; Florian Eckhardt; Stephan Beck
Journal:  Epigenetics       Date:  2009-05-22       Impact factor: 4.528

7.  Cadmium exposure and the epigenome: Exposure-associated patterns of DNA methylation in leukocytes from mother-baby pairs.

Authors:  Alison P Sanders; Lisa Smeester; Daniel Rojas; Tristan DeBussycher; Michael C Wu; Fred A Wright; Yi-Hui Zhou; Jessica E Laine; Julia E Rager; Geeta K Swamy; Allison Ashley-Koch; Marie Lynn Miranda; Rebecca C Fry
Journal:  Epigenetics       Date:  2013-10-28       Impact factor: 4.528

Review 8.  Environmental chemicals and DNA methylation in adults: a systematic review of the epidemiologic evidence.

Authors:  Adrian Ruiz-Hernandez; Chin-Chi Kuo; Pilar Rentero-Garrido; Wan-Yee Tang; Josep Redon; Jose M Ordovas; Ana Navas-Acien; Maria Tellez-Plaza
Journal:  Clin Epigenetics       Date:  2015-04-29       Impact factor: 6.551

9.  Prenatal exposure to arsenic and cadmium impacts infectious disease-related genes within the glucocorticoid receptor signal transduction pathway.

Authors:  Julia E Rager; Andrew Yosim; Rebecca C Fry
Journal:  Int J Mol Sci       Date:  2014-12-03       Impact factor: 5.923

10.  Differential DNA methylation in umbilical cord blood of infants exposed to low levels of arsenic in utero.

Authors:  Devin C Koestler; Michele Avissar-Whiting; E Andres Houseman; Margaret R Karagas; Carmen J Marsit
Journal:  Environ Health Perspect       Date:  2013-06-11       Impact factor: 9.031

View more
  20 in total

1.  DNA methylation of imprinted genes in Mexican-American newborn children with prenatal phthalate exposure.

Authors:  Gwen Tindula; Susan K Murphy; Carole Grenier; Zhiqing Huang; Karen Huen; Maria Escudero-Fung; Asa Bradman; Brenda Eskenazi; Cathrine Hoyo; Nina Holland
Journal:  Epigenomics       Date:  2018-06-29       Impact factor: 4.778

Review 2.  Microorganisms in the Placenta: Links to Early-Life Inflammation and Neurodevelopment in Children.

Authors:  Martha Scott Tomlinson; Kun Lu; Jill R Stewart; Carmen J Marsit; T Michael O'Shea; Rebecca C Fry
Journal:  Clin Microbiol Rev       Date:  2019-05-01       Impact factor: 26.132

Review 3.  Effects of prenatal exposure to endocrine disruptors and toxic metals on the fetal epigenome.

Authors:  Paige A Bommarito; Elizabeth Martin; Rebecca C Fry
Journal:  Epigenomics       Date:  2017-02-17       Impact factor: 4.778

Review 4.  Epigenetic influence of environmentally neurotoxic metals.

Authors:  Omamuyovwi M Ijomone; Olayemi K Ijomone; Joy D Iroegbu; Chibuzor W Ifenatuoha; Nzube F Olung; Michael Aschner
Journal:  Neurotoxicology       Date:  2020-09-01       Impact factor: 4.294

Review 5.  Architecture in 3D cell culture: An essential feature for in vitro toxicology.

Authors:  Sophie A Lelièvre; Tim Kwok; Shirisha Chittiboyina
Journal:  Toxicol In Vitro       Date:  2017-03-30       Impact factor: 3.500

6.  Sexual epigenetic dimorphism in the human placenta: implications for susceptibility during the prenatal period.

Authors:  Elizabeth Martin; Lisa Smeester; Paige A Bommarito; Matthew R Grace; Kim Boggess; Karl Kuban; Margaret R Karagas; Carmen J Marsit; T Michael O'Shea; Rebecca C Fry
Journal:  Epigenomics       Date:  2017-02-17       Impact factor: 4.778

7.  Lasting Impact of an Ephemeral Organ: The Role of the Placenta in Fetal Programming.

Authors:  Lindsey Konkel
Journal:  Environ Health Perspect       Date:  2016-07-01       Impact factor: 9.031

8.  Placental CpG methylation of infants born extremely preterm predicts cognitive impairment later in life.

Authors:  Sloane K Tilley; Elizabeth M Martin; Lisa Smeester; Robert M Joseph; Karl C K Kuban; Tim C Heeren; Olaf U Dammann; T Michael O'Shea; Rebecca C Fry
Journal:  PLoS One       Date:  2018-03-07       Impact factor: 3.240

9.  Placental CpG methylation of HPA-axis genes is associated with cognitive impairment at age 10 among children born extremely preterm.

Authors:  C J Meakin; E M Martin; H P Santos; I Mokrova; K Kuban; T M O'Shea; R M Joseph; L Smeester; R C Fry
Journal:  Horm Behav       Date:  2018-03-05       Impact factor: 3.587

10.  Microorganisms in the human placenta are associated with altered CpG methylation of immune and inflammation-related genes.

Authors:  Martha Scott Tomlinson; Paige A Bommarito; Elizabeth M Martin; Lisa Smeester; Raina N Fichorova; Andrew B Onderdonk; Karl C K Kuban; T Michael O'Shea; Rebecca C Fry
Journal:  PLoS One       Date:  2017-12-14       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.