Literature DB >> 26913202

PON1 as a model for integration of genetic, epigenetic, and expression data on candidate susceptibility genes.

Karen Huen1, Paul Yousefi1, Kelly Street1, Brenda Eskenazi1, Nina Holland1.   

Abstract

Recent genome- and epigenome-wide studies demonstrate that the DNA methylation is controlled in part by genetics, highlighting the importance of integrating genetic and epigenetic data. To better understand molecular mechanisms affecting gene expression, we used the candidate susceptibility gene paraoxonase 1 (PON1) as a model to assess associations of PON1 genetic polymorphisms with DNA methylation and arylesterase activity, a marker of PON1 expression. PON1 has been associated with susceptibility to obesity, cardiovascular disease, and pesticide exposure. In this study, we assessed DNA methylation in 18 CpG sites located along PON1 shores, shelves, and its CpG island in blood specimens collected from newborns and 9-year-old children participating (n = 449) in the CHAMACOS birth cohort study. The promoter polymorphism, PON1-108 , was strongly associated with methylation, particularly for CpG sites located near the CpG island (P << 0.0005). Among newborns, these relationships were even more pronounced after adjusting for blood cell composition. We also observed significant decreases in arylesterase activity with increased methylation at the same nine CpG sites at both ages. Using causal mediation analysis, we found statistically significant indirect effects of methylation (β(95% confidence interval): 6.9(1.5, 12.4)) providing evidence that DNA methylation mediates the relationship between PON1-108 genotype and PON1 expression. Our findings show that integration of genetic, epigenetic, and expression data can shed light on the functional mechanisms involving genetic and epigenetic regulation of candidate susceptibility genes like PON1.

Entities:  

Keywords:  arylesterase; mediation; methylation; paraoxonase

Year:  2015        PMID: 26913202      PMCID: PMC4762373          DOI: 10.1093/eep/dvv003

Source DB:  PubMed          Journal:  Environ Epigenet        ISSN: 2058-5888


Background

Epigenetic modifications, including DNA methylation, chromatin modifications, and non-coding RNAs, are heritable and affect gene expression without a change in DNA sequence [1, 2]. Because of their impact on gene expression, there is a growing recognition that epigenetics may play an important role in key biological processes and mechanisms of disease development [3-5]. DNA methylation is an important epigenetic mechanism and is also the most commonly measured epigenetic mark. Available methodologies for the measurement of DNA methylation are relatively high throughput and cost-effective, allowing for its use in population studies. Interest in DNA methylation has grown rapidly in recent years resulting in the publication of numerous studies examining associations of DNA methylation with environmental exposures as well as adverse health outcomes in human populations [6-9]. In genetic epidemiology, concepts of genetic architecture such as the existence of haplotype blocks and linkage disequilibrium (LD) transformed the way in which genetic association studies were designed, analyzed, and interpreted [10]. Epigenome-wide data emerging from recent studies are now providing analogous information on the methylome that can further inform epigenetic analysis for population studies. For instance, relationships of methylation at CpG sites in the same gene are often performed as one CpG site at a time. However, recent studies have shown that methylation at nearby CpG sites can be highly correlated with each other, although over shorter distances than observed with single-nucleotide polymorphisms (SNPs) in the genome [11, 12]. Furthermore, methylation may be partially controlled by genetics [11, 12]. Therefore, to understand the molecular mechanisms explaining how DNA methylation can affect health through gene expression changes, it will be important in population studies to integrate data on DNA methylation with data on genetic polymorphisms and gene expression [13, 14]. Here, we use paraoxonase 1 (PON1) as a model for integrating different types of molecular data that can be used to understand the role of epigenetics on gene expression. PON1 is a multifunctional enzyme involved in oxidant defense [15], and its genetic variants and lower levels have been linked to adverse health outcomes including adverse neurodevelopment, cardiovascular disease, and obesity [16-22]. We previously studied PON1 variability in Mexican-American children and their mothers from the Center for Health Assessment of Mothers and Children of Salinas (CHAMACOS) birth cohort and found a much larger than previously known (>100-fold) range of levels and substrate-specific activities defined in part by age and genotype [23, 24]. This broad variability of PON1 within populations suggests differential susceptibility among individuals. PON1 sequencing and haplotype analyses in this cohort have shown that the PON1 promoter polymorphism (PON1; rs705379) is the strongest known predictor of PON1 gene expression and protein levels. Even so, it explains less than 25% of the variability of PON1 protein levels; furthermore, incorporation of other genetic variants explained <3% of additional variability. This means that other factors beyond genetics, including epigenetics, may contribute to modulation of PON1 gene expression. There are a total of 287 CpG sites located in the PON1 gene including one CpG island in the promoter region comprising 19 CpG sites (Fig. 1). Beyond the CpG island, there are an additional 66, 48, and 146 CpG sites within shores, shelves, and open sea regions, respectively. One recent study showed that a PON1 SNP located in a miRNA binding site (miR-616) was associated both with changes in PON1 expression and increased risk of ischemic stroke and carotid atherosclerosis [25]. These data underscore the vital influence of epigenetic marks like miRNA and DNA methylation on PON1 and demonstrate further the clinical significance of PON1 variability [26]. To our knowledge, few studies of PON1 epigenetics in relation to molecular phenotype have been reported [27, 28].
Figure 1.

CpG sites and SNPs in the PON1 gene. This map of PON1 spans chromosome 7 from coordinates 94 927 671 to 94 941 000 in (A) and 94 941 000 to 94 955 500 in (B). It shows all 287 CpG sites and the subset of CpG sites included in the 450K BeadChip assay. There is one CpG island in the non-coding region (B) and the distribution of SNPs across the gene is also shown

CpG sites and SNPs in the PON1 gene. This map of PON1 spans chromosome 7 from coordinates 94 927 671 to 94 941 000 in (A) and 94 941 000 to 94 955 500 in (B). It shows all 287 CpG sites and the subset of CpG sites included in the 450K BeadChip assay. There is one CpG island in the non-coding region (B) and the distribution of SNPs across the gene is also shown The purpose of this study is to determine the relationship of DNA methylation in the PON1 gene with PON1 genetic polymorphisms and with gene expression at the protein level in CHAMACOS children. In addition to expanding molecular characterization of PON1 variability to epigenetics, our data can serve as a model for integrating genetic, epigenetic, and expression data on candidate susceptibility genes.

Results

PON1 CpG Sites

We used the data from a 450K BeadChip array to assess CpG sites located in the PON1 gene in blood specimens collected from 449 children. Samples were assessed at two time points, once at birth (n = 378) and again at the age of 9 years (n = 247). The 450K Bead Chip array included 18 of the 287 PON1 CpG sites, which are described in Table 1 and shown in Fig. 1. The majority of CpG sites interrogated were located in the promoter region and included several sites in the CpG island (n = 4), shores (n = 9), and shelves (n = 1).
Table 1.

Summary of PON1 CpG sites assessed by 450K Beadchip

CpG site numberIllumina 450K IDCoordinatesaRelation to CpG islandDistance from PON1-108 SNP
1cg2279873794 927 713Open sea26 188
2cg0187989394 928 193Open sea25 709
3cg0941620394 928 249Open sea25 652
cg05433222b94 948 020Open sea5881
4cg2406257194 950 658N_Shelf3244
5cg0740448594 953 654N_Shore248
6cg0534268294 953 680N_Shore221
7cg0415528994 953 771Island131
8cg1967839294 953 811Island91
9cg2185620594 953 878Island24
10cg1733025194 953 956Island55
11cg0187486794 954 059S_Shore158
12cg2011979894 954 144S_Shore243
13cg0487113194 954 202S_Shore301
14cg2305577294 954 438S_Shore537
15cg0780936994 954 455S_Shore554
cg17020263b94 955 053S_Shore1152
16cg1588728394 955 348S_Shore1447

aReference hg 19, Chromosome 7.

bThese CpG sites contained SNPs, so their methylation data were not included.

Summary of PON1 CpG sites assessed by 450K Beadchip aReference hg 19, Chromosome 7. bThese CpG sites contained SNPs, so their methylation data were not included.

PON1 Methylation

Levels of DNA methylation in newborns and 9-year-old children are shown graphically in Fig. 2. At both ages, average methylation levels (expressed as M values, see Methods) ranged from 2.1 to 3.4 among sites (Sites 1–4 and 14–16) that were further away from the transcription start site (TSS) and were much lower (0.03–2.11) among sites closer in proximity to the TSS (Sites 5–13). At the majority of individual CpG sites, methylation levels were slightly higher at the age of 9 years; these differences by age were no longer statistically significant after adjusting for cell composition in generalized estimating equation models with the exception of one CpG site (Site 4, cg 24062571). For Site 4, we observed slightly higher methylation at the age of 9 years, and this difference persisted after adjusting for cell composition.
Figure 2.

Methylation box plots in newborns (n = 378) and 9-year-old (n = 247) children. DNA methylation was measured using 450K BeadChip. The box plots show methylation levels at 16 CpG sites in the PON1 gene in (A) umbilical cord blood and (B) 9-year-old blood. CpG site numbers correspond to those listed in Table 1

Methylation box plots in newborns (n = 378) and 9-year-old (n = 247) children. DNA methylation was measured using 450K BeadChip. The box plots show methylation levels at 16 CpG sites in the PON1 gene in (A) umbilical cord blood and (B) 9-year-old blood. CpG site numbers correspond to those listed in Table 1 It is well documented in genetic epidemiology that SNPs in close proximity to each other are highly correlated with each other, forming LD blocks that tend to be inherited together [10]. Although less is known about epigenetic architecture, there is a growing understanding that nearby CpG sites may also be correlated with each other [11, 12]. Therefore, we examined correlations among PON1 CpG sites in newborns and in 9-year olds (Fig. 3). Among CpG sites in umbilical cord blood, we identified two methylation blocks (see Methods for definition) containing clusters of highly correlated sites (Fig. 3A). Block 1 contained sites in close proximity to the TSS (Sites 5–13) and spanned 350 base pairs; we will refer to this as the central cluster of PON1 CpG sites or Block 1. Correlation coefficients (r) for CpG site pairs in Block 1 ranged from 0.53 to 0.98. Block 2 (Sites 14–16) spanned 1000 base pairs in length and correlation coefficients ranged from 0.55 to 0.85. Interestingly, sites from Block 2 were also correlated with Sites 1–4 (r ranged from 0.42 to 0.88), and we subsequently refer to these sites (Block 2 and Sites 1–4) as the outer cluster of CpG sites. Among 9-year-old children (Fig. 3B), we observed only one block consisting of Sites 5–13 with correlation coefficients ranging from 0.76 to 0.98. Although sites were again correlated with each other among Sites 14–16, their correlations were not high enough to meet the definition of a methylation block (r2 < 0.4 for most CpG site pairs).
Figure 3.

Correlation plots of PON1 CpG sites in (A) umbilical cord blood and (B) 9-year-old blood. The colors of the squares represent the correlation between CpG sites (r). Squares are white/yellow if CpG sites are not correlated (r = 0) and red if they are strongly correlated (r ∼ 1). Similar to the concept of LD between SNPs, CpG sites within close proximity of each other tend to be correlated with each other. Here, we identified two methylation blocks (Sites 5–13 and 14–16) where methylation levels were strongly correlated with each at both ages. Block 1 is referred to as the inner cluster of CpG sites, whereas Sites 1–4 and 14–16 are referred to as the outer cluster of CpG sites. CpG site numbers correspond to those listed in Table 1

Correlation plots of PON1 CpG sites in (A) umbilical cord blood and (B) 9-year-old blood. The colors of the squares represent the correlation between CpG sites (r). Squares are white/yellow if CpG sites are not correlated (r = 0) and red if they are strongly correlated (r ∼ 1). Similar to the concept of LD between SNPs, CpG sites within close proximity of each other tend to be correlated with each other. Here, we identified two methylation blocks (Sites 5–13 and 14–16) where methylation levels were strongly correlated with each at both ages. Block 1 is referred to as the inner cluster of CpG sites, whereas Sites 1–4 and 14–16 are referred to as the outer cluster of CpG sites. CpG site numbers correspond to those listed in Table 1

Relationships of PON1 Genotype with PON1 Methylation

In agreement with other studies [29, 30] we previously found the promoter polymorphism at position -108 (PON1) to be the most significant predictor of PON1 expression as measured by arylesterase (AREase) activity in the CHAMACOS cohort [24, 31]. As methylation can also affect gene expression, we examined whether this promoter SNP was associated with methylation at the 16 CpG sites assessed in CHAMACOS newborns and 9-year olds. Figure 3 shows the differences in methylation by PON1 genotype. The methylation density plots demonstrate little to no difference in methylation in the outer cluster sites (Fig. 4A and C). However, among sites in methylation Block 1 (Fig. 4B), there is a clear separation by genotype, providing evidence of allele-specific methylation (ASM). Those with the CC genotype, which has been associated with higher levels of gene expression [30], had the lowest levels of methylation, whereas those with the CT genotype had intermediate levels of methylation and those with TT genotypes had the highest levels of methylation. Furthermore, variance was larger within the TT group compared with the CT and CC groups. Similar patterns were observed in 9-year-old children (data not shown).
Figure 4.

Methylation density plots by PON1 genotype (CC-pink CT – green TT –purple). Each plot shows density by methylation M-value at eight different CpG sites, from different regions of PON1. Plots in Row A are CpG Sites 1 and 4, respectively. Plots in Row B are from methylation Block 1 (Sites 5, 8, 11, and 13). Plots in Row C show Sites 14 and 16 from methylation Block 2. We include plots from just a few CpG sites that are representative of other CpG sites in the same region. Among CHAMACOS newborns with methylation data (n = 271), 83 were CC, 138 were CT, and 50 were TT. Sites from methylation Block 1 show a distinct separation by PON1 genotype

Methylation density plots by PON1 genotype (CC-pink CT – green TT –purple). Each plot shows density by methylation M-value at eight different CpG sites, from different regions of PON1. Plots in Row A are CpG Sites 1 and 4, respectively. Plots in Row B are from methylation Block 1 (Sites 5, 8, 11, and 13). Plots in Row C show Sites 14 and 16 from methylation Block 2. We include plots from just a few CpG sites that are representative of other CpG sites in the same region. Among CHAMACOS newborns with methylation data (n = 271), 83 were CC, 138 were CT, and 50 were TT. Sites from methylation Block 1 show a distinct separation by PON1 genotype Regression models of PON1 genotype with methylation (Table 2) also showed the same trends. We found very strong positive associations of the number of PON1T alleles with methylation levels, particularly those in methylation Block 1 (P values ranged from 1.34E-11 to 2.73E-30). For instance, on average, children with the CT genotype had methylation M values that were 0.63 units higher than CC kids at CpG Site 8 located in the CpG island. These associations remain significant even after Bonferonni correction for multiple testing. For Block 1 sites, the variation of methylation explained by PON1 genotype as determined by r2 of the regression models ranged from 17 to 39% in cord bloods. As methylation profiles can differ by cell type in blood, we also examined the relationship of PON1 genotype adjusting for cell composition in a subset of 82 newborns with differential cell count data (Supplementary Table S1). Results were very similar to unadjusted models; however, the variance of methylation explained by PON1 genotype doubled for Block 1 sites (38–76%). The same but even stronger trends were also seen in 9-year-old children (P values for Block 1 sites ranged from 6.28E-33 to 7.55E-50 and r2 ranged from 0.43 to 0.68); models adjusting for cell composition were very similar to unadjusted models (Table 2 and Supplementary Table S1).
Table 2.

Association of PON1 genotype with DNA methylation in blood in newborns and 9-year olds

Newborns
9-Year Olds
CpG SiteCpG site idRelation to islandNBeta (95% CI) P N Beta (95% CI) P
1cg22798737Open sea271−0.06(−0.14,0.03)0.202180.00(−0.07,0.06)0.97
2cg01879893Open sea271−0.04(−0.12,0.03)0.232180.00(−0.07,0.07)0.90
3cg09416203Open sea271−0.08(−0.15, −0.01)0.03218−0.03(−0.11,0.05)0.42
4cg24062571N_Shelf271−0.05(−0.12,0.02)0.18218−0.01(−0.07,0.06)0.86
5cg07404485N_Shore2710.32(0.24,0.39)8.28E-152180.51(0.45,0.58)6.58E-37
6cg05342682N_Shore2710.32(0.25,0.40)5.00E-152180.52(0.45,0.58)5.54E-35
7cg04155289Island2710.33(0.24,0.42)1.34E-112180.59(0.51,0.67)6.28E-33
8cg19678392Island2710.63(0.54,0.73)4.13E-302180.92(0.82,1.02)5.73E-45
9cg21856205Island2710.26(0.22,0.30)2.73E-302180.35(0.31,0.39)4.71E-46
10cg17330251Island2710.63(0.52,0.74)2.15E-252180.95(0.85,1.04)4.56E-48
11cg01874867S_Shore2710.59(0.49,0.69)8.59E-252180.92(0.82,1.01)7.55E-50
12cg20119798S_Shore2710.38(0.30,0.47)2.14E-162180.69(0.61,0.77)3.77E-40
13cg04871131S_Shore2710.26(0.17,0.34)8.04E-092180.52(0.44,0.61)1.55E-24
14cg23055772S_Shore271−0.01(−0.06,0.05)0.762180.00(−0.05,0.05)0.86
15cg07809369S_Shore2710.05(−0.04,0.13)0.312180.20(0.12,0.28)1.29E-06
16cg15887283S_Shore271−0.06(−0.14,0.03)0.18218−0.02(−0.09,0.05)0.64

CI, confidence interval.

a PON1 genotype was coded as 0, 1, and 2 T alleles for CC, CT, and TT children, respectively.

Association of PON1 genotype with DNA methylation in blood in newborns and 9-year olds CI, confidence interval. a PON1 genotype was coded as 0, 1, and 2 T alleles for CC, CT, and TT children, respectively. Beyond the PON1 SNP, we also examined associations of PON1 methylation sites with an additional panel of 40 SNPs and indels distributed throughout the PON1 gene and 4 SNPs located in PON2 and PON3 in a smaller subset of 214 newborns and 157 9-year-old children (Supplementary Tables S2 and S3). As expected, methylation in Block 1 CpG sites were associated with additional SNPs located in the promoter region. These SNPs were all in strong LD with PON1. We also observed weaker but significant associations of several other SNPs with methylation of Block 1 CpG sites, including several located further downstream (>2 kb) of PON1 in the PON2 and PON3 genes and two SNPs that were in strong LD with PON1, a common coding SNP.

Relationships of PON1 Methylation with AREase Activity

Using AREase as a marker of gene expression at the protein level, we examined associations of PON1 methylation at each of the 16 CpG sites with AREase activity (Fig. 5A, Supplementary Table S4). At both ages, methylation at individual Block 1 CpG sites was significantly associated with AREase activity, and this relationship was noticeably stronger at the age of 9 years (Table 2, Fig. 5B). This may be due to the higher mean, range, and variance of AREase activity observed in 9-year-old children compared with newborns who have very low levels of PON1 protein levels at birth. Effects of cell composition may also contribute to this difference. After adjustment for cell composition in a subset of cord bloods, we observed trends in the same direction but the magnitude of the associations increased and P values were decreased. Results were essentially unchanged in 9-year-old children suggesting that the influence of cell composition on these associations was much stronger in cord blood (Supplementary Table S5).
Figure 5.

Relationship of methylation with AREase activity. The plot shows regression P values for modelling the association of methylation at each individual CpG Site with AREase activity in (A) newborns and (B) 9-year olds. The solid line indicates a P value of 0.05 and the dotted line represents a P value of 0.01

Relationship of methylation with AREase activity. The plot shows regression P values for modelling the association of methylation at each individual CpG Site with AREase activity in (A) newborns and (B) 9-year olds. The solid line indicates a P value of 0.05 and the dotted line represents a P value of 0.01 To consider the methylation at multiple correlated CpG sites in one model, we also used PCs analysis. Two PCs explained >95% of the variability of methylation in Block 1, and these PCs were included in the models of AREase activity (Table 3). Both PCs describing cord blood methylation from Block 1 (PC1-1 and PC1-2) were significantly associated with cord AREase activity. In 9-year-old children, only PC1-1 was significantly associated with AREase activity.
Table 3.

PCs regression models of DNA methylation in blood with AREase activity in children

Newbornsa
9-Year oldsa
estimateSE P r 2 estimateSE P r 2
Block 1 CpG sites pc1-11.310.620.0355.841.092.78E-07
pc1-2−8.192.610.0020.06−8.206.100.1810.15

SE, standard error.

aThe newborn model examines association of cord blood methylation with cord AREase activity. The 9-year model examines the association of 9-year-old blood methylation with 9-year AREase activity.

PCs regression models of DNA methylation in blood with AREase activity in children SE, standard error. aThe newborn model examines association of cord blood methylation with cord AREase activity. The 9-year model examines the association of 9-year-old blood methylation with 9-year AREase activity.

Mediation of PON1 Genotype on AREase Activity by PON1 Methylation

Because PON1 was the SNP most strongly associated with methylation and expression and prior studies suggest epigenetics can at times mediate effects of genetics on expression, we used the PARAMED module [32, 33] to determine to what extent PON1 methylation mediates the effects of PON1 genotype on AREase activity (Table 4). PARAMED relies on causal inference methodologies to estimate the natural direct effect and NIE in the presence of exposure-mediator interactions. We limited this analysis only to the PCs that were significantly associated with AREase activity (PC1-1 and 2 for newborns and PC1-1 for 9-year olds). Among newborns, there was a significant interaction between PON1 genotype and both PCs of Block 1 methylation. Furthermore, the NIE of the first PC of Block 1 methylation (PC1-1) on expression was statistically significant (β(95% confidence interval) = 6.91(1.45,12.37); P = 0.013) providing evidence that methylation in Block 1 mediates the effect of PON1 genotype on AREase activity. The NIE for PC1-2 was not statistically significant, suggesting that some portion of the relationship between PON1 genotype and AREase is not driven by methylation. Among 9-year olds, we did not observe a significant interaction of genotype and methylation on AREase activity. Additionally, the NIE for PC1-1 did not reach statistical significance. Overall, our mediation analyses suggest that at least in newborns, PON1 methylation in Block 1 mediates the association of PON1 genotype on AREase activity.
Table 4.

Mediation analysis: total, direct, and indirect effect

Newborns
9-Year olds
PC1-1
PC1-2
PC1-1
Estimate (95% CI) P Estimate (95% CI) P Estimate (95% CI) P
Total effect16.3(−22.1, −10.5)<0.0005−15.6(−21.5, −9.7)<0.0005−30.6(−42.9, −18.3)<0.0005
Natural direct effect−23.2(−30.8, −15.6)<0.0005−19.1(−26.3, −11.9)<0.0005−15.6(−35.3,4.19)0.123
NIE6.9(1.5,12.4)0.0133.5(−1.8,8.9)0.195−15.0(−30.9,0.83)0.063

CI, confidence interval.

Mediation analysis: total, direct, and indirect effect CI, confidence interval.

Discussion

In this study, we used PON1 as a model for integration of genetic and epigenetic data and their relationship with gene expression. We expanded our molecular studies of PON1 to DNA methylation at 16 CpG sites in a large cohort of young Mexican-American children from the Salinas Valley, CA. We found evidence of allele-specific methylation between the PON1 SNP and methylation sites belonging to methylation Block 1 surrounding the promoter region, demonstrating a genetic contribution to DNA methylation. Furthermore, methylation levels at this Block 1 were strongly associated with AREase activity, a measure of PON1 protein quantity in blood. Mediation analyses provided evidence that methylation can act as a mediator of the PON1 SNP on AREase activity To our knowledge, only one other study has examined associations of PON1 methylation with AREase activity. Similar to our results, de la Iglesia et al. [28] also reported an inverse relationship between methylation at CpG Sites 5–8 and AREase activity in white blood cell specimens collected from 47 obese adults. Our study, relying on a much larger sample size (n = 449) demonstrated that similar relationships also exist at birth and during childhood and found that some of these relationships may be driven by genetics. Few if any studies that have focused on PON1 have examined associations of genetic polymorphisms with PON1 methylation. However, we were able to extract publicly available data on PON1 from a handful of studies that have taken genome- and epigenome-wide approaches to explore patterns of methylation and relationships with genetics. The same methylation block that we identified in CHAMACOS children (Sites 4–12, Block 1) was also identified as a methylation cluster in two other studies, one a study of whole blood specimens from adults (n = 247) and the other a smaller study of fetal (n = 14) and adult (n = 181) liver samples [11, 34]. In line with our findings, the study of liver specimens demonstrated inverse correlations between PON1 methylation at Sites 4–12 and PON1 gene expression (mRNA) [34] and also observed significant associations of PON1 genotype with methylation at the same sites. Our study showed that similar relationships can be observed in children’s blood specimens and when we consider PON1 expression at the protein level (AREase activity)—the final step of the flow of genetic information described in the central dogma of molecular biology. There is growing interest in the phenomenon of ASM such as that observed between PON1 methylation in the promoter region and PON1 genotype. ASM is a hallmark feature of imprinted genes but recent epigenome-wide studies have shown that the most prevalent types of ASM are those where genotype affects DNA methylation in cis among non-imprinted genes [12, 35, 36]. A recent study in blood specimens reported a prevalence of ASM that was cis in nature in 8.1% of heterozygous SNPs and that 22% of the genes exhibiting ASM also displayed allele-specific gene expression, as we observed with PON1 [36]. ASM is a particularly interesting phenomenon because it provides a functional link between genetic polymorphisms and phenotypic differences beyond nonsynonymous SNPs in coding regions [35]. Furthermore, it has been suggested that regions characterized by ASM may be more susceptible to environmental influences [37]. Mechanisms regulating methylation patterns in promoter regions are not well understood although several models through which regulation may occur have recently been proposed [38]. The strong association of the PON1 genotype with both methylation and expression is particularly interesting because it serves as an example of genetic regulation of methylation and expression at a transcription factor binding site. Previous studies have demonstrated that this polymorphism is located in a specificity protein 1 (Sp1) binding site [30, 39], which likely activates transcription via methylation. Sp1 binding blocks DNA methyltransferases (DNMTs) from accessing the promoter, leading to lower methylation levels and thus more transcription [38, 40]. The Sp1 binding site at position 108 can be disrupted by the T allele, which in CHAMACOS children was associated both with increased methylation levels and lower expression as measured by AREase activity. Furthermore, results from our mediation analyses suggest that the effect of the PON1 polymorphisms on expression is mediated in part by methylation at methylation Block 1. Although we observed very strong associations between the PON1 genotype, DNA methylation, and AREase activity in a large cohort of CHAMACOS children, this study has some limitations. We used AREase activity, a measure of protein levels in blood, as a proxy for gene expression. It is possible, however, that AREase may be more reflective of liver expression as PON1 present in plasma is primarily synthesized in the liver [41]. Relationships observed between blood methylation and AREase activity were strikingly similar to those reported between methylation and mRNA expression in blood and liver samples, possibly because ASM events may not be tissue specific. One study recently observed a concordance of ASM between tissue types and hypothesized that this may be due either to “shared gene regulatory events that occur early in development or an inherent property of DNA sequence that directly affects the propensity of DNA methylation” [36]. Additionally, as the 450 K BeadChip array assays only a select number of CpG sites throughout the methylome, we were only able to interrogate 18 of the 287 CpG sites located in the PON1 gene. Although the coverage of sites in and around the promoter region was quite good, future studies would benefit from a more complete assessment of the PON1 CpG sites across the entire gene using a higher resolution methodology like MethylSeq. Another potential source of bias is related to missingness of biological specimens, which is common in cohort studies utilizing pediatric blood samples. Only 65–71% of participants included in this study had adequate blood volumes for AREase analysis. This could limit generalizability of the relationships that we observed. When we compared children included in our study with all other children in the cohort, however, we did not find that they differed by significant demographic or exposure factors. Finally, our data on differential cell count in cord blood specimens were limited to only a subset of participants. It is still possible that some amount of residual confounding was not accounted for in the models that did not adjust for cell composition. Since adjusting for cell composition did not substantially change results, however, the unmeasured residual confounding is likely small.

Conclusions

Recent genome- and epigenome-wide studies are providing new data showing emerging patterns of a complex interplay between genetics and epigenetics on gene expression that may affect health [11, 12]. Here, we applied some of the new understanding of epigenomic architecture, relationships with genetic polymorphisms, and mediation analysis using PON1 as a model. We found strong evidence of ASM between the promoter polymorphism PON1 and methylation in a cluster of CpG sites surrounding the TSS that was also associated with PON1 protein levels. Our results demonstrate that integrating genetic, epigenetic, and expression data can provide evidence of functional mechanisms involving genetic and epigenetic regulation of candidate susceptibility genes.

Methods

Study Subjects

The CHAMACOS study aims to examine the effects of pesticides and other environmental exposures in a population of pregnant women and children living in the agricultural Salinas Valley, CA. Women were eligible for enrollment if they were at least 18 years of age, at less than 20 weeks’ gestation, Spanish or English speaking, eligible for low-income health insurance, receiving prenatal care at one of the local community clinics, and planning to deliver at the public hospital. Six hundred and one pregnant women were enrolled in 1999–2000 and 526 delivered live born singleton newborns [42]. CHAMACOS women were interviewed by bilingual, bicultural interviewers near the end of the first (∼13 weeks’ gestation) and second (∼26 weeks’ gestation) trimesters of pregnancy. Information was obtained on sociodemographic characteristics, mother’s reproductive and medical history, prenatal lifestyle exposures, diet, occupational and residential history, exposures to pesticides and other environmental chemicals, and housing quality. Methylation of PON1 CpG sites was measured in blood samples collected from children at delivery (umbilical cord blood representing fetal blood) and when they were 9-year old (mean ± SD = 9.3 ± 0.3 years). Our study sample included a total of 449 children who had DNA samples available for PON1 genotyping in addition to methylation analysis at birth and/or at the age of 9 years. Of these children, 176 had samples available at both time points, 202 had samples only at birth, and 71 had samples at the age of 9 years only. Furthermore, among those children with DNA samples available, 247 and 176 had adequate blood volumes available for analysis of arylesterase (AREase) activity at birth and age of 9 years, respectively. For both time points (birth and 9 years), children included in the study did not differ from all children in the cohort by other demographic and exposure variables (e.g., poverty level, marriage status, type of work during pregnancy, alcohol and smoking intake during pregnancy, prenatal exposure to DDT/DDE and PBDEs). Study protocols (2010-01-620 and 2010-03-949) were approved by the University of California, Berkeley Committee for Protection of Human Subjects. Written informed consent was obtained from all mothers, and assent was provided by the children at the 9-year assessment.

Blood Collection and Processing

Blood specimens were collected from the umbilical cords of CHAMACOS children after delivery and by venipuncture when children were approximately 9 years old. Heparinized whole blood was collected in BD vacutainers® (Becton, Dickinson and Company, Franklin Lakes, NJ), centrifuged, divided into plasma, buffy coats and red blood cells, and stored at −80°C at the School of Public Health Biorepository, University of California, Berkeley. Stringent conditions in accordance with the Best Practices for Biorepositories were followed [43].

Determination of PON1 Genotypes

DNA was isolated from blood clots as described previously [24]. In total, 39 genetic variants in PON1 and 2 SNPS each in PON2 and PON3 were genotyped. The promoter SNP, PON1, was genotyped using a fluorogenic allele-specific assay (Amplifluor, Chemicon, Temecula, CA). It required a two-part nested polymerase chain reaction strategy, in which the region surrounding the SNP was pre-amplified using non-allelic flanking primers and then the amplicon was diluted and used as the template for the Amplifluor assay. The coding polymorphisms, PON1 and PON1 as well as promoter SNP, PON1 were genotyped using the Taqman real-time polymerase chain reaction method. Briefly, primers for the nucleotide sequence flanking the SNP and probes specific for the SNP were custom designed by Applied Biosystems, Inc. (Foster City, CA). The remaining 39 SNPs and indels were genotyped using the multiplex platform iPlex (Sequenom, San Diego, CA). Quality assurance procedures for genotyping of these PON1 SNPs included assessment of randomly distributed blank samples in each plate and duplicates of randomly selected samples with independently isolated DNA from the same subjects. Repeated analysis (4% of samples) in several runs showed a high degree (>99%) of concordance. All discrepancies were resolved with additional genotyping.

Determination of PON1 AREase Activity Levels

AREase activity in plasma samples was measured by determining the rate of phenyl acetate hydrolysis using spectrophotometric methods as described previously [44]. A strong correlation (r > 0.85) between measured PON1 quantity and AREase activity have been demonstrated by ELISA and Western blot-based methods utilizing PON1 antibodies [45, 46], making the AREase assay a reliable measure of PON1 enzyme quantity. All assays were performed in triplicate. Quality assurance included use of internal controls (aliquots of the same sample run on all assay plates), assessment of repeat samples (separate aliquots of the same sample run on different days), and concurrent analyses of specimens from different collections (samples from different time points run on the same plates) [47]. The average coefficient of variation (CV) for repeated samples was 8.5% and the correlation coefficient between repeated runs was 0.94. The average CV for internal controls samples, a measure of inter-assay variability, was 8.7%.

Bisulfite Treatment and Methylation Analyses

DNA was previously isolated from clots using a QIAamp Blood DNA Maxi kit (Qiagen, Inc., Santa Clarita, CA), normalized to 55 µg/ml. Zymo Bisulfite conversion Kits (Zymo Research, Orange, CA) were used to bisulfite convert 1ug aliquots of DNA. Methylation levels at 18 PON1 CpG sites were analyzed as part of a genome-wide methylation assessment using the Illumina Infinium 450k DNA methylation BeadChip. DNA samples were whole-genome amplified, enzymatically fragmented, purified, and applied to the Infinium 450k BeadChips according to the Illumina methylation protocol [48, 49]. BeadChip processing was performed using robotics, and the Illumina Hi-Scan system was used for analysis. Samples included in the analysis had detection P values below 0.01 for 95% of CpG sites. Poor performing CpG sites with P value > 0.01 were excluded. Two PON1 CpG sites mapping to SNPs in the Illumina annotation were also excluded resulting in a total of 16 PON1 CpGs that were included in the subsequent analyses. Raw signal intensities were background corrected and then normalized for color-channel bias using the all sample mean normalization method as described previously by Yousefi et al. [50]. We also applied a second normalization algorithm, beta mixture quantile normalization, to make interpretation between type I and type II probes comparable [51]. Methylation data were expressed as M values, which are calculated as the log2 ratio of the intensities of methylated probe to unmethylated probe [52]. Negative M values therefore indicate that the unmethylated form of a CpG site is more abundant than the methylated form. Stringent quality control criteria were applied for handling of all samples and DNA methylation data. The quality assurance procedures included use of repeats and internal standards to minimize technical variability. Additionally, samples from both age groups were randomly distributed among different BeadChips to minimize experimental bias and batch effects.

Differential Cell Count

To examine the relationship of blood cell composition with DNA methylation in PON1 CpG sites, we performed differential cell counts in a subset of umbilical cord blood samples (n = 82) as described previously [53]. To prepare heparinized whole blood smears, we used the “gold standard” Wright-Push blood smearing technique [54] followed by staining utilizing a DiffQuik® staining kit. Slides were fixed for 15 minutes at 23°C and were then stained in basophilic dye and eosinophilic dye for 5 seconds each and washed after each staining period. Slides were scored under light microscopy (Zeiss Axioplan) with a magnification lens of 1000x and oil immersion. At least 100 cells were scored for each slide, and a percentage of each cell type (lymphocytes, monocytes, neutrophils, eosinophils, and basophils) was used for data analysis. To ensure consistency and reproducibility of scoring, 100 cells were scored in sets of 3 (3*100 = 300) for a subset of 35 samples. The CV for the repeat scoring in this subset was less than 10%.

Minfi Cell Count Estimation

Whole blood smears were not available for differential cell count in 9-year-old CHAMACOS children. Instead for these children, we used the Bioconductor R package minfi (v1.10.2) [55] to estimate the distribution of six (CD8+ T and CD4+ T lymphocytes, CD56+ natural killer cells, CD19+ B cells, CD14+ monocytes, and granulocytes) different white blood cell types based on their methylation signatures in 450K data. We did not use minfi estimates for cell composition in cord blood samples because as we recently described in Yousefi et al. [53] In Press, proportions of white blood cells in newborns are significantly different from the adult reference samples on which minfi estimates are based [56, 57]. We also found that unlike newborns, minfi estimates of cell composition in older children were much more comparable to those determined by differential cell count. This corresponds with published age-specific reference values [56, 57], which show that compared with adults, blood cell composition in newborns and young children is quite different but begin to resemble adult levels by ages 9–12 years. For comparison of cell type composition in cord bloods to those estimated by minfi in 9-year-old bloods, we use proportions of lymphocytes, granulocytes, and monocytes. For minfi estimates, this required summation of the frequencies for CD8+ T, CD4+ T, natural killer cells, and B cells to calculate the proportion of lymphocytes. For differential cell count, proportions of neutrophils, eosinophils, and basophils were summed to give an estimate of granulocytes.

Statistical Analysis

We used generalized estimating equations to determine the association of age with methylation levels at PON1 CpG, adjusting for batch (coded as categorical variable) and cell composition. For cell composition, we included proportion of monocytes and granulocytes in the model, using the proportion of lymphocytes as baseline. Correlations of methylation levels at different CpG sites were determined by calculation of Pearson’s correlation coefficients (r). LD methylation blocks were established based on several criteria slightly modified from Shoemaker et al. [12] and Liu et al. [11]: (i) they had to contain at least 3 contiguous CpG sites and (ii) at least 50% of the CpG site pairs had to have methylation levels that were highly correlated with each other (r2 > 0.4). To determine the relationship between genetic polymorphisms and methylation, we constructed linear regression models where genotype expressed as the number of minor alleles (0, 1, or 2) was the independent variable and the methylation at each CpG site (expressed as M values) was the outcome. Models were also adjusted for batch (coded as a categorical variable). A separate model was run for each individual CpG site. Bonferonni correction was used to account for multiple testing. As there were 16 CpG sites, 3 genetic polymorphisms (PON1,PON1, and PON1), and 2 ages comprising 96 tests, we used an adjusted type I error rate of 0.05/96 = 5.21 E-04. We repeated these models in a subset of newborns (n = 83) who also had differential cell counts available to adjust for cell type distribution. In these models, we included percent lymphocytes, monocytes, basophils, and eosinophils in the models and used the percent neutrophils as the baseline. Similar models were also performed for 9-year olds using minfi estimates of cell composition (n = 218). We also used linear regression models to examine associations between methylation and AREase activity. We constructed models including methylation at a single CpG site and batch as the independent variables and AREase as the dependent variable. Using Bonferonni correction, we considered P values less than 1.6 E-03 (0.05/32 tests) to be statistically significant. In addition to constructing separate models for each individual CpG site, we also used principal component (PC) analysis to summarize methylation at Block 1 and in the outer cluster of CpG sites (Fig. 2). The PCs explaining ∼95% of the variance of methylation for each group of CpG sites were incorporated as variables in regression models of AREase activity (dependent variable). Recent studies examining mediation of the relation of genetic polymorphisms on gene expression by epigenetics have employed causal inference methodologies [58-60]. We used the PARAMED module in STATA [32, 33] to determine the extent to which PON1 methylation mediates the effect of PON1 genotype on AREase activity. The methodologies applied in this module rely on a counterfactual framework based on parametric regression modelling and can estimate the natural direct effect and natural indirect effect (NIE) in the presence of exposure-mediator interaction. All analyses were performed in STATA (version 12.0; StataCorp, College Station, TX) and R (version 3.1.2; R Development Core Team 2008). Conflict of interest: None declared. Supplementary Data Click here for additional data file.
  54 in total

Review 1.  DNA methylation and human disease.

Authors:  Keith D Robertson
Journal:  Nat Rev Genet       Date:  2005-08       Impact factor: 53.242

Review 2.  Pharmacological and dietary modulators of paraoxonase 1 (PON1) activity and expression: the hunt goes on.

Authors:  Lucio G Costa; Gennaro Giordano; Clement E Furlong
Journal:  Biochem Pharmacol       Date:  2010-11-18       Impact factor: 5.858

3.  Considerations for normalization of DNA methylation data by Illumina 450K BeadChip assay in population studies.

Authors:  Paul Yousefi; Karen Huen; Raul Aguilar Schall; Anna Decker; Emon Elboudwarej; Hong Quach; Lisa Barcellos; Nina Holland
Journal:  Epigenetics       Date:  2013-08-19       Impact factor: 4.528

Review 4.  Cross-talk between site-specific transcription factors and DNA methylation states.

Authors:  Adam Blattler; Peggy J Farnham
Journal:  J Biol Chem       Date:  2013-10-22       Impact factor: 5.157

5.  Development of an immunoblot assay with infrared fluorescence to quantify paraoxonase 1 in serum and plasma.

Authors:  Philip W Connelly; Graham F Maguire; Clive M Picardo; John F Teiber; Dragomir Draganov
Journal:  J Lipid Res       Date:  2007-09-28       Impact factor: 5.922

6.  PON1 and neurodevelopment in children from the CHAMACOS study exposed to organophosphate pesticides in utero.

Authors:  Brenda Eskenazi; Karen Huen; Amy Marks; Kim G Harley; Asa Bradman; Dana Boyd Barr; Nina Holland
Journal:  Environ Health Perspect       Date:  2010-12       Impact factor: 9.031

7.  Association of organophosphate pesticide exposure and paraoxonase with birth outcome in Mexican-American women.

Authors:  Kim G Harley; Karen Huen; Raul Aguilar Schall; Nina T Holland; Asa Bradman; Dana Boyd Barr; Brenda Eskenazi
Journal:  PLoS One       Date:  2011-08-31       Impact factor: 3.240

8.  Analysis of DNA methylation in a three-generation family reveals widespread genetic influence on epigenetic regulation.

Authors:  Jason Gertz; Katherine E Varley; Timothy E Reddy; Kevin M Bowling; Florencia Pauli; Stephanie L Parker; Katerina S Kucera; Huntington F Willard; Richard M Myers
Journal:  PLoS Genet       Date:  2011-08-11       Impact factor: 5.917

9.  Validation of PON1 enzyme activity assays for longitudinal studies.

Authors:  Karen Huen; Rebecca Richter; Clement Furlong; Brenda Eskenazi; Nina Holland
Journal:  Clin Chim Acta       Date:  2008-12-24       Impact factor: 3.786

10.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

View more
  18 in total

Review 1.  Organophosphate neurotoxicity to the voluntary motor system on the trail of environment-caused amyotrophic lateral sclerosis: the known, the misknown, and the unknown.

Authors:  Samantha J Merwin; Teresa Obis; Yanelli Nunez; Diane B Re
Journal:  Arch Toxicol       Date:  2017-01-09       Impact factor: 5.153

Review 2.  Linking the Epigenome with Exposure Effects and Susceptibility: The Epigenetic Seed and Soil Model.

Authors:  Emma C Bowers; Shaun D McCullough
Journal:  Toxicol Sci       Date:  2016-10-20       Impact factor: 4.849

Review 3.  Future of environmental research in the age of epigenomics and exposomics.

Authors:  Nina Holland
Journal:  Rev Environ Health       Date:  2017-03-01       Impact factor: 3.458

4.  PON1 DNA methylation and neurobehavior in Mexican-American children with prenatal organophosphate exposure.

Authors:  Karen Huen; Olivia Solomon; Katherine Kogut; Brenda Eskenazi; Nina Holland
Journal:  Environ Int       Date:  2018-08-30       Impact factor: 9.621

5.  AHR gene-dioxin interactions and birthweight in the Seveso Second Generation Health Study.

Authors:  Jennifer Ames; Marcella Warner; Paolo Mocarelli; Paolo Brambilla; Stefano Signorini; Claudia Siracusa; Karen Huen; Nina Holland; Brenda Eskenazi
Journal:  Int J Epidemiol       Date:  2018-12-01       Impact factor: 7.196

Review 6.  DNA methylation as a mediator of associations between the environment and chronic diseases: A scoping review on application of mediation analysis.

Authors:  Ryosuke Fujii; Shuntaro Sato; Yoshiki Tsuboi; Andres Cardenas; Koji Suzuki
Journal:  Epigenetics       Date:  2021-08-12       Impact factor: 4.861

Review 7.  The Relationship between Cancer and Paraoxonase 1.

Authors:  Irma Martha Medina-Díaz; Néstor Ponce-Ruíz; Aurora Elizabeth Rojas-García; José Francisco Zambrano-Zargoza; Yael Y Bernal-Hernández; Cyndia Azucena González-Arias; Briscia S Barrón-Vivanco; José Francisco Herrera-Moreno
Journal:  Antioxidants (Basel)       Date:  2022-03-31

8.  CpG Methylation across the adipogenic PPARγ gene and its relationship with birthweight and child BMI at 9 years.

Authors:  Vitaly Volberg; Paul Yousefi; Karen Huen; Kim Harley; Brenda Eskenazi; Nina Holland
Journal:  BMC Med Genet       Date:  2017-01-26       Impact factor: 2.103

9.  Persistent DNA methylation changes associated with prenatal mercury exposure and cognitive performance during childhood.

Authors:  Andres Cardenas; Sheryl L Rifas-Shiman; Golareh Agha; Marie-France Hivert; Augusto A Litonjua; Dawn L DeMeo; Xihong Lin; Chitra J Amarasiriwardena; Emily Oken; Matthew W Gillman; Andrea A Baccarelli
Journal:  Sci Rep       Date:  2017-03-21       Impact factor: 4.379

10.  Small-Magnitude Effect Sizes in Epigenetic End Points are Important in Children's Environmental Health Studies: The Children's Environmental Health and Disease Prevention Research Center's Epigenetics Working Group.

Authors:  Carrie V Breton; Carmen J Marsit; Elaine Faustman; Kari Nadeau; Jaclyn M Goodrich; Dana C Dolinoy; Julie Herbstman; Nina Holland; Janine M LaSalle; Rebecca Schmidt; Paul Yousefi; Frederica Perera; Bonnie R Joubert; Joseph Wiemels; Michele Taylor; Ivana V Yang; Rui Chen; Kinjal M Hew; Deborah M Hussey Freeland; Rachel Miller; Susan K Murphy
Journal:  Environ Health Perspect       Date:  2017-03-31       Impact factor: 9.031

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.