Literature DB >> 26553366

Sex differences in DNA methylation assessed by 450 K BeadChip in newborns.

Paul Yousefi1, Karen Huen1, Veronica Davé1, Lisa Barcellos1, Brenda Eskenazi1, Nina Holland2.   

Abstract

BACKGROUND: DNA methylation is an important epigenetic mark that can potentially link early life exposures to adverse health outcomes later in life. Host factors like sex and age strongly influence biological variation of DNA methylation, but characterization of these relationships is still limited, particularly in young children.
METHODS: In a sample of 111 Mexican-American subjects (58 girls , 53 boys), we interrogated DNA methylation differences by sex at birth using the 450 K BeadChip in umbilical cord blood specimens, adjusting for cell composition.
RESULTS: We observed that ~3% of CpG sites were differentially methylated between girls and boys at birth (FDR P < 0.05). Of those CpGs, 3031 were located on autosomes, and 82.8% of those were hypermethylated in girls compared to boys. Beyond individual CpGs, we found 3604 sex-associated differentially methylated regions (DMRs) where the majority (75.8%) had higher methylation in girls. Using pathway analysis, we found that sex-associated autosomal CpGs were significantly enriched for gene ontology terms related to nervous system development and behavior. Among hits in our study, 35.9% had been previously reported as sex-associated CpG sites in other published human studies. Further, for replicated hits, the direction of the association with methylation was highly concordant (98.5-100%) with previous studies.
CONCLUSIONS: To our knowledge, this is the first reported epigenome-wide analysis by sex at birth that examined DMRs and adjusted for confounding by cell composition. We confirmed previously reported trends that methylation profiles are sex-specific even in autosomal genes, and also identified novel sex-associated CpGs in our methylome-wide analysis immediately after birth, a critical yet relatively unstudied developmental window.

Entities:  

Mesh:

Year:  2015        PMID: 26553366      PMCID: PMC4640166          DOI: 10.1186/s12864-015-2034-y

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

There is a growing interest in examining the role epigenetic marks like histone modifications, non- coding RNAs, and DNA methylation may play as biological mechanisms through which environmental exposures and other physiological and lifestyle factors can lead to disease. Unlike genetics, epigenetic modifications are dynamic and can change over time or in response to exposures. Furthermore, host factors such as sex and age also contribute to inter-individual differences in epigenetic markers. Previous studies of DNA methylation using the Illumina 27 K BeadChip methylation array have reported autosomal differentially methylated positions (DMPs) or CpG sites with varying methylation between males and females, providing evidence that it will be important to adjust for sex in analysis of methylation data [1-6]. However, these studies did not account for the existence of non-specific probes for autosomal CpGs that cross react with CpGs on sex chromosomes, thereby yielding false positives [7]. Recently, McCarthy et al. published a meta-analysis of 76 studies all using the 27 K BeadChip array to identify sex-associated autosomal DMPs across specimens from multiple tissue types from adults and children [8]. After excluding the sex-biased cross-reactive probes, they identified 184 DMPs that were associated with sex. While McCarthy et al. identified several interesting autosomal DMPs, their study focused on methylation assessed by the 27 K BeadChip. In 2011, Illumina released a new version of their methylation array, the 450 K BeadChip, which greatly expanded the number of CpGs interrogated to over 480,000 sites. Further, their approach was restricted to identification of individual DMPs rather than differentially methylated regions (DMRs). DMR-finding approaches have several advantages over considering CpG sites individually, including decreased likelihood of hits from technical artifacts and possibly improved functional impact of results. As methylation is cell-type specific and immune cell profiles have been shown to vary between sexes, consideration of cell composition is of utmost importance in methylation studies [9, 10]. Yet previous studies of sex-associated differences in methylation [1-6] haven’t taken this into account in their analyses. White blood cell composition can be estimated from 450 K BeadChip data computationally in adults [11, 12], but these estimates are not appropriate for use for young children in their current implementation [13]. As an alternative, differential cell count (DCC) can be employed to effectively determine such cell type proportions (% lymphocytes, monocytes, neutrophils, eosinophils, and basophils) in cord blood samples. Here, we use the 450 K BeadChip to assess sex differences in DNA methylation from umbilical cord blood from boys and girls participating in a large epidemiologic cohort followed by the Center for the Health Assessment of Mothers and Children of Salinas (CHAMACOS) study. We use DCCs to account for white blood cell composition. In addition to interrogating DMPs, we apply the newly released ‘DMRcate’ methodology [14] to identify sex-associated DMRs in newborns.

Methods

Study population

The CHAMACOS study is a longitudinal birth cohort study of the effects of exposure to pesticides and environmental chemicals on the health and development of Mexican-American children living in the agricultural region of Salinas Valley, CA. Detailed description of the CHAMACOS cohort has previously been published [15, 16]. Briefly, 601 pregnant women were enrolled in 1999–2000 at community clinics and 527 liveborn singletons were born. Follow up visits occurred at regular intervals throughout childhood. For this analysis, we include the subset of subjects that had both 450 K BeadChip data and differential cell count analysis available at birth (n = 111). Mothers retained in the study subset had a mean age of 25.8 years (±5.1 SD) at time of delivery. Study protocols were approved by the University of California, Berkeley Committee for Protection of Human Subjects. Written informed consent was obtained from all mothers.

Blood collection and processing

Cord blood was collected and stored in both heparin coated BD vacutainers (Becton, Dickinson and Company, Franklin Lakes, NJ) and vacutainers without anticoagulant at the same time. Blood clots from anticoagulant-free vacutainers were stored at −80 °C and used for isolation of DNA for DNA methylation analysis. Heparinized cord blood was used to prepare whole blood slides using the push-wedge blood smearing technique [17] and stored at −20 °C until staining for differential white blood cell count.

DNA preparation

DNA isolation was performed using QIAamp DNA Blood Maxi Kits (Qiagen, Valencia, CA) according to manufacturer’s protocol with small, previously described modifications [18]. Following isolation, all samples were checked for DNA quality and quantity by Nanodrop 2000 Spectrophotometer (Thermo Scientific, Waltham, MA). Those with good quality (260/280 ratio exceeding 1.8) were normalized to a concentration of 50 ng/ul.

450 K BeadChip DNA methylation analysis

DNA samples were bisulfite converted using Zymo Bisulfite Conversion Kits (Zymo Research, Irvine, CA), whole genome amplified, enzymatically fragmented, purified, and applied to Illumina Infinium HumanMethylation450 BeadChips (Illumina, San Diego, CA) according to manufacturer protocol. Locations of samples from boys and girls were randomly assigned across assay wells, chips and plates to prevent any batch bias. 450 K BeadChips were handled by robotics and analyzed using the Illumina Hi-Scan system. DNA methylation was measured at 485,512 CpG sites. Probe signal intensities were extracted by Illumina GenomeStudio software (version XXV2011.1, Methylation Module 1.9) methylation module and back subtracted. Systematic QA/QC was performed, including assessment of assay repeatability, batch effects using 38 technical replicates, and data quality established as previously described [19]. Samples were retained only if 95 % of sites assayed had detection P > 0.01. Color channel bias, batch effects and difference in Infinium chemistry were minimized by application of All Sample Mean Normalization (ASMN) algorithm [19], followed by Beta Mixture Quantile (BMIQ) normalization [20]. Sites with annotated probe SNPs and with common SNPs (minor allele frequency >5 %) within 50 bp of the target identified in the MXL (Mexican ancestry in Los Angeles, California) HapMap population were excluded from analysis (n = 49,748). Probes where 95 % of samples had detection P > 0.01 were also dropped (n = 460). Since our analysis was focused on CpG sites associated with sex, we excluded sites on the Y chromosome (n = 95) and X-chromosome cross-reactive probes (n = 29,233) identified by Chen and colleagues [7]. Remaining CpGs included 410,072 sites for analysis of sex. Methylation values at all sites were logit transformed to the M-value scale to better comply with modeling assumption [21].

Differential cell counts

Whole blood smear slides were stained utilizing a DiffQuik® staining kit, a modern commercial variant of the Romanovsky stain, a histological stain used to differentiate cells on a variety of smears and aspirates. This staining highlights cytoplasmic details and neurosecretory granules, which are utilized to characterize the differential white blood count. The staining kit is composed of a fixative (3:1 methanol: acetic acid solution), eosinophilic dye (xanthene dye), basophilic dye (dimethylene blue dye) and wash (deionized water). For consistency and to ensure the best results the slides were all fixed for 15 min at 23 °C (room temperature), stained in both the basophilic dye and eosinophilic dye for 5 s each and washed after each staining period to prevent the corruption of the dye. Slides were scored for white blood cell type composition by Zeiss Axioplan light microscope with 100× oil immersion lens. Scoring was conducted at the perceived highest density of white blood cells using the standard battlement track scan method, which covers the entire width of a slide examination area. Counts for each of the five identifiable cell types (lymphocytes, monocytes, neutrophils, eosinophils, and basophils) were recorded by a dedicated mechanical counter. At least 100 cells were scored for each slide following validation of reproducibility by the repeated scoring of 5 sets of 100 cells from the same slide (CV ≤ 5 %).

DMP analysis

Association between sex at birth and differential 450 K DNA methylation at individual CpGs was performed by linear regression, adjusting for DCC variables and analysis batch. This analysis was performed using R statistical computing software (v3.1.0) [22]. Although DCC estimates were not significantly associated with sex, we chose to include them in the model because likelihood ratio tests showed that including them improved model fit for more than 2000 of the CpG sites assessed by 450 K BeadChip. We also examined gestational age and subject birthweight as possible covariates since both have been shown to be associated with DNA methylation [23], and performed sensitivity analysis to assess their potential impact. However, neither was associated with child sex or contributed to improved model fit. P-values were corrected for multiple testing using a Benjamini-Hochberg (BH) FDR threshold of 0.05 [24].

Enrichment of annotated genomic features

Comparison of sex-DMP results to annotated function categories, including relation to genes(TSS1500, TSS200, 5′UTR, 1stExon, Body, 3′UTR, Intergenic) and CpG islands (Island, Shore, Shelf, Open Sea), was performed using UCSC Genome Browser annotations supplied by Illumina. A χ2 test of independence with 1° of freedom was used to determine whether there was evidence of enrichment among DMP results (P value < 0.05).

DMR analysis

Identification of sex-associated DMRs was performed using the method described by Peters et al. [14] and implemented in the DMRcate Bioconductor R-package [25]. The approach begins by fitting a standard limma linear model to all CpG sites in parallel [26]. This model was parameterized identically to the DMP analysis with sex as the binary predictor of interest, adjusting for DCC variables and analysis batch. The CpG site test statistics were then smoothed by chromosome according to the DMRcate defaults, which employs a Gaussian kernel smoother with bandwidth λ = 1000 base pairs (bp) and scaling factor C = 2. The resulting kernel-weighted local model fit statistics were compared to modeled values using the method of Satterthwaite [27] to produce p-values that are adjusted for multiple testing using a BH FDR threshold of 0.05 [24]. Regions or DMRs were assigned by grouping FDR significant sites that are a maximum of λ bp from one another and contain at least two or more CpGs. Under this method, CpGs are collapsed into DMRs without considering the direction of the association with the predictor (i.e. sex). The minimum BH-adjusted p-value within a given DMR is taken as representative of the statistical inference for that region and the maximum fold change in methylation values (here on the M-value scale) summarizes the effect size.

Gene ontology analysis

Gene ontology term enrichment analysis was performed by DAVID [28, 29], WebGestalt (WEB-based Gene SeT AnaLysis) [30], and ConsensusPathDB [31], using hypergeometric distribution to assess enrichment significance. Visualization of results and GO term categorization by semantic similarity dimension reduction was performed by REVIGO [32].

Results

Sex-associated differentially methylated positions in newborns

Analysis of DNA methylation differences between newborn boys and girls was performed by linear regression for 450 K BeadChip CpGs among subjects with DCC measurements (n = 111; 58 girls and 53 boys), adjusting for cell composition and batch (Table 1). After data cleaning, n = 410,072 CpGs were analyzed, which excluded sites previously reported to exhibit sex-chromosome specific cross-reactivity [7]. Resulting p-values were plotted by chromosome, with sites having higher methylation levels in girls compared to boys plotted above the x-axis and those with lower levels plotted below (Fig. 1). After adjustment for multiple testing (FDR p < 0.05), we identified 11,776 CpGs that differed significantly by sex in newborns (Table 2). Of those hits, the majority of sites had higher methylation in girls compared to boys (69.0 %). This trend was consistent on both the X chromosome (64.3 % of sites higher in girls) and in autosomes (82.8 %). While the majority of hits were found on the X chromosome (74.3 %), a substantial number were also identified on autosomes (3031 or 25.7 %; Table 2).
Table 1

Demographic characteristics of newborn CHAMACOS subjects, N = 111

Boys, N = 53Girls, N = 58
N % N % P Value*
Country of birth0.18
 Mexico4446.85053.2
 United States642.9857.1
 Other310000
Maternal age at delivery (years)
 18–242655.32144.70.54
 25–291643.22156.8
 30–34838.11361.9
 35–45350350
Family income0.58
 Below poverty threshold32503250
 Above poverty threshold2144.72655.3
Maternal BMI (kg/m2)0.31
 Normal2451.12348.9
 Overweight1437.82362.2
 Obese14561144
Smoking during pregnancy0.07
 No5150.55049.5
 Yes220880
Boys, N = 53Girls, N = 58
Mean ± SDMean ± SD P Value**
Gestational age (weeks)39.1 ± 1.438.7 ± 2.00.35
Birthweight (Kg)3.5 ± 0.53.4 ± 0.60.35
Blood cell composition (%)
 Lymphocytes28.7 ± 4.529.6 ± 2.50.15
 Monocytes7.1 ± 1.96.8 ± 1.80.48
 Neutrophils60.5 ± 1.860.4 ± 2.90.53
 Eosinophils3.1 ± 1.22.8 ± 1.00.44
 Basophils0.2 ± 0.30.3 ± 0.50.27

*P value from χ2 test for independence

**P value from Mann-Whitney U test

Fig. 1

Manhattan plot for association between child sex and DNA methylation at all 450 K CpGs, adjusting for batch and cell composition by differential cell count (DCC). Associations where methylation was higher for girls relative to boys are plotted above the x-axis, while those with decreased methylation are plotted below. CpGs meeting FDR multiple testing threshold of (P < 0.05) shown in red

Table 2

Summary of sex-associated DMPs

AutosomesX chromosome
All CpG sitesCount%Count%
Hypermethylated in girls compared to boys8,1312,50930.95,62269.1
Hypomethylated in girls compared to boys3,64552214.33,12385.7
Total11,7763,03125.78,74574.3

Number of CpGs significantly hyper- and hypo-methylated in newborn girls compared to boys at FDR multiple testing threshold (q < 0.05), for all CpGs, and then stratified by autosomes and the X chromosome

Demographic characteristics of newborn CHAMACOS subjects, N = 111 *P value from χ2 test for independence **P value from Mann-Whitney U test Manhattan plot for association between child sex and DNA methylation at all 450 K CpGs, adjusting for batch and cell composition by differential cell count (DCC). Associations where methylation was higher for girls relative to boys are plotted above the x-axis, while those with decreased methylation are plotted below. CpGs meeting FDR multiple testing threshold of (P < 0.05) shown in red Summary of sex-associated DMPs Number of CpGs significantly hyper- and hypo-methylated in newborn girls compared to boys at FDR multiple testing threshold (q < 0.05), for all CpGs, and then stratified by autosomes and the X chromosome As differential hypermethylation is to be expected for girls due to X-inactivation [33-35], we focused characterization of results on autosomal sites showing sex differences (Table 3 and Additional file 1). Most of these were located in CpG shores, islands and open sea (40.4, 40.1, and 15.4 %, respectively) (Fig. 2 and Table 4). In comparison, shelf regions had the lowest percentage of hits (4.1 %). To assess whether the overrepresentation of hits in CpG islands and shores was due to the design of the 450 K BeadChip, we compared the number of hits in each functional category with the number of CpG sites included in the assay. Both shores and CpG islands were significantly overrepresented among all autosomal hits compared to the 450 K background (χ2 = 486.1, P < 0.01 and χ2 = 95.5, P < 0.01), while shelves and the open sea hits were underrepresented (each with P < 0.01). For CpG sites that were hypermethylated in girls compared to boys, we also observed overrepresentation in CpG islands and shores, and underrepresentation in shelf and open-sea locations (all P < 0.01). Sites that were hypomethylated in girls compared to boys were underrepresented in the open sea (30.3 %, P < 0.01) and shelves (5.6 %, P < 0.01). Hypomethylated sites were enriched at islands (χ2 = 6.53, P = 0.01), but did not deviate significantly from the 450 K representation of shores (χ2 = 3.42, P = 0.06).
Table 3

Results for the top 30 gene-annotated autosomal DMPs associated with sex in CHAMACOS newborns

ProbeIDGene symbol(s)ChromosomePositiona βGirl (M-value) P-valueGirl mean (% methylation)Boy mean (% methylation)
cg03691818KRT7712530850382.381.84E-4311.93.0
cg26921482AMDHD21625702831.722.43E-4343.820.1
cg17743279CDK67924632681.844.04E-313.11.0
cg07852945TLE19843039150.937.30E-3114.07.8
cg26355737TFDP113114292172−0.957.57E-2980.688.5
cg25568337ARID1B61570983380.681.36E-2621.214.6
cg05100634SMAD218454576041.772.91E-262.40.9
cg03608000ZNF6919119986230.861.76E-245.53.1
cg02325951FOXN31489878619−0.724.04E-2462.473.0
cg17612569GABPA;ATP5J2127107221−3.077.86E-241.06.8
cg04874129SLC6A216556908730.759.82E-2314.19.0
cg08906898RBM392034319899−0.922.66E-2287.292.6
cg04946709LOC6446491659789030−0.733.58E-2270.178.8
cg02989351YWHAQ297705840.371.90E-2016.513.2
cg12204423PHF174129732568−0.821.94E-205.08.3
cg25304146WBP11P11830092971−0.494.64E-2057.664.9
cg22345911CSNK1D17802312631.185.43E-205.22.7
cg01906879GBE13818110160.544.17E-198.56.0
cg06152526PLLP1657290525−0.625.25E-1969.877.5
cg04190002SHANK322511136040.405.35E-1933.827.9
cg06644124ZNF28112003790830.506.63E-1931.425.1
cg07628841GPN1;CCDC1212278514300.307.09E-1942.337.3
cg23001456KIAA06641726150740.867.47E-194.52.8
cg26213873CTTNBP2NL11129390560.413.34E-188.66.6
cg25438440CLDND13982411680.553.72E-186.14.2
cg07816873ERC11211004720.543.88E-1823.417.5
cg24016844C1orf1031111506641−0.431.28E-177.49.6
cg11841231PARD3B2205543309−0.685.39E-1779.885.9
cg13323902VTRNA1-151400908590.536.23E-1726.019.8
cg12900929PRDM4121081548620.831.41E-1611.26.9

Regression coefficients, βgirl, are reported in M-value scale for the change in methylation of girls relative to boys. Girl and boy mean methylation levels are shown on the β value or % methylation

aPositions shown for hg19 (Genome Reference Consortium GRCh37) genome assembly

Fig. 2

Percent of 450 K CpGs (purple), and percent of all (blue), hypermethylated (dark green), and hypomethylated (light green) autosomal differentially methylated positions (DMPs) associated with sex (a & b). These percentages are given by island functional categories (island, shore, shelf, and open sea) in a, and gene functional categories (within 1500 bp of a transcription start site (TSS), 200 bp of a TSS, a 5′ untranslated region (UTR), first exon, gene body, 3′UTR, and intergenic) in b. * indicates that the proportion of sites significantly altered compared to the coverage on the 450 K BeadChip (P < 0.05)

Table 4

DMPs by gene and CpG island annotation

Sex DMPs
Autosomal 450 KAll DMPsHypermethylatedHypomethylated
Island127,4881,2161,022194
Shore93,9811,2251,084141
Shelf37,4901249529
Open sea141,170466308158
Total400,1293,0312,509522
TSS150058,08859454846
TSS20044,64729824058
5′UTR36,27923920732
1stExon18,84213411321
Body135,011766607159
3′UTR14,073796019
Intergenic93,189921734187
Total400,1293,0312,509522

Counts of all, hypermethylated, and hypomethylated autosomal CpGs associated with sex in newborns by proximity to CpG island and gene features. Count of autosomal and all 450 K CpGs shown for reference

Results for the top 30 gene-annotated autosomal DMPs associated with sex in CHAMACOS newborns Regression coefficients, βgirl, are reported in M-value scale for the change in methylation of girls relative to boys. Girl and boy mean methylation levels are shown on the β value or % methylation aPositions shown for hg19 (Genome Reference Consortium GRCh37) genome assembly Percent of 450 K CpGs (purple), and percent of all (blue), hypermethylated (dark green), and hypomethylated (light green) autosomal differentially methylated positions (DMPs) associated with sex (a & b). These percentages are given by island functional categories (island, shore, shelf, and open sea) in a, and gene functional categories (within 1500 bp of a transcription start site (TSS), 200 bp of a TSS, a 5′ untranslated region (UTR), first exon, gene body, 3′UTR, and intergenic) in b. * indicates that the proportion of sites significantly altered compared to the coverage on the 450 K BeadChip (P < 0.05) DMPs by gene and CpG island annotation Counts of all, hypermethylated, and hypomethylated autosomal CpGs associated with sex in newborns by proximity to CpG island and gene features. Count of autosomal and all 450 K CpGs shown for reference The 11,776 CpG hits differentially methylated between newborn boys and girls were found in 2250 unique genes, and 1430 (63.6 %) of these genes were located on autosomes. Many genes contained multiple significant sites, with an average of 4.7 CpGs per gene and a maximum of 114 CpGs. However, the largest portion of sex-associated autosomal hits (30.4 %) was located in intergenic regions and seen at lower than expected frequency in gene bodies (P < 0.01)(Fig. 2). Near gene transcription starting points (TSS200, 5′UTR, and first exons), all categories were either lower than 450 K CpG design frequencies or did not deviate from them significantly. Further upstream (TS1500), hits that were hypermethylated in girls were significantly enriched (χ2 = 108.5, P < 0.01) while those showing decreased methylation were underepresented (χ2 = 13.3, P < 0.01). At the end of genes (3′UTR), hits that had higher methylation for girls were underrepresented (2.4 %, P < 0.01), while hits having higher methylation for boys did not deviate from expected 450 K frequencies (3.6 %, p = 0.97). Examining the autosomal genes containing sex-associated DMPs for enrichment of particular gene ontology (GO) terms identified 278 pathways that were significantly enriched (FDR P < 0.05 and at least 5 genes per GO term) (Table 5). These enriched GO terms fell into several broad categories including: 1) nervous system development, 2) behavior, 3) cellular development processes, and 4) cellular signaling and motility (Additional file 2).
Table 5

The top 30 differentially enriched gene ontology pathways among hits for sex in autosomal CpGs

Genes
GO categoryCategory nameTotalChanged P-valueFDR
GO:0043005Neuron projection651672.22E-050.00E + 00
GO:0007275Multicellular organismal development46213539.06E-124.91E-09
GO:0097458Neuron part9801048.83E-118.47E-09
GO:0044767Single-organism developmental process54023942.05E-101.24E-08
GO:0048856Anatomical structure development48283592.43E-101.24E-08
GO:0007268Synaptic transmission692781.50E-095.09E-08
GO:0048731System development40933115.69E-101.54E-07
GO:0007270Neuron-neuron synaptic transmission129263.30E-094.76E-07
GO:0007267Cell-cell signaling11921153.51E-094.76E-07
GO:0043167Ion binding60384222.02E-081.25E-06
GO:0007626Locomotory behavior186301.25E-073.19E-06
GO:0032879Regulation of localization20101684.15E-084.50E-06
GO:0044707Single-multicellular organism process64624382.47E-075.04E-06
GO:0009653Anatomical structure morphogenesis24841962.97E-075.05E-06
GO:0044765Single-organism transport36052621.79E-062.36E-05
GO:0044763Single-organism cellular process11,9497391.85E-062.36E-05
GO:0009790Embryo development998945.15E-074.66E-05
GO:1902578Single-organism localization37912714.22E-064.79E-05
GO:0007399Nervous system development20531714.97E-084.81E-05
GO:0065008Regulation of biological quality32392364.77E-064.86E-05
GO:0051703Intraspecies interaction between organisms40116.02E-065.58E-05
GO:0009887Organ morphogenesis918879.65E-076.55E-05
GO:0048513Organ development29582229.66E-076.55E-05
GO:0036477Somatodendritic compartment562601.10E-068.50E-05
GO:0044459Plasma membrane part22791741.90E-069.14E-05
GO:0016043Cellular component organization54103661.68E-051.43E-04
GO:0030425Dendrite378451.15E-061.70E-04
GO:0051705Multi-organism behavior79152.27E-051.78E-04
GO:0005883Neurofilament962.54E-061.88E-04
GO:0035637Multicellular organismal signaling751867.94E-082.00E-04
The top 30 differentially enriched gene ontology pathways among hits for sex in autosomal CpGs

Sex-associated differentially methylated regions in newborns

Additionally, identification of groups of CpGs with 450 K BeadChip methylation differences between newborn boys and girls was performed using the DMR-finding algorithm DMRcate [14, 25]. This approach identifies and ranks DMRs by Gaussian kernel smoothing of results from linear models for individual CpGs that were adjusted for cell composition and array batch (see Methods for details). A total of 3604 DMRs were significantly associated with sex in newborns after correcting for multiple testing (FDR p < 0.05; Table 6 and Additional files 3 and 4). These spanned 2608 genes and contained a total of 22,402 unique CpGs. The number of sites within the DMRs ranged from 2 to 99 CpGs, with 50 % of DMRs containing 5 or more CpGs and 25 % having 8 or more. Further, DMR length averaged 863.8 bp, and ranged from 3 to 16.5 kb. Figure 3 shows the DNA methylation levels for boys and girls at two example top DMRs. Figure 3a shows 7 CpG sites in a DMR that had higher methylation for girls in a region spanning the PPP1R3G transcription factor on chromosome 6. While Fig. 3b shows a 8 CpGs from a DMR with lower methylation among girls in the promoter of PIWIL1, which is an important gene for stem cell proliferation and inhibition of transposon migration [36, 37].
Table 6

Results for the top 30 gene-annotated autosomal DMRs associated with sex in CHAMACOS newborns

Gene symbol(s)ChromosomeStart positiona End positiona # of probesMaxFC for girls compared to boysMinimum P-value
KRT7712530847095308532340.091.34E-132
AMDHD2,ATP6V0C162569911257144990.247.17E-129
PPP1R3G65085986508774970.163.35E-78
CDK679246197192464481140.021.89E-65
TFDP11311429197711429274010−0.084.54E-57
CYP1A1157501815075019376260.126.24E-57
C6orf174612779628712779728670.122.64E-54
SMAD2184545644145458698110.021.47E-51
ARID1B615709780015709937590.076.59E-48
PEX10123440892347015260.076.53E-46
ATP5J,GABPA21271067932710825711−0.069.90E-45
SLC6A216556898655569110290.051.00E-43
A1BG,NCRNA0018119588615025886239860.111.14E-43
PHF1741297318351297335748−0.032.69E-43
NUPL1132587485925876335140.118.61E-42
PPFIA319496362704963659430.152.68E-40
ZNF69191199845711999148110.034.58E-39
REM1,NCRNA0002820300717263007357690.117.17E-39
YWHAQ29770130977134770.031.00E-37
LOC6446491659788728597901807−0.091.74E-35
FOXN31489878584898787335−0.112.22E-35
PXDNL8523209445232234190.077.31E-35
SHANK322511125365111436440.061.86E-34
SHANK211706723657067325611−0.061.33E-33
RBM392034319899343199892−0.061.74E-33
GIPC217851114078512129120.061.75E-33
CSNK1D178023066080232440120.031.82E-33
NAPSA1950860534508621218−0.102.28E-33
FBXO4717371236383712455890.082.94E-31
CCDC121,GPN122785096427852231140.055.71E-31

Max fold changes (FC) reported in M-value scale for the change in methylation of girls relative to boys

aPositions shown for hg19 (Genome Reference Consortium GRCh37) genome assembly

Fig. 3

DNA methylation (β values) for CpG sites included in two top DMRs associated with child sex in newborns. One DMR (a) contains 7 CpG sites, is located on chromosome 6 and spans a 1763 bp region in the exon of PPP1R3G (chr6:5085986–5087749). The other (b) on chromosome 12 includes 8 CpGs over a 1365 bp region across the promoter and 1st exon of PIWIL1 (chr12:130821453–130822818). Girls are shown with red circles, boys with blue triangles, and median methylation per CpG by sex is shown by red and blue lines. Green lines show the genomic coordinates of exon regions for each gene shown

Results for the top 30 gene-annotated autosomal DMRs associated with sex in CHAMACOS newborns Max fold changes (FC) reported in M-value scale for the change in methylation of girls relative to boys aPositions shown for hg19 (Genome Reference Consortium GRCh37) genome assembly DNA methylation (β values) for CpG sites included in two top DMRs associated with child sex in newborns. One DMR (a) contains 7 CpG sites, is located on chromosome 6 and spans a 1763 bp region in the exon of PPP1R3G (chr6:5085986–5087749). The other (b) on chromosome 12 includes 8 CpGs over a 1365 bp region across the promoter and 1st exon of PIWIL1 (chr12:130821453–130822818). Girls are shown with red circles, boys with blue triangles, and median methylation per CpG by sex is shown by red and blue lines. Green lines show the genomic coordinates of exon regions for each gene shown As with DMPs, the majority of sex-associated DMRs had higher methylation in girls compared to boys (75.8 %; Additional file 3: Table S1). This was true for both autosomes and sex chromosomes when considered individually, with 83.8 and 58.5 % of DMRs having higher methylation in girls, respectively. However, a greater total number of DMRs identified were located on autosomes (2471 or 68.6 %) compared to the X chromosome. Similarly, the 70.3 % of the genes covered by sex-associated DMRs were located on autosomes. Further, while the DMRcate method does not constrain all CpGs within a DMR to have the same direction of association with the predictor of interest, we found that the majority of DMRs had 100 % concordance across CpGs in the direction of effect with sex (Additional file 5). Comparison of the individual site results (DMPs) with the DMR findings revealed that of the 11,776 CpG sites associated with sex in the DMP analysis, 9, 941 (84.4 %) were also included in a DMR. On autosomes, DMRs included 83.2 % of sites found as sex-associated DMPs. Conversely, the DMRs added 12,461 total sites (11,719 on autosomes) that had not been found by DMP analysis alone.

Discussion

Here, we assessed methylation sex differences in newborns as determined by 450 K BeadChip. Using reliable DCC estimates, our results are the first reported EWAS analysis by sex at birth that adjusted for confounding by cell composition. To our knowledge, we are also the first study to assess regions of differential methylation associated with sex in addition to considering all CpG sites individually. We identified a large numbers of X-chromosome CpG sites with higher methylation in girls, which is most likely attributable to X-inactivation [33, 38]. Interestingly, we further demonstrated that a substantial number of autosomal sites and regions also appear hypermethylated in females (Fig. 1 and Table 2). To assess the consistency of our findings with those of prior analyses, autosomal CpG sites identified as differentially methylated by sex in the current analysis were compared to hits from the three most similar published studies to date (Table 7) [8, 39, 40]. These studies differed from ours either in DNA methylation analysis platform (27 K in McCarthy et al. [18]) or in tissue type used (Xu et al. [39] in human prefrontal cortex and Hall et al. [40] in pancreatic isolates). Although the meta-analysis performed by McCarthy et al. included some studies in umbilical cord blood, most of the studies were performed in adult tissues. Each study found between 184 and 614 autosomal CpG sites that were differentially methylated in association with sex (total of n = 1192 unique sites across all three studies). Our results replicated 428 (35.9 %) of all hits, and 29.4–42.4 % by different studies. Further, among replicated sites we observed 98.5–100 % concordance in the direction of methylation differences. While there was substantial overlap between our autosomal sex-associated hits and these previously published results, 2603 or 85.9 % of our results are novel findings, some of which may be specific to the time point and tissue assessed (umbilical cord blood). Our larger number of hits is likely due to the increased coverage of the 450 K BeadChip. In fact, when considered as a percentage of the number of sites analyzed, we observed a comparable portion of autosomal hits to that found by McCarthy and colleagues using the 27 K platform (0.74 and 0.68 % respectively; P = 0.25).
Table 7

Comparison of CHAMACOS autosomal sex-associated CpG sites (n = 3031) with other published studies

StudyStudy populationTissuePlatformNumber of autosomal hitsNumber of CHAMACOS hits replicated (%)Percent of concordance in direction of methylation differences among replicated sites
McCarthy et al. 2014 [18]Meta analysis of 76 studies (n = 6,795)Multiple typesIllumina 27 K18454 (29.4 %)100 %
Xu et al. 2014 [39]46 Caucasian adultspostmortem prefrontal cortexIllumina 450 K614260 (42.4 %)98.50 %
Hall et al. 2014 [40]87 Caucasian adultspancreasIllumina 450 K470176 (37.5 %)100 %
Comparison of CHAMACOS autosomal sex-associated CpG sites (n = 3031) with other published studies Importantly, the autosomal methylation increases we observed were most concentrated in CpG islands and shores (Fig. 2a). As this trend was not evaluated in past studies, it should be explored and confirmed in additional datasets. Further, our findings that neurodevelopmental ontology terms were strongly enriched among our autosomal findings suggests that DNA methylation may contribute to differences in cognitive processes early in life. This is consistent with sex differences in brain development and rates of maturation that have previously been observed by magnetic resonance imaging in slightly older children (6–17 years of age) [41] and represent a possible regulatory mechanism requiring additional investigation. Our autosomal hits included several genes already known to exhibit sex-specific functions. These included the male fertility and spermatogenesis related genes identified by McCarthy and colleagues (DDX43, NUPL1, CRISP2, FIGNL1, SPESP1 and SLC9A2). One of our top hits showing increased methylation for girls (Table 3) included SLC6A4, Solute Carrier Family 6, that is involved in presynaptic reuptake of norepinephrine and has been implicated in several neurological disorders with sex-differences in prevalence [42-44]. Similarly, we observed novel sex differences in the SHANK2 and SHANK3 scaffolding protein genes that have been associated with autism spectrum disorders (Tables 3 and 6, Additional file 1) [45, 46]. Further, our hits included the homeobox containing transcription factor EMX2, Empty Spiracles Homeobox2, that is required for sexual differentiation and gonadal development [47] and we found to be hypermethylated among girls (Additional file 1). The DMR analysis confirmed several trends observed by analyzing CpGs individually. In particular, DMR results again showed that girls tend to exhibit hypermethylation compared to boys. Also, many CpGs found to be autosomal DMPs were separately identified as being located within sex-associated DMRs. Besides confirming many of the findings in the DMP analysis, the application of DMR-finding substantially expanded the number of CpG sites considered significant. These results demonstrate that considering methylation over regions rather than single CpG sites may be a more effective way to identify differentially methylated sites and genes of interest.

Conclusions

We confirmed and expanded previously identified trends in autosomal and X-chromosome methylation sex differences during a previously unstudied window in child development, immediately after birth, likely critical in establishing long term health. This strategy to assess epigenetic perturbation as near as possible to the prenatal period remains a high priority in light of the fetal origins of human disease hypothesis [48-51].
  46 in total

1.  Sex differences in brain maturation during childhood and adolescence.

Authors:  M D De Bellis; M S Keshavan; S R Beers; J Hall; K Frustaci; A Masalehdan; J Noll; A M Boring
Journal:  Cereb Cortex       Date:  2001-06       Impact factor: 5.357

2.  Age related changes in 5-methylcytosine content in human peripheral leukocytes and placentas: an HPLC-based study.

Authors:  C Fuke; M Shimabukuro; A Petronis; J Sugimoto; T Oda; K Miura; T Miyazaki; C Ogura; Y Okazaki; Y Jinno
Journal:  Ann Hum Genet       Date:  2004-05       Impact factor: 1.670

3.  An approximate distribution of estimates of variance components.

Authors:  F E SATTERTHWAITE
Journal:  Biometrics       Date:  1946-12       Impact factor: 2.571

4.  Considerations for normalization of DNA methylation data by Illumina 450K BeadChip assay in population studies.

Authors:  Paul Yousefi; Karen Huen; Raul Aguilar Schall; Anna Decker; Emon Elboudwarej; Hong Quach; Lisa Barcellos; Nina Holland
Journal:  Epigenetics       Date:  2013-08-19       Impact factor: 4.528

5.  Epigenetic vestiges of early developmental adversity: childhood stress exposure and DNA methylation in adolescence.

Authors:  Marilyn J Essex; W Thomas Boyce; Clyde Hertzman; Lucia L Lam; Jeffrey M Armstrong; Sarah M A Neumann; Michael S Kobor
Journal:  Child Dev       Date:  2011-09-02

6.  Prevalence of neutropenia in the U.S. population: age, sex, smoking status, and ethnic differences.

Authors:  Matthew M Hsieh; James E Everhart; Danita D Byrd-Holt; John F Tisdale; Griffin P Rodgers
Journal:  Ann Intern Med       Date:  2007-04-03       Impact factor: 25.391

Review 7.  The control of sexual differentiation of the reproductive system and brain.

Authors:  C A Wilson; D C Davies
Journal:  Reproduction       Date:  2007-02       Impact factor: 3.906

8.  Shank3 mutant mice display autistic-like behaviours and striatal dysfunction.

Authors:  João Peça; Cátia Feliciano; Jonathan T Ting; Wenting Wang; Michael F Wells; Talaignair N Venkatraman; Christopher D Lascola; Zhanyan Fu; Guoping Feng
Journal:  Nature       Date:  2011-03-20       Impact factor: 49.962

9.  Sex differences in the genome-wide DNA methylation pattern and impact on gene expression, microRNA levels and insulin secretion in human pancreatic islets.

Authors:  Elin Hall; Petr Volkov; Tasnim Dayeh; Jonathan Lou S Esguerra; Sofia Salö; Lena Eliasson; Tina Rönn; Karl Bacos; Charlotte Ling
Journal:  Genome Biol       Date:  2014-12-03       Impact factor: 13.583

10.  WEB-based GEne SeT AnaLysis Toolkit (WebGestalt): update 2013.

Authors:  Jing Wang; Dexter Duncan; Zhiao Shi; Bing Zhang
Journal:  Nucleic Acids Res       Date:  2013-05-23       Impact factor: 16.971

View more
  71 in total

1.  Sex-specific DNA methylation differences in people exposed to polybrominated biphenyl.

Authors:  Sarah W Curtis; Sabrina A Gerkowicz; Dawayland O Cobb; Varun Kilaru; Metrecia L Terrell; M Elizabeth Marder; Dana Boyd Barr; Carmen J Marsit; Michele Marcus; Karen N Conneely; Alicia K Smith
Journal:  Epigenomics       Date:  2020-06-04       Impact factor: 4.778

2.  Blood DNA methylation as a potential biomarker of dementia: A systematic review.

Authors:  Peter D Fransquet; Paul Lacaze; Richard Saffery; John McNeil; Robyn Woods; Joanne Ryan
Journal:  Alzheimers Dement       Date:  2017-11-08       Impact factor: 21.566

3.  Colonic Bacteroides are positively associated with trabecular bone structure and programmed by maternal vitamin D in male but not female offspring in an obesogenic environment.

Authors:  C R Villa; A Taibi; J Chen; W E Ward; E M Comelli
Journal:  Int J Obes (Lond)       Date:  2017-11-30       Impact factor: 5.095

4.  Evaluation of DNA Methylation Episignatures for Diagnosis and Phenotype Correlations in 42 Mendelian Neurodevelopmental Disorders.

Authors:  Erfan Aref-Eshghi; Jennifer Kerkhof; Victor P Pedro; Mouna Barat-Houari; Nathalie Ruiz-Pallares; Jean-Christophe Andrau; Didier Lacombe; Julien Van-Gils; Patricia Fergelot; Christèle Dubourg; Valerie Cormier-Daire; Sophie Rondeau; François Lecoquierre; Pascale Saugier-Veber; Gaël Nicolas; Gaetan Lesca; Nicolas Chatron; Damien Sanlaville; Antonio Vitobello; Laurence Faivre; Christel Thauvin-Robinet; Frederic Laumonnier; Martine Raynaud; Mariëlle Alders; Marcel Mannens; Peter Henneman; Raoul C Hennekam; Guillaume Velasco; Claire Francastel; Damien Ulveling; Andrea Ciolfi; Simone Pizzi; Marco Tartaglia; Solveig Heide; Delphine Héron; Cyril Mignot; Boris Keren; Sandra Whalen; Alexandra Afenjar; Thierry Bienvenu; Philippe M Campeau; Justine Rousseau; Michael A Levy; Lauren Brick; Mariya Kozenko; Tugce B Balci; Victoria Mok Siu; Alan Stuart; Mike Kadour; Jennifer Masters; Kyoko Takano; Tjitske Kleefstra; Nicole de Leeuw; Michael Field; Marie Shaw; Jozef Gecz; Peter J Ainsworth; Hanxin Lin; David I Rodenhiser; Michael J Friez; Matt Tedder; Jennifer A Lee; Barbara R DuPont; Roger E Stevenson; Steven A Skinner; Charles E Schwartz; David Genevieve; Bekim Sadikovic
Journal:  Am J Hum Genet       Date:  2020-02-27       Impact factor: 11.025

5.  miRNAs differentially expressed by next-generation sequencing in cord blood buffy coat samples of boys and girls.

Authors:  Daneida Lizarraga; Karen Huen; Mary Combs; Maria Escudero-Fung; Brenda Eskenazi; Nina Holland
Journal:  Epigenomics       Date:  2016-11-24       Impact factor: 4.778

6.  Sexual epigenetic dimorphism in the human placenta: implications for susceptibility during the prenatal period.

Authors:  Elizabeth Martin; Lisa Smeester; Paige A Bommarito; Matthew R Grace; Kim Boggess; Karl Kuban; Margaret R Karagas; Carmen J Marsit; T Michael O'Shea; Rebecca C Fry
Journal:  Epigenomics       Date:  2017-02-17       Impact factor: 4.778

7.  Sex-specific associations of birth weight with measures of adiposity in mid-to-late adulthood: the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil).

Authors:  G Rockenbach; V C Luft; N T Mueller; B B Duncan; M C Stein; Á Vigo; S M A Matos; M J M Fonseca; S M Barreto; I M Benseñor; L J Appel; M I Schmidt
Journal:  Int J Obes (Lond)       Date:  2016-04-28       Impact factor: 5.095

8.  5-Hydroxymethylcytosine in cord blood and associations of DNA methylation with sex in newborns.

Authors:  Olivia Solomon; Julia L Macisaac; Gwen Tindula; Michael S Kobor; Brenda Eskenazi; Nina Holland
Journal:  Mutagenesis       Date:  2019-12-19       Impact factor: 3.000

9.  Sex-differential DNA methylation and associated regulation networks in human brain implicated in the sex-biased risks of psychiatric disorders.

Authors:  Yan Xia; Rujia Dai; Kangli Wang; Chuan Jiao; Chunling Zhang; Yuchen Xu; Honglei Li; Xi Jing; Yu Chen; Yi Jiang; Richard F Kopp; Gina Giase; Chao Chen; Chunyu Liu
Journal:  Mol Psychiatry       Date:  2019-04-11       Impact factor: 15.992

10.  Biobehavioral organization shapes the immune epigenome in infant rhesus Macaques (Macaca mulatta).

Authors:  A Baxter; J P Capitanio; K L Bales; E L Kinnally
Journal:  Brain Behav Immun       Date:  2021-06-16       Impact factor: 7.217

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.