| Literature DB >> 21297937 |
Shimul Chowdhury1, Stephen W Erickson, Stewart L MacLeod, Mario A Cleves, Ping Hu, Mohammad A Karim, Charlotte A Hobbs.
Abstract
The majority of congenital heart defects (CHDs) are thought to result from the interaction between multiple genetic, epigenetic, environmental, and lifestyle factors. Epigenetic mechanisms are attractive targets in the study of complex diseases because they may be altered by environmental factors and dietary interventions. We conducted a population based, case-control study of genome-wide maternal DNA methylation to determine if alterations in gene-specific methylation were associated with CHDs. Using the Illumina Infinium Human Methylation27 BeadChip, we assessed maternal gene-specific methylation in over 27,000 CpG sites from DNA isolated from peripheral blood lymphocytes. Our study sample included 180 mothers with non-syndromic CHD-affected pregnancies (cases) and 187 mothers with unaffected pregnancies (controls). Using a multi-factorial statistical model, we observed differential methylation between cases and controls at multiple CpG sites, although no CpG site reached the most stringent level of genome-wide statistical significance. The majority of differentially methylated CpG sites were hypermethylated in cases and located within CpG islands. Gene Set Enrichment Analysis (GSEA) revealed that the genes of interest were enriched in multiple biological processes involved in fetal development. Associations with canonical pathways previously shown to be involved in fetal organogenesis were also observed. We present preliminary evidence that alterations in maternal DNA methylation may be associated with CHDs. Our results suggest that further studies involving maternal epigenetic patterns and CHDs are warranted. Multiple candidate processes and pathways for future study have been identified.Entities:
Mesh:
Year: 2011 PMID: 21297937 PMCID: PMC3031146 DOI: 10.1371/journal.pone.0016506
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Selected characteristics of cases and controls.
| Cases (n = 180) n (%) | Controls (n = 187) n (%) | p-value | |
|
| |||
| <30 | 111 (61.7) | 113 (60.4) | |
| ≥30 | 69 (38.3) | 74 (39.6) | 0.831 |
|
| |||
| Caucasian | 136 (75.6) | 148 (79.1) | |
| African American | 32 (17.8) | 22 (11.8) | |
| Others | 12 (6.7) | 17 (9.1) | 0.205 |
|
| |||
| Missing | 0 (0.0) | 1 (0.5) | |
| No | 126 (70.0) | 151 (80.7) | |
| Yes | 54 (30.0) | 35 (18.7) | 0.015 |
|
| |||
| Missing | 1 (0.6) | 2 (1.1) | |
| No | 93 (51.7) | 85 (45.4) | |
| Yes | 86 (47.8) | 100 (53.5) | 0.294 |
|
| |||
| Missing | 0 (0.0) | 1 (0.5) | |
| No | 99 (55.0) | 111 (59.4) | |
| Yes | 81 (45.0) | 75 (40.1) | 0.400 |
|
| |||
| Missing | 5 (2.8) | 0 (0.0) | |
| High school or lower | 83 (46.1) | 79 (42.2) | |
| College education or higher | 92 (51.1) | 108 (57.8) | 0.342 |
|
| |||
| Missing | 13 (7.2) | 12 (6.4) | |
| Less than $10,000 | 34 (18.9) | 23 (12.3) | |
| $10,000–$30,000 | 60 (33.3) | 55 (29.4) | |
| $30,000–$50,000 | 39 (21.7) | 51 (27.3) | |
| More than $50,000 | 34 (18.9) | 46 (24.6) | 0.137 |
|
| |||
| Missing | 8 (4.4) | 9 (4.8) | |
| Underweight (<18.5) | 3 (1.7) | 6 (3.2) | |
| Normal (18.5–24.9) | 57 (31.7) | 78 (41.7) | |
| Overweight (25.0–29.9) | 44 (24.4) | 34 (18.2) | |
| Obese (≥30.0) | 68 (37.8) | 60 (32.1) | 0.118 |
Fisher's exact test for categorical variables.
Figure 1Distribution of β-values for selected subjects and CpG sites.
A) The distribution of β values for 27,249 CpG sites is displayed for four randomly selected subjects. The distribution for the four subjects is representative of the distribution for the entire sample population. B) The distribution of β values at selected CpG sites across all 367 study subjects. The four CpG sites were selected to illustrate typical methylation patterns observed in our sample population. Each CpG site displays skewed distributions, with a small amount of variation between individuals. Three of the four panels show substantially skewed distributions, most dramatically for the sites massed nearest 0 or 1. We did not observe drastic shifts in DNA methylation patterns in our sample population at individual CpG sites.
Figure 2Quantile-Quantile (Q-Q) plot of observed versus expected p-values.
A quantile-quantile (Q-Q) plot of –log10(p-values) for the tests of association between gene-specific methylation and disease status. The observed quantiles are consistently higher than their expected values under the null hypothesis of no disease association, providing evidence of site-specific disease association for a large number of CpG sites.
Top 50 differentially methylated genes ranked by statistical significance.
| SYMBOL | Gene name | Gene_ID | p-value | q-value | Chr | CpG Island | Direction |
| CTSL2 | cathepsin L2 | 1515 | 1.6E-05 | 0.13378 | 9 | Yes | + |
| RREB1 | ras responsive element binding protein 1 | 6239 | 1.6E-05 | 0.13378 | 6 | Yes | + |
| BSN | bassoon (presynaptic cytomatrix protein) | 8927 | 2.2E-05 | 0.13378 | 3 | Yes | + |
| HSPC268 | hypothetical protein HSPC268 | 154791 | 4.6E-05 | 0.17086 | 7 | Yes | + |
| NEUROG1 | neurogenin 1 | 4762 | 5.5E-05 | 0.17086 | 5 | Yes | + |
| MAP4K5 | mitogen-activated protein kinase kinase kinase kinase 5 | 11183 | 7.3E-05 | 0.17086 | 14 | Yes | + |
| FTMT | ferritin mitochondrial | 94033 | 7.4E-05 | 0.17086 | 5 | Yes | - |
| CPE | carboxypeptidase E | 1363 | 8.8E-05 | 0.17086 | 4 | Yes | + |
| EXT2 | exostosin 2 | 2132 | 0.0001 | 0.17086 | 11 | Yes | + |
| UBE3A | ubiquitin protein ligase E3A | 7337 | 0.00013 | 0.17086 | 15 | Yes | + |
| DNAJB6 | DnaJ (Hsp40) homolog, subfamily B, member 6 | 10049 | 0.00014 | 0.17086 | 7 | Yes | + |
| UQCRH | ubiquinol-cytochrome c reductase hinge protein | 7388 | 0.00014 | 0.17086 | 1 | Yes | + |
| ORF1-FL49 | chromosome 5 open reading frame 32 | 84418 | 0.00016 | 0.17086 | 5 | Yes | + |
| H2AFY2 | H2A histone family, member Y2 | 55506 | 0.00018 | 0.17086 | 10 | Yes | + |
| PKN2 | protein kinase N2 | 5586 | 0.00022 | 0.17086 | 1 | Yes | + |
| STRN3 | striatin, calmodulin binding protein 3 | 29966 | 0.00022 | 0.17086 | 14 | Yes | + |
| C8orf35 | chromosome 8 open reading frame 35 | 55174 | 0.00022 | 0.17086 | 8 | Yes | + |
| NDUFA7 | NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 7 | 4701 | 0.00023 | 0.17086 | 19 | Yes | - |
| SLC36A4 | solute carrier family 36 (proton/amino acid symporter), member 4 | 120103 | 0.00023 | 0.17086 | 11 | Yes | + |
| C6orf72 | chromosome 6 open reading frame 72 | 116254 | 0.00023 | 0.17086 | 6 | Yes | + |
| C6orf199 | chromosome 6 open reading frame 199 | 221264 | 0.00025 | 0.17086 | 6 | Yes | + |
| CCDC74A | coiled-coil domain containing 74A | 90557 | 0.00026 | 0.17086 | 2 | Yes | + |
| TERF1 | telomeric repeat binding factor (NIMA-interacting) 1 | 7013 | 0.00028 | 0.17086 | 8 | Yes | + |
| SNX3 | sorting nexin 3 | 8724 | 0.00028 | 0.17086 | 6 | Yes | + |
| HNRPH3 | heterogeneous nuclear ribonucleoprotein H3 (2H9) | 3189 | 0.0003 | 0.17086 | 10 | Yes | + |
| CGNL1 | cingulin-like 1 | 84952 | 0.00031 | 0.17086 | 15 | Yes | - |
| ITM2B | integral membrane protein 2B | 9445 | 0.00031 | 0.17086 | 13 | Yes | + |
| ZNF544 | zinc finger protein 544 | 27300 | 0.00032 | 0.17086 | 19 | Yes | + |
| MGC14376 | hypothetical protein MGC14376 | 84981 | 0.00035 | 0.17086 | 17 | Yes | + |
| ICAM3 | intercellular adhesion molecule 3 | 3385 | 0.00036 | 0.17086 | 19 | No | + |
| RRP22 | Ras-related protein 22 | 10633 | 0.00036 | 0.17086 | 22 | Yes | + |
| AGTPBP1 | ATP/GTP binding protein 1 | 23287 | 0.00037 | 0.17086 | 9 | Yes | + |
| WNT5A | wingless-type MMTV integration site family, member 5A | 7474 | 0.00037 | 0.17086 | 3 | Yes | + |
| GMDS | GDP-mannose 4,6-dehydratase | 2762 | 0.00039 | 0.17086 | 6 | Yes | + |
| XPR1 | xenotropic and polytropic retrovirus receptor 1 | 9213 | 0.0004 | 0.17086 | 1 | Yes | + |
| GDF3 | growth differentiation factor 3 | 9573 | 0.00041 | 0.17086 | 12 | Yes | + |
| RASEF | RAS and EF-hand domain containing | 158158 | 0.00041 | 0.17086 | 9 | Yes | + |
| KCTD4 | potassium channel tetramerisation domain containing 4 | 386618 | 0.00041 | 0.17086 | 13 | No | - |
| RFXAP | regulatory factor X-associated protein | 5994 | 0.00041 | 0.17086 | 13 | Yes | + |
| KCNA4 | potassium voltage-gated channel, shaker-related subfamily, member 4 | 3739 | 0.00042 | 0.17086 | 11 | Yes | + |
| EGFR | epidermal growth factor receptor | 1956 | 0.00043 | 0.17086 | 7 | Yes | + |
| LGR6 | leucine-rich repeat-containing G protein-coupled receptor 6 | 59352 | 0.00045 | 0.17086 | 1 | Yes | + |
| MAPK13 | mitogen-activated protein kinase 13 | 5603 | 0.00047 | 0.17086 | 6 | Yes | + |
| SSFA2 | sperm specific antigen 2 | 6744 | 0.00049 | 0.17086 | 2 | Yes | + |
| PHCA | phytoceramidase, alkaline | 55331 | 0.00049 | 0.17086 | 11 | Yes | + |
| TBC1D1 | TBC1 (tre-2/USP6, BUB2, cdc16) domain family, member 1 | 23216 | 0.00049 | 0.17086 | 4 | Yes | + |
| LRRC3B | leucine rich repeat containing 3B | 116135 | 0.00049 | 0.17086 | 3 | Yes | + |
| SHCBP1 | SHC SH2-domain binding protein 1 | 79801 | 0.0005 | 0.17086 | 16 | Yes | + |
| PMS2 | PMS2 postmeiotic segregation increased 2 (S. cerevisiae) | 5395 | 0.00053 | 0.17086 | 7 | Yes | + |
| ZNF304 | zinc finger protein 304 | 57343 | 0.00053 | 0.17086 | 19 | Yes | + |
p-values were calculated by multiple linear regression and randomization testing (see Materials and Methods).
q-values were calculated using the qvalue package in the R statistical programming environment under default settings (see Materials and Methods).
Chr = chromosome.
Direction of methylation in cases.
Differentially methylated imprinted genes.
| SYMBOL | Gene function | p-value | q-value | Direction of methylation in cases | Chromosome | CpG Island |
|
| Imprinted | 0.0001262 | 0.1708606 | + | 15 | Yes |
|
| Imprinted | 0.0008294 | 0.1743975 | + | 19 | Yes |
|
| Imprinted | 0.0039526 | 0.1822456 | - | 15 | Yes |
|
| Imprinted | 0.0043908 | 0.1822456 | - | 11 | No |
p-values were calculated by multiple linear regression and randomization testing (see Materials and Methods).
q-values were calculated using the qvalue package in the R statistical programming environment under default settings (see Materials and Methods).
Differentially methylated miRNAs.
| Gene | miRNA | p-value | q-value | Direction of methylation in cases | Chromosome | CpG Island |
|
| HSA-MIR-10A; | 0.0043384 | 0.1822456 | + | 17 | Yes |
|
| HSA-MIR-34B;HSA-MIR-34C; | 0.0006663 | 0.1708606 | + | 11 | Yes |
|
| HSA-MIR-423; | 0.0029542 | 0.1822456 | - | 17 | Yes |
|
| HSA-MIR-565; | 0.0008043 | 0.1743975 | + | 3 | Yes |
|
| HSA-MIR-594; | 0.0000463 | 0.1708606 | + | 7 | Yes |
|
| HSA-MIR-594; | 0.0045767 | 0.1822456 | + | 7 | Yes |
|
| HS_104; | 0.0033306 | 0.1822456 | + | 10 | Yes |
|
| HS_128; | 0.0034043 | 0.1822456 | + | 8 | Yes |
|
| HS_138; | 0.0026144 | 0.1822456 | + | 10 | Yes |
|
| HS_140; | 0.0044543 | 0.1822456 | + | 9 | Yes |
|
| HS_176; | 0.0015582 | 0.1822456 | + | 11 | Yes |
|
| HS_253;HS_253; | 0.0041322 | 0.1822456 | - | 16 | Yes |
|
| HS_255; | 0.0049322 | 0.1822456 | + | 1 | Yes |
|
| HS_78; | 0.0015055 | 0.1822456 | + | 1 | Yes |
p-values were calculated by multiple linear regression model and randomization testing (see Materials and Methods).
q-values were calculated using the qvalue package in the R statistical programming environment under default settings (see Materials and Methods).
Figure 3Gene Set Enrichment Analysis (GSEA) of biological processes and canonical pathways for differentially methylated genes.
A) The top ten significantly enriched biological processes are displayed with their corresponding –log10(p-value). Multiple processes appear to be directly related to fetal heart development. B) The top ten significantly enriched canonical pathways are displayed with the corresponding −log10(p-value). Multiple pathways had previously been implicated in CHDs.