| Literature DB >> 23157493 |
Kristel R van Eijk1, Simone de Jong, Marco P M Boks, Terry Langeveld, Fabrice Colas, Jan H Veldink, Carolien G F de Kovel, Esther Janson, Eric Strengman, Peter Langfelder, René S Kahn, Leonard H van den Berg, Steve Horvath, Roel A Ophoff.
Abstract
BACKGROUND: The predominant model for regulation of gene expression through DNA methylation is an inverse association in which increased methylation results in decreased gene expression levels. However, recent studies suggest that the relationship between genetic variation, DNA methylation and expression is more complex.Entities:
Mesh:
Year: 2012 PMID: 23157493 PMCID: PMC3583143 DOI: 10.1186/1471-2164-13-636
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of probes constituting significant methylation and expression combinations and their association with SNPs
| + | - | ||||
|---|---|---|---|---|---|
| | | | | | |
| 517 | 224 | 354 | 61 | 69 probes (13.3%), 86 independent loci | |
| 495 | 214 | 336 | 55 | 62 probes (12.5%), 73 independent loci | |
| | | | | | |
| 705 | 585 | 230 | 110 | 1 probe (<1%) | |
| 170 | 101 | 117 | 48 | 0 probes |
This table shows the significant methylation and expression combinations, subdivided into cis and trans associations. The first column shows the counts of unique probes (for methylation and expression). The second and third columns indicate the number of probes positively (+) or negatively (−) associated. The fourth column indicates the overlapping probes: methylation or expression probes that are associated with expression or methylation levels in both directions. The last column indicates the number (and %) of unique probes associated with SNPs and the number of independent (pruned r2 of 0.2) loci in cis.
Distribution of results over CpG islands and shores
| 11,582 | 42% | 269 | 38.2%, | |||||
| 8,718 | 32% | 184 | 35.6%, | |||||
| 7,278 | 26% | 23 | 31.5%, ns | |||||
| 27,578 | 517 | 705 | 73 | |||||
Methylation probes were classified into three categories according to UCSC browser ( http://genome.ucsc.edu/); CpG islands, CGI shores (up to 2kb around an island) and outside islands or shores. Differences compared to Illumina Human Methylation 27K array were tested using Fisher’s Exact for count data (Bonferroni threshold p:0.05/9=0.006). A downward arrow indicates significantly lower percentage of probes while an upward arrow indicates significantly higher percentage of observations compared to the overall probe distribution on the Illumina array.
Top results LEO analysis, Results for top SNPs
| - | Outside | S → M → E | 6.90 | 0.15 | rs2093169 | 6 | 26,603,078 | butyrophilin, subfamily 3, member A2 | |
| - | Outside | S → M → E | 4.24 | 0.82 | rs8044555 | 16 | 70,710,256 | haptoglobin | |
| - | Outside | S → M → E | 2.73 | 0.13 | rs11227306 | 11 | 65,335,248 | cathepsin W | |
| - | Shore | S → M → E | 2.69 | 0.11 | rs4874159 | 8 | 144,742,093 | nicotinate phosphoribosyltransferase domain containing 1 | |
| - | Shore | S → M → E | 1.50 | 2.9e-03 | rs4755227 | 11 | 44,078,659 | 1-aminocyclopropane-1-carboxylate synthase homolog | |
| + | Shore | S → M → E | 1.36 | 0.16 | rs6627737 | X | 151,971,610 | nicotinate phosphoribosyltransferase domain containing 1 | |
| - | Island | S → M → E | 1.09 | 0.01 | rs11147317 | 13 | 113,957,498 | cell division cycle 16 homolog (S. cerevisiae) | |
| - | Shore | S → E → M | 2.42 | 7.2e-04 | rs2030731 | 11 | 63,130,224 | phospholipase A2, group XVI | |
| - | Island | S → E → M | 2.08 | 9.5e-03 | rs11207272 | 1 | 58,846,018 | tumor-associated calcium signal transducer 2 | |
| - | Shore | S → E → M | 1.87 | 5.4e-03 | rs6076864 | 20 | 569,825 | sulfiredoxin 1 homolog (S. cerevisiae) | |
| - | Outside | S → E → M | 1.30 | 2.8e-03 | rs8133866 | 21 | 46,423,604 | chromosome 21 open reading frame 56 | |
| - | Outside | S → E → M | 1.14 | 6.4e-05 | rs12199613 | 6 | 26,475,197 | butyrophilin, subfamily 3, member A2 | |
| - | Shore | S → E → M | 0.95 | 8.6e-04 | rs11763011 | 7 | 72,922,084 | Williams Beuren syndrome chromosome region 27 | |
| - | Island | S → E → M | 0.88 | 1.4e-03 | rs11807 | 1 | 110,062,265 | glutathione S-transferase mu 3 (brain) |
This table contains top probes resulting from causality analysis (LEO scores > 0.8). The top seven genes fit the causal scenario SNP → Methylation → Expression (S→M→E), while the bottom seven genes fit the reverse model in which DNA methylation is regulated by gene expression that is under genetic control (S→E→M). The Gene Symbol is given in the first column. The second column indicates whether the methylation and expression levels are associated negatively (−) or positively (+). The third column indicates whether the methylation probe is located in a CpG island (CGI), in the shore, or outside both. The columns “LEO model”, “LEO score” and “P-value” indicate which causal model fits best with the corresponding LEO score and P-value. This model fitting p-value is calculated using the model chi-square statistic statistic. The chi-square statistic tests the null hypothesis that the model is correct, thus a p-value > 0.01 indicates good fit. The next column indicates the SNP most significantly associated. The last three columns contain chromosome number and base pair location (NCBI build 36) of the SNP and full name of the gene.
Top GO enrichment terms for expression modules
| 1 | 1520 | 1 | 1.20E-14 | 0.65147 | CC | membrane-bounded organelle |
| 1 | 1520 | 2 | 1.90E-14 | 0.65007 | CC | intracellular membrane-bounded organelle |
| 2 | 703 | 1 | 1.30E-05 | 0.04885 | CC | ribosome |
| 2 | 703 | 2 | 1.50E-05 | 0.07481 | BP | translation |
| 3 | 658 | 1 | 0.00018 | 0.16382 | CC | extracellular region |
| 4 | 647 | 1 | 0.011 | 0.63711 | CC | membrane-bounded organelle |
| 6 | 442 | 1 | 7.60E-07 | 0.27229 | BP | response to stress |
| 6 | 442 | 2 | 2.80E-06 | 0.34217 | BP | signal transduction |
| 7 | 426 | 1 | 9.40E-07 | 0.18734 | BP | immune system process |
| 8 | 407 | 1 | 0.0042 | 0.66755 | CC | intracellular membrane-bounded organelle |
| 9 | 387 | 1 | 1.70E-05 | 0.82961 | CC | intracellular part |
| 10 | 355 | 1 | 1.80E-05 | 0.07855 | BP | ncRNA metabolic process |
| 10 | 355 | 2 | 3.60E-05 | 0.19335 | CC | mitochondrion |
| 12 | 306 | 1 | 2.00E-16 | 0.1134 | CC | ribosome |
| 12 | 306 | 2 | 4.00E-16 | 0.17182 | CC | ribonucleoprotein complex |
| 12 | 306 | 3 | 1.20E-14 | 0.14777 | BP | translation |
| 12 | 306 | 4 | 2.30E-14 | 0.09278 | BP | viral transcription |
| 13 | 260 | 1 | 0.00041 | 0.025 | CC | hemoglobin complex |
| 13 | 260 | 2 | 0.011 | 0.025 | BP | heme biosynthetic process |
| 14 | 237 | 1 | 0.014 | 0.61086 | MF | protein binding |
| 15 | 118 | 1 | 0.018 | 0.25688 | BP | intracellular signal transduction |
| 15 | 118 | 2 | 0.032 | 0.12844 | BP | small GTPase mediated signal transduction |
| 16 | 108 | 1 | 1.30E-05 | 0.18478 | BP | translation |
| 16 | 108 | 2 | 0.00017 | 0.1087 | BP | ribosome biogenesis |
| 17 | 99 | 1 | 2.30E-09 | 0.36957 | CC | mitochondrion |
| 18 | 72 | 1 | 3.30E-06 | 0.1791 | BP | platelet activation |
| 18 | 72 | 2 | 1.20E-05 | 0.22388 | BP | blood coagulation |
| 19 | 60 | 1 | 1.40E-17 | 0.31034 | MF | structural constituent of ribosome |
| 19 | 60 | 2 | 2.90E-17 | 0.32759 | CC | ribosome |
| 20 | 52 | 1 | 7.10E-23 | 0.32653 | BP | type I interferon-mediated signaling pathway |
| 20 | 52 | 2 | 7.10E-23 | 0.32653 | BP | cellular response to type I interferon |
| 23 | 32 | 1 | 0.053 | 0.17241 | CC | external side of plasma membrane |
For each module we list the top enriched GO terms. Columns list the module label, module size, rank of the enrichment p-value for that particular module, the Bonferroni-corrected enrichment p-value (the correction is performed with respect to the number of GO terms), fraction of the module genes also in the GO term, GO ontology, and GO term name. Multiple expression modules exhibit significant enrichment. Row shading separates modules for easier reading. The enrichment provides evidence that the modules are biologically meaningful.
Top GO enrichment terms for methylation modules
| 32 | 1067 | 1 | 4.20E-09 | 0.1875 | CC | extracellular region |
| 1 | 1045 | 1 | 8.20E-16 | 0.790123 | CC | intracellular |
| 1 | 1045 | 3 | 1.50E-11 | 0.083333 | CC | ribonucleoprotein complex |
| 30 | 616 | 1 | 2.50E-21 | 0.34687 | BP | anatomical structure development |
| 30 | 616 | 2 | 1.60E-18 | 0.189509 | BP | nervous system development |
| 7 | 594 | 1 | 8.30E-10 | 0.157424 | BP | immune system process |
| 7 | 594 | 2 | 9.10E-10 | 0.177102 | MF | receptor activity |
| 3 | 427 | 1 | 3.20E-12 | 0.259947 | CC | extracellular region |
| 12 | 130 | 1 | 0.0032 | 0.075 | BP | DNA recombination |
| 2 | 105 | 1 | 1.70E-10 | 0.191919 | BP | lymphocyte activation |
| 2 | 105 | 2 | 2.30E-10 | 0.20202 | BP | leukocyte activation |
| 2 | 105 | 3 | 3.60E-10 | 0.323232 | BP | immune system process |
For each module we list the top enriched GO terms. Columns list the module label, module size, rank of the enrichment p-value for that particular module, the Bonferroni-corrected enrichment p-value (the correction is performed with respect to the number of GO terms), fraction of the module genes also in the GO term, GO ontology, and GO term name. Multiple methylation modules exhibit significant enrichment. Row shading separates modules for easier reading. Again, the enrichment provides evidence that the modules are biologically meaningful.
Figure 1Preservation and association of co-expression and co-methylation modules. A. Module preservation statisticZsummary that summarizes evidence of preservation of expression modules in methylation data. Each module is labelled by a numeric label and the corresponding colour. Values of Zsummary below 2 indicate no evidence of preservation; values between 2 and 5 indicate weak to moderate evidence for preservation. Only the largest module, labelled 1 (turquoise), exhibits Zsummary above 5 that can be considered moderate-strong evidence of preservation. B. Analogous plot of the Zsummary statistic for preservation of methylation modules in expression data. As in expression data, only the largest module (also labelled 1, turquoise) exhibits moderate-strong evidence of preservation. C. Robust correlations and the corresponding p-values of expression (y-axis) and methylation (x-axis) eigengenes. Each row corresponds to an expression eigengene (E.ME) labelled by numeric module label and colour. Each column corresponds to a methylation eigengene (M.ME) labelled by numeric module label and colour. Numbers in the table report the robust correlation and the corresponding p-value of the respective expression and methylation eigengenes. Only correlations whose p-value is below 0.05 are displayed. The table is colour-coded according to correlation such that (strong) green colour corresponds to (strong) negative correlations, and (strong) red colour corresponds to (strong) positive correlations.