| Literature DB >> 23644594 |
Xiaojing Xu1, Shaoyong Su, Vernon A Barnes, Carmen De Miguel, Jennifer Pollock, Dennis Ownby, Hidong Shi, Haidong Zhu, Harold Snieder, Xiaoling Wang.
Abstract
Besides differential methylation, DNA methylation variation has recently been proposed and demonstrated to be a potential contributing factor to cancer risk. Here we aim to examine whether differential variability in methylation is also an important feature of obesity, a typical non-malignant common complex disease. We analyzed genome-wide methylation profiles of over 470,000 CpGs in peripheral blood samples from 48 obese and 48 lean African-American youth aged 14-20 y old. A substantial number of differentially variable CpG sites (DVCs), using statistics based on variances, as well as a substantial number of differentially methylated CpG sites (DMCs), using statistics based on means, were identified. Similar to the findings in cancers, DVCs generally exhibited an outlier structure and were more variable in cases than in controls. By randomly splitting the current sample into a discovery and validation set, we observed that both the DVCs and DMCs identified from the first set could independently predict obesity status in the second set. Furthermore, both the genes harboring DMCs and the genes harboring DVCs showed significant enrichment of genes identified by genome-wide association studies on obesity and related diseases, such as hypertension, dyslipidemia, type 2 diabetes and certain types of cancers, supporting their roles in the etiology and pathogenesis of obesity. We generalized the recent finding on methylation variability in cancer research to obesity and demonstrated that differential variability is also an important feature of obesity-related methylation changes. Future studies on the epigenetics of obesity will benefit from both statistics based on means and statistics based on variances.Entities:
Keywords: African-Americans; epigenome-wide association study (EWAS); genome-wide association study (GWAS); methylation variation; obesity
Mesh:
Year: 2013 PMID: 23644594 PMCID: PMC3741222 DOI: 10.4161/epi.24506
Source DB: PubMed Journal: Epigenetics ISSN: 1559-2294 Impact factor: 4.528

Figure 1. DMC, DVCs and DMVCs. (A) Density histograms of DMCs (differentially methylated CpG sites). These P-values were derived from linear regression based on the Limma package comparing differences in means between lean and obese subjects. (B) Density histograms of DVCs (differentially variable CpG sites). These P-values were derived from Bartlett’s test comparing differences in variances between lean and obese. (C) Venn diagram illustrating DMCs, DVCs and DMVCs (differentially methylated and variable CpG sites). The overlapping DMVCs were significantly enriched with P.Fisher < 2.2E-16 and P.permutation (1000 times) < 0.001. (D) Top ranked DMC cg08339189. Y-axis shows the β value, x-axis the sample. Phenotypes were indicated as lean (black, n = 48) and obese (red, n = 48). The dashed lines show the mean levels in lean (0.15) and obese (0.18) separately. (E) Top ranked DVC cg24570070. The mean levels were not significantly different (0.88 in obese vs. 0.91 in lean). However, obese cases showed a large methylation variance. (F) Top ranked DMVC cg00033915. The mean levels (dashed line) were significantly different between the two groups (0.92 in obese vs. 0.93 in lean, raw p = 1.1E-5 and FDR = 4.2E-3). Furthermore, the obese group also showed significantly larger variance.

Figure 2. Distributions of DMCs, DVCs and DMVCs. (A) Distributions across average methylation levels. The Y-axis represents the density, x-axis the average methylation β value. The orange line presented the kernel density curve of overall CpG sites across β 0–1, the green line presented DMCs, the gray line DVCs and the blue line DMVCs. The percentages of each of these four types of CpG sites with betas below 0.2 or above 0.8 are also listed in the plot. (B) Distribution across genomic locations. Chi-square test found significant difference among their distributions across the genome with P-value < 2.2E-16. TSS200, CpG sites within 200bp from the transcription starting site (TSS); TSS1500, CpG sites within 200–1500bp from the transcription starting site (TSS); body, gene body.

Figure 3. Increased methylation variance in obese. (A) Scatter plot of mean methylation difference (x-axis) against methylation variance ratio (y-axis) comparing lean and obese. The percentage of DMCs, DVCs and DMVCs within each of the four quadrants was also listed. (B) Scatter plot of mean methylation difference (x-axis) against methylation variance ratio (y-axis) comparing lean and obese limited to the CpG sites on the illumina 27K chip. The percentage of DMCs, DVCs and DMVCs within each of the four quadrants was also listed.

Figure 4. Predictive ability of DMCs and DVCs. (A) Predictive ability of DMCs. The receiver operating characteristic (ROC) analysis showed the area under curve (AUC) and its 95% confidence interval (CI) in a randomly split testing set (24 obese vs. 24 lean) from the whole data set. (B) Predictive ability of DVCs. AUC and 95%CI was presented in a randomly split testing set (24 obese vs. 24 lean) from the whole data set.

Figure 5. Gene ontology enrichment analysis of DMCs, DVCs and DMVCs. Gene ontology analysis was performed using DAVID (http://david.abcc.ncifcrf.gov). The human genome was used as background. The top 500 DMCs (A), the top 500 DVCs (B) and all DMVCs (n = 1608) (C) were selected for analysis. The top ten enriched pathways are listed here together with their enrichment P-values, which are derived from a modified Fisher’s exact test.
Table 1. Enrichment analysis of DMCs, DVCs, DMVCs and Hyper-DMVCs in obesity and its comorbidities
| Obesity related diseases | Significant genes | Odds ratio‡ | 95% confidence interval | |||
|---|---|---|---|---|---|---|
| Obesity (105)* | DMCs † | 77 | 2.70 | 1.73–4.32 | 2.9E-6 | < 0.001 |
| DVCs | 72 | 1.97 | 1.28–3.07 | 0.001 | < 0.001 | |
| DMVCs | 24 | 3.38 | 2.04–5.40 | 2.9E-6 | < 0.001 | |
| Hyper-DMVCs | 23 | 3.87 | 2.32–6.23 | 5.1E-7 | < 0.001 | |
| Diabetes (54)* | DMCs | 37 | 2.13 | 1.17–4.04 | 0.009 | < 0.001 |
| DVCs | 36 | 1.80 | 1.00–3.66 | 0.041 | 0.003 | |
| DMVCs | 6 | 1.41 | 0.49–3.31 | 0.448 | 0.238 | |
| Hyper-DMVCs | 6 | 1.71 | 0.60–4.00 | 0.270 | 0.065 | |
| Hypertension (46)* | DMCs | 29 | 1.67 | 0.89–3.24 | 0.105 | 0.018 |
| DVCs | 28 | 1.40 | 0.75–2.68 | 0.302 | 0.078 | |
| DMVCs | 4 | 1.07 | 0.28–2.97 | 0.787 | 0.313 | |
| Hyper-DMVCs | 3 | 0.95 | 0.19–2.97 | 1.000 | 0.380 | |
| Dyslipidemia (106) * | DMCs | 68 | 1.75 | 1.16–2.68 | 0.006 | 0.003 |
| DVCs | 64 | 1.37 | 0.91–2.08 | 0.119 | 0.034 | |
| DMVCs | 12 | 1.44 | 0.72–2.65 | 0.214 | 0.103 | |
| Hyper-DMVCs | 12 | 1.75 | 0.87–3.20 | 0.079 | 0.028 | |
| Breast Cancer (60)* | DMCs | 40 | 1.96 | 1.11–3.53 | 0.014 | 0.001 |
| DVCs | 42 | 2.10 | 1.18–3.88 | 0.009 | 0.003 | |
| DMVCs | 9 | 2.00 | 0.86–4.10 | 0.059 | 0.019 | |
| Hyper-DMVCs | 7 | 1.80 | 0.69–3.99 | 0.191 | 0.054 | |
| Colon Cancer (19) * | DMCs | 13 | 2.12 | 0.75–6.79 | 0.167 | 0.010 |
| DVCs | 14 | 2.52 | 0.86–8.93 | 0.105 | 0.010 | |
| DMVCs | 3 | 2.14 | 0.40–7.50 | 0.194 | 0.063 | |
| Hyper-DMVCs | 2 | 1.60 | 0.18–6.77 | 0.377 | 0.153 | |
| Tumor suppressor genes (828)** | DMCs | 537 | 1.85 | 1.59–2.14 | 2.2E-16 | < 0.001 |
| DVCs | 550 | 1.82 | 1.57–2.12 | 3.5E-16 | < 0.001 | |
| DMVCs | 119 | 1.97 | 1.59–2.41 | 7.2E-10 | < 0.001 | |
| Hyper-DMVCs | 98 | 1.89 | 1.50–2.36 | 9.9E-10 | < 0.001 | |
| Oncogenes (461)** | DMCs | 301 | 1.87 | 1.53–2.28 | 1.5E-10 | < 0.001 |
| DVCs | 303 | 1.74 | 1.43–2.13 | 1.2E-8 | < 0.001 | |
| DMVCs | 53 | 1.48 | 1.09–1.99 | 0.012 | 0.007 | |
| Hyper-DMVCs | 45 | 1.49 | 1.06–2.04 | 0.015 | 0.007 |
† DMCs [FDR < 0.05], different mean CpG sites, include 9994 unique genes. DVCs (FDR < 0.05), different variance CpG sites, include 10408 unique genes. DMVCs, different mean and variance CpG sites, include 1608 unique genes. Hyper-DMVCs, DMVCs presenting hyper variable in obese, include 1351 unique genes. And there were a total of 19751 valid unique genes detected. ‡ Odds ratio, 95% confidence interval and P.Fisher were calculated by Fisher’s exact test. P.permutation was calculated by a test based on 1000 permutations. * GWAS (Genome-wide association studies) gene numbers of these obesity related diseases are summarized from previous publications as well as the GWAS catalog (http://www.genome.gov/gwastudies). We summarized 105 obesity GWAS genes, 54 diabetes genes, 46 hypertension genes, 106 dyslipidemia genes, 60 breast cancer genes as well as 19 colon cancer genes.** Tumor suppressor genes and oncogenes were selected from the Memorial Sloan-Kettering Cancer Center (MSKCC) cancer database (http://cbio.mskcc.org/CancerGenes/Select.action). We exported 828 tumor suppressor genes and 461 oncogenes from this database.