| Literature DB >> 25592753 |
Xiaowei Wu1, Ming-An Sun2, Hongxiao Zhu3, Hehuang Xie4,5.
Abstract
BACKGROUND: With recent development in sequencing technology, a large number of genome-wide DNA methylation studies have generated massive amounts of bisulfite sequencing data. The analysis of DNA methylation patterns helps researchers understand epigenetic regulatory mechanisms. Highly variable methylation patterns reflect stochastic fluctuations in DNA methylation, whereas well-structured methylation patterns imply deterministic methylation events. Among these methylation patterns, bipolar patterns are important as they may originate from allele-specific methylation (ASM) or cell-specific methylation (CSM).Entities:
Mesh:
Year: 2015 PMID: 25592753 PMCID: PMC4302125 DOI: 10.1186/s12859-014-0439-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Promoter methylation pattern of prodynorphin gene in human brain [ 27 ]. This figure shows a genomic region of 95 bp with 9 CpG dinucleotides. Most reads are completely unmethylated (open circles) or methylated (filled circles), but only a few reads are with both unmethylated and methylated cytosines.
Empirical type-I error rate and power for bipolar methylation detection
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| .079 | .075 | .087 | .279 | .580 | .983 |
|
| .090 | .077 | .082 | .590 | .875 | .997 |
|
| .082 | .080 | .094 | .875 | .976 | .998 |
|
| .084 | .083 | .088 | .887 | .986 | .997 |
|
| .085 | .088 | .088 | .931 | .995 | .998 |
|
|
|
| ||||
|
|
|
|
|
|
| |
|
| .032 | .015 | .006 | .275 | .556 | .871 |
|
| .034 | .015 | .004 | .528 | .771 | .984 |
|
| .031 | .017 | .008 | .770 | .937 | .996 |
|
| .027 | .020 | .008 | .778 | .946 | .996 |
|
| .025 | .016 | .006 | .782 | .951 | .997 |
The empirical type-I error rate and power are calculated from 5,000 simulations under significance level 0.05, for different number of reads m and for different cell-type proportion w. In all simulations, we set the threshold parameter δ= 0.35. The type-I error rates for the same m but different w are not the same because we set different methylation probability vectors for different w when generating data under H 0, although these probabilities are all sampled from beta(8, 8).
Average p-values for bipolar methylation detection
|
|
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |||
| (1, 1) | 1 | 1 | 1 | 1 | 1 | (3, 7) |
|
|
|
|
| |
| (1, 2) | 1 | 1 | 1 | 1 | 1 | (3, 8) |
|
|
|
|
| |
| (1, 3) | 0.95 | 0.90 | 0.95 | 0.99 | 0.97 | (3, 9) |
|
|
|
|
| |
| (1, 4) | 0.82 | 0.71 | 0.76 | 0.76 | 0.81 | (3,10) |
|
|
|
|
| |
| (1, 5) | 0.58 | 0.69 | 0.73 | 0.70 | 0.64 | (3,11) |
|
|
|
|
| |
| (1, 6) | 0.51 | 0.47 | 0.58 | 0.50 | 0.57 | (3,12) |
|
|
|
|
| |
| (1, 7) | 0.46 | 0.34 | 0.29 | 0.42 | 0.36 | (3,13) |
|
|
|
|
| |
| (1, 8) | 0.23 | 0.29 | 0.25 | 0.27 | 0.35 | (4, 4) |
|
|
|
|
| |
| (1, 9) | 0.14 | 0.14 | 0.15 | 0.15 | 0.16 | (4, 5) |
|
|
|
|
| |
| (1,10) | 0.09 | 0.09 | 0.11 | 0.08 | 0.09 | (4, 6) |
|
|
|
|
| |
| (1,11) | 0.07 | 0.07 | 0.06 | 0.06 | 0.07 | (4, 7) |
|
|
|
|
| |
| (1,12) |
|
|
|
|
| (4, 8) |
|
|
|
|
| |
| (1,13) |
|
|
|
|
| (4, 9) |
|
|
|
|
| |
| (1,14) |
|
|
|
|
| (4,10) |
|
| 0 |
|
| |
| (1,15) |
|
|
|
|
| (4,11) |
|
|
|
|
| |
| (2, 2) | 0.72 | 0.79 | 0.81 | 0.60 | 0.61 | (4,12) |
|
|
|
|
| |
| (2, 3) | 0.56 | 0.59 | 0.48 | 0.41 | 0.44 | (5, 5) |
|
|
|
|
| |
| (2, 4) | 0.38 | 0.41 | 0.45 | 0.28 | 0.28 | (5, 6) |
|
|
|
|
| |
| (2, 5) | 0.23 | 0.20 | 0.24 | 0.16 | 0.15 | (5, 7) |
|
|
|
|
| |
| (2, 6) | 0.14 | 0.13 | 0.10 | 0.11 | 0.07 | (5, 8) |
|
|
|
|
| |
| (2, 7) | 0.06 | 0.08 | 0.07 |
| 0.06 | (5, 9) |
|
|
|
|
| |
| (2, 8) |
|
|
|
|
| (5,10) |
|
|
|
|
| |
| (2, 9) |
|
|
|
|
| (5,11) |
|
|
|
|
| |
| (2,10) |
|
|
|
|
| (6, 6) |
|
|
|
|
| |
| (2,11) |
|
|
|
|
| (6, 7) |
|
|
|
|
| |
| (2,12) |
|
|
|
|
| (6, 8) |
|
|
|
|
| |
| (2,13) |
|
|
|
|
| (6, 9) |
|
|
|
|
| |
| (2,14) |
|
|
|
|
| (6,10) |
|
|
|
|
| |
| (3, 3) | 0.23 | 0.31 | 0.11 | 0.16 | 0.15 | (7, 7) |
|
|
|
|
| |
| (3, 4) | 0.16 | 0.12 | 0.08 | 0.08 | 0.10 | (7, 8) |
|
|
|
|
| |
| (3, 5) | 0.06 |
|
|
|
| (7, 9) |
|
|
|
|
| |
| (3, 6) |
|
|
|
|
| (8, 8) |
|
|
|
|
| |
The average p-values are calculated from 100 simulations for a 4-CpG segment with 16 reads, using τ= 0.32 and different δ values. Significant p-values (under significance level 0.05) are marked with boldfaced font. Zero-valued p-values are actually <5 × 10-3. In each simulation, we generate i reads with methylation pattern (0,0,0,0), j reads with methylation pattern (1,1,1,1), and randomly generate (16-i-j) reads with other methylation patterns.
Figure 2Comparison between DPM search, -means and Bayesian mixture clustering in Simulation III. A. Average mis-classification rates. B. Average power. These results are obtained from 1,000 simulations under the alternative hypothesis, for different number of reads and for different cell-type proportions, using DPM search, k-means and Bayesian mixture clustering.
Figure 3Analysis of mouse brain methylomes. A. Venn diagram shows the relationships between (a) bipolar segments identified from pooled dataset, (b) DMR identified between neuron and glia, (c) bipolar segments identified from glia, and (d) bipolar segments identified from neuron. B. Gene ontology analysis of genes associated with bipolar methylated segments in neuron and glia datasets, respectively. P-values for GO enrichment were adjusted with Bonferroni correction.
Figure 4Analysis of human blood methylomes. A. Venn diagram shows the relationships between (a) bipolar segments identified from pooled dataset, (b) DMR identified between B cell and neutrophil, (c) bipolar segments identified from B cell, and (d) bipolar segments identified from neutrophil. B. Gene ontology analysis of genes associated with bipolar methylated segments in B cell and neutrophil datasets, respectively. P-values for GO enrichment were adjusted with Bonferroni correction.