| Literature DB >> 26631489 |
Frank Jühling1, Helene Kretzmer1, Stephan H Bernhart1, Christian Otto1, Peter F Stadler2, Steve Hoffmann1.
Abstract
The detection of differentially methylated regions (DMRs) is a necessary prerequisite for characterizing different epigenetic states. We present a novel program, metilene, to identify DMRs within whole-genome and targeted data with unrivaled specificity and sensitivity. A binary segmentation algorithm combined with a two-dimensional statistical test allows the detection of DMRs in large methylation experiments with multiple groups of samples in minutes rather than days using off-the-shelf hardware. metilene outperforms other state-of-the-art tools for low coverage data and can estimate missing data. Hence, metilene is a versatile tool to study the effect of epigenetic modifications in differentiation/development, tumorigenesis, and systems biology on a global, genome-wide level. Whether in the framework of international consortia with dozens of samples per group, or even without biological replicates, it produces highly significant and reliable results.Entities:
Mesh:
Year: 2015 PMID: 26631489 PMCID: PMC4728377 DOI: 10.1101/gr.196394.115
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.(A) Workflow of metilene. After a presegmentation step to exclude noninformative regions, the circular binary segmentation is used to identify regions with significant differential methylation. The segmentation algorithm is applied recursively trying to identify a window (a,b) with the maximum difference of the cumulative sum of the mean methylation difference, indicating a potential DMR. (B) The performances of metilene, MOABS, and BSmooth were assessed in terms of true positive rates and positive predictive values (PPVs) for four different classes of DMRs, starting with highly different DMRs in class 1 and ending up with a set containing more indifferent DMRs in class 4. The DMRs were simulated within two background settings: the homogeneous background 1 and the more heterogeneous background 2. The evaluation was performed in terms of the fraction of correctly predicted CpGs within simulated DMRs (top) as well as in terms of simulated and predicted DMR segments with an overlap of at least 50% (bottom). (C) Boundary detection analysis for strong (left) and weak (right) differences in the background methylation level. (D) Results for metilene, MOABS, and BSmooth on the low-coverage data sets. (E) Runtime and memory consumption on a single core and 10 cores.
Figure 2.(A) Venn diagram of DMRs found by metilene, BSmooth, and MOABS in the WGBS medulloblastoma data on human Chromosome 10. (B) Count of DMRs exclusively found by metilene, BSmooth, and MOABS binned into methylation difference classes. (C) Box plots of P-values of exclusive DMRs using an independent Wilcoxon test. (D) Scatter plot of length and mean methylation differences of DMRs exclusively reported by metilene. Isoclines indicate their distribution, while labels denote the fraction of DMRs found inside the respective area. Note the minimum methylation cutoff at 0.1 (gray line). (E) Correlation of mean difference of exclusive metilene DMRs and 450k methylation beta values. The plot shows all DMRs covered by at least two probes on the array. (F) Figure of the DLG5 gene containing a DMR (red line) exclusively found by metilene. Methylation rates of control (top) and medulloblastoma (below) are heatmap color-coded, indicating low methylation rates in blue and high methylation rates in yellow. The MDS is shown above the DMR annotation and the gene annotation (bottom).