| Literature DB >> 30581840 |
Yuanyuan Zhang1, Shudong Wang2, Xinzeng Wang3.
Abstract
BACKGROUND: DNA methylation is essential for regulating gene expression, and the changes of DNA methylation status are commonly discovered in disease. Therefore, identification of differentially methylation patterns, especially differentially methylated regions (DMRs), in two different groups is important for understanding the mechanism of complex diseases. Few tools exist for DMR identification through considering features of methylation data, but there is no comprehensive integration of the characteristics of DNA methylation data in current methods.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30581840 PMCID: PMC6276520 DOI: 10.1155/2018/1070645
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Flowchart of the proposed approach. Step 1, single site-level energy is calculated based on modified 1D Ising model. Step 2, candidate DMRs are identified using a greedy algorithm. Step 3, for each candidate DMR, the significance is assessed through permuting the sample labels.
Confusion matrix.
| Identified results of a method | |||
|---|---|---|---|
| Number of differential methylated sites within DMRs | Number of non-differential methylated sites without in DMRs | ||
| Real DMRs | Number of differential methylated sites within DMRs |
|
|
| True positive | False negative | ||
| Number of non-differential methylated sites without in DMRs |
|
| |
| False positive | True negative | ||
Different parameter settings.
| Parameters |
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | -2 | 2 | 1.5 | 0.3 | 0.7 |
|
| |||||
| 2 | -2 | 2 | 2.5 | 0.3 | 0.7 |
|
| |||||
| 3 | -3 | 3 | 1.5 | 0.3 | 0.7 |
|
| |||||
| 4 | -3 | 3 | 2.5 | 0.3 | 0.7 |
|
| |||||
| 5 | -2 | 2 | 1.5 | 0.3 | 0.4 |
Comparison of different methods in different parameters.
| Parameters | Bump hunting | DMRcate | ProbeLasso | Wang's method |
| |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SP | SE | No. TP (FP)a | No. TP (FP)b | SP | SE | No. TP (FP)a | No. TP (FP)b | SP | SE | No. TP (FP)a | No. TP (FP)b | SP | SE | No. TP (FP)a | No. TP (FP)b | SP | SE | No. TP (FP)a | No. TP (FP)b | |
| 1 | 0.99 | 0.12 | 0(1) | 0(1) | 1.00 | 0.25 | 2(0) | 2(0) | 1.00 | 0.16 | 3(1) | 1(3) | 0.99 | 0.53 | 6(1) | 3(4) | 0.99 | 0.81 | 9(0) | 7(2) |
| 2 | 1.00 | 0.23 | 2(0) | 0(2) | 1.00 | 0.25 | 2(0) | 2(0) | 1.00 | 0.12 | 2(1) | 1(2) | 0.99 | 0.38 | 5(0) | 3(2) | 0.99 | 0.90 | 10(0) | 10(0) |
| 3 | 1.00 | 0.15 | 0(2) | 0(2) | 1.00 | 0.25 | 2(0) | 2(0) | 1.00 | 0.16 | 3(1) | 1(3) | 1.00 | 0.59 | 9(0) | 4(5) | 0.99 | 0.82 | 8(0) | 6(2) |
| 4 | 1.00 | 0.26 | 1(1) | 2(0) | 1.00 | 0.25 | 2(0) | 2(0) | 1.00 | 0.16 | 3(1) | 1(3) | 1.00 | 0.52 | 5(0) | 3(2) | 0.99 | 0.93 | 10(0) | 10(0) |
| 5 | 1.00 | 0.18 | 3(0) | 1(2) | 1.00 | 0.25 | 2(0) | 2(0) | 1.00 | 0.12 | 2(1) | 1(2) | 1.00 | 0.45 | 8(1) | 3(6) | 0.99 | 0.79 | 9(1) | 7(3) |
aθ = 0.2; bθ = 0.5
Comparison results of our method with Wang's method based on β-value and M-value.
| Our method | Wang's method | Overlapping ratio | |
|---|---|---|---|
|
| 2127 | 1618 | 78.9% (1276/1618) |
|
| |||
| M-value | 7871 | 3070 | 93.0% (2856/3070) |
|
| |||
| Overlapping ratio | 88.3% (1879/2127) | 98.6% (1595/1618) | - - - |
Overlap results of different methods in identifying DMRs.
| ProbeLasso |
| 1566 | 2932 |
|
| DMRcate |
| 2579 |
| 2932 |
| Wang's |
|
| 2579 | 1566 |
|
|
|
|
|
|
|
| ||||
|
| Wang's | DMRcate | ProbeLasso | |
∗ Italic numbers indicate the numbers of DMRs identified by different methods. Black ones are overlap numbers of two methods.
Figure 2Distribution of DMRs relative to genes.
Figure 3An example of DMRs identified based on M-value but not for β-value. The difference of these two DMRs in mean signal (a) and variance signal (b) between normal and tumor samples.
A Comparison of the results to show the necessity of integrating data characteristics.
| Integrated method |
|
| Without considering correlation | |
|---|---|---|---|---|
| Numbers of DMRs | 883 | 2374 | 370 | 34 |
|
| ||||
| Overlapping (a/b) | 100% (883/883) | 64.5% (1532/2374) | 100% (370/370) | 94.1% (32/34) |
∗a is the numbers of overlapping DMRs compared with integrated method (1532>883 means that more DMRs coincide with one DMR identified by integrated method.); b is the numbers of DMRs identified by the corresponding method.