| Literature DB >> 31533352 |
Fengjiao Dunbar1, Hongyan Xu2, Duchwan Ryu3, Santu Ghosh4, Huidong Shi5, Varghese George6.
Abstract
Researchers in genomics are increasingly interested in epigenetic factors such as DNA methylation, because they play an important role in regulating gene expression without changes in the DNA sequence. There have been significant advances in developing statistical methods to detect differentially methylated regions (DMRs) associated with binary disease status. Most of these methods are being developed for detecting differential methylation rates between cases and controls. We consider multiple severity levels of disease, and develop a Bayesian statistical method to detect the region with increasing (or decreasing) methylation rates as the disease severity increases. Patients are classified into more than two groups, based on the disease severity (e.g., stages of cancer), and DMRs are detected by using moving windows along the genome. Within each window, the Bayes factor is calculated to test the hypothesis of monotonic increase in methylation rates corresponding to severity of the disease versus no difference. A mixed-effect model is used to incorporate the correlation of methylation rates of nearby CpG sites in the region. Results from extensive simulation indicate that our proposed method is statistically valid and reasonably powerful. We demonstrate our approach on a bisulfite sequencing dataset from a chronic lymphocytic leukemia (CLL) study.Entities:
Keywords: Bayes factor; Bayesian mixed-effect model; CpG sites; DNA methylation; Ordinal responses
Mesh:
Substances:
Year: 2019 PMID: 31533352 PMCID: PMC6770971 DOI: 10.3390/genes10090721
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Conditional probabilities pkj at each CpG site for simulation of BFM under Scenario 1.
| Site | 1 | 2 | … | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | … | 24 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| group 1 | 0.44 | 0.46 | … | 0.6 | 0.62 | 0.64 | 0.66 | 0.66 | 0.64 | 0.62 | 0.6 | 0.58 | … | 0.44 |
| group 2 | 0.44 | 0.46 | … | 0.6 | 0.72 | 0.74 | 0.76 | 0.76 | 0.74 | 0.72 | 0.6 | 0.58 | … | 0.44 |
| group 3 | 0.44 | 0.46 | … | 0.6 | 0.82 | 0.84 | 0.86 | 0.86 | 0.84 | 0.82 | 0.6 | 0.58 | … | 0.44 |
| group 4 | 0.44 | 0.46 | … | 0.6 | 0.92 | 0.94 | 0.96 | 0.96 | 0.94 | 0.92 | 0.6 | 0.58 | … | 0.44 |
Mean Bayes factors at each CpG site, based on simulation studies.
| Start | End | N = 50 (Scenario 1) | N = 100 (Scenario 1) | N = 50 (Scenario 2) |
|---|---|---|---|---|
| 1 | 6 | 1.02 | 1.02 | 1.03 |
| 2 | 7 | 1.01 | 1.02 | 1.01 |
| 3 | 8 | 1.01 | 1.02 | 1.02 |
| 4 | 9 | 1.02 | 1.01 | 1.01 |
| 5 | 10 | 1.24 | 1.53 | 1.26 |
| 6 | 11 | 1.78 | 3.12 | 1.78 |
| 7 | 12 | 2.95 | 9.16 | 2.85 |
| 8 | 13 | 5.74 | 41.42 | 4.95 |
| 9 | 14 | 10.53 | 1052.07 | 9.31 |
| 10 | 15 | 18.79 | 8554.12 | 18.31 |
| 11 | 16 | 13.9 | 3718.77 | 13.79 |
| 12 | 17 | 8.44 | 306.07 | 8.12 |
| 13 | 18 | 4.43 | 21.91 | 4.5 |
| 14 | 19 | 2.4 | 5.66 | 2.6 |
| 15 | 20 | 1.52 | 2.22 | 1.6 |
| 16 | 21 | 1.07 | 1.11 | 1.07 |
| 17 | 22 | 1.03 | 1.04 | 1.02 |
| 18 | 23 | 1.01 | 1.03 | 1.02 |
| 19 | 24 | 1.03 | 1.03 | 1.01 |
Figure 1Mean of Bayes factors at each CpG site with N = 50 (Scenario 1).
Figure 2Mean of Bayes factors at each CpG site with N = 100 (Scenario 1).
Proportions of Bayes factors that fell above the cut-off.
| Cut-off Point | Number of DMCs in the Windows | ||||||
|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | |
| 1.34 | 0.050 | 0.56 | 0.97 | 1 | 1 | 1 | 1 |
| 1.5 | 0.010 | 0.35 | 0.91 | 1 | 1 | 1 | 1 |
Comparison of BFM and SSM for window size of 10 (p < 0.05).
| BFM > 2 | SSM ( | Common | |
|---|---|---|---|
| Total | 183 | 181 | 67 |
| PubMed | 42 | 41 | 18 |