| Literature DB >> 32484876 |
Viivi Halla-Aho1, Harri Lähdesmäki1.
Abstract
MOTIVATION: DNA methylation is an important epigenetic modification, which has multiple functions. DNA methylation and its connections to diseases have been extensively studied in recent years. It is known that DNA methylation levels of neighboring cytosines are correlated and that differential DNA methylation typically occurs rather as regions instead of individual cytosine level.Entities:
Year: 2020 PMID: 32484876 PMCID: PMC7750928 DOI: 10.1093/bioinformatics/btaa539
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Plate diagram of the LuxUS model. The gray and white circles represent observed and latent variables, respectively. Rectangles represent fixed hyperparameters, design matrices or other type of input data
Fig. 2.The top panel shows the methylation states for a chosen genomic region in chromosome 21 (from 28 214 962 to 28 217 867 bp) for the six samples in the colon cancer dataset. The colon cancer sample methylation states have been plotted in orange and the normal colon samples in purple. The experiment was paired, and the cancer and colon samples from the same person have been plotted with the same marker symbols. The middle panel shows the LuxUS Bayes factor values with log scaling. The Bayes factor values have been truncated to 1000. The cytosines which were filtered from further analysis in the preanalysis phase have been plotted with value 0 with empty markers. The horizontal red line corresponds to the Bayes factor value 3. The bottom panel shows the RADMeth P-values for the same genomic region. The horizontal red line corresponds to the adjusted P-value of 0.05. The cytosines for which RADMeth produced no P-value are plotted with P-value of 1 with empty markers
AUROC values for LuxUS with HMC and ADVI, LuxUS for separate cytosines, RADMeth, M3D, metilene, DSS and bsseq for simulated dataset with , corresponding to
|
|
| LuxUS | LuxUS | LuxUS | RAD- | M3D | metilene | metilene | DSS | bsseq |
|---|---|---|---|---|---|---|---|---|---|---|
| HMC | ADVI | sep. | Meth | mode 2 | ||||||
| 6 | 6 | 0.734 | 0.738 | 0.522 | 0.607 | 0.719 | 0.631 |
| 0.738 | 0.615 |
| 6 | 12 | 0.814 | 0.783 | 0.606 | 0.712 | 0.703 | 0.658 |
| 0.801 | 0.643 |
| 6 | 24 |
| 0.909 | 0.687 | 0.819 | 0.682 | 0.759 | 0.917 | 0.904 | 0.711 |
| 12 | 6 | 0.687 | 0.696 | 0.545 | 0.632 | 0.640 | 0.590 |
| 0.690 | 0.590 |
| 12 | 12 | 0.822 | 0.803 | 0.625 | 0.756 | 0.701 | 0.678 |
| 0.812 | 0.654 |
| 12 | 24 | 0.927 | 0.899 | 0.721 | 0.867 | 0.755 | 0.781 |
| 0.924 | 0.754 |
| 24 | 6 | 0.667 | 0.668 | 0.541 | 0.612 |
| 0.635 | 0.686 | 0.680 | 0.597 |
| 24 | 12 | 0.840 | 0.835 | 0.637 | 0.765 | 0.777 | 0.735 | 0.850 |
| 0.712 |
| 24 | 24 | 0.933 | 0.886 | 0.762 | 0.890 | 0.732 | 0.830 |
| 0.916 | 0.773 |
Note: The highest AUROC value is bolded for each simulation scenario. denotes the number of sequencing reads overlapping a cytosine and N denotes the number of samples.
AUROC values for LuxUS with HMC and ADVI, LuxUS for separate cytosines, RADMeth, M3D, DSS, metilene and bsseq for simulated dataset with , corresponding to
|
|
| LuxUS | LuxUS | LuxUS | RAD- | M3D | metilene | metilene | DSS | bsseq |
|---|---|---|---|---|---|---|---|---|---|---|
| HMC | ADVI | sep. | Meth | mode 2 | ||||||
| 6 | 6 |
| 0.890 | 0.737 | 0.824 | 0.874 | 0.816 | 0.926 | 0.917 | 0.770 |
| 6 | 12 |
| 0.975 | 0.889 | 0.974 | 0.949 | 0.923 | 0.999 | 0.997 | 0.912 |
| 6 | 24 |
| 0.960 | 0.974 | 0.994 | 0.969 | 0.910 |
|
| 0.979 |
| 12 | 6 | 0.960 | 0.942 | 0.781 | 0.881 | 0.914 | 0.821 |
| 0.957 | 0.825 |
| 12 | 12 |
| 0.940 | 0.913 | 0.978 | 0.953 | 0.950 | 0.999 | 0.998 | 0.926 |
| 12 | 24 |
| 0.940 | 0.982 | 0.995 | 0.959 | 0.982 |
|
| 0.982 |
| 24 | 6 |
| 0.924 | 0.795 | 0.871 | 0.908 | 0.836 | 0.939 | 0.936 | 0.830 |
| 24 | 12 |
| 0.934 | 0.918 | 0.966 | 0.925 | 0.931 | 0.993 | 0.988 | 0.923 |
| 24 | 24 |
| 0.945 | 0.993 | 0.996 | 0.972 | 0.964 |
|
| 0.992 |
Note: The highest AUROC value is bolded for each simulation scenario. denotes the number of sequencing reads overlapping a cytosine and N denotes the number of samples.
Fig. 3.The AUROC value for LuxUS as function of number of cytosines in a 1000 bp window for simulated data and its interpolation with third-degree polynomial. Each AUROC value is estimated using 400 datasets, including 200 datasets with differential methylation and 200 datasets with no differential methylation
AUROC values for LuxUS with HMC and ADVI, LuxUS for separate cytosines, RADMeth, M3D, DSS, metilene and bsseq for simulated dataset with confounding covariates
|
|
| LuxUS | LuxUS | LuxUS | RAD- | M3D | metilene | metilene | DSS | bsseq |
|---|---|---|---|---|---|---|---|---|---|---|
| HMC | ADVI | sep. | Meth | mode 2 | ||||||
| 6 | 12 |
| 0.851 | 0.605 | 0.728 | 0.674 | 0.625 | 0.717 | 0.845 | 0.614 |
| 6 | 24 |
| 0.883 | 0.682 | 0.800 | 0.618 | 0.735 | 0.840 | 0.870 | 0.688 |
| 12 | 12 |
| 0.802 | 0.634 | 0.702 | 0.644 | 0.676 | 0.712 | 0.757 | 0.628 |
| 12 | 24 |
| 0.899 | 0.748 | 0.821 | 0.722 | 0.772 | 0.861 | 0.915 | 0.737 |
| 24 | 12 |
| 0.738 | 0.641 | 0.717 | 0.658 | 0.592 | 0.684 | 0.750 | 0.626 |
| 24 | 24 |
| 0.880 | 0.731 | 0.827 | 0.690 | 0.709 | 0.836 | 0.874 | 0.714 |
Note: The highest AUROC value is bolded for each simulation scenario. denotes the number of sequencing reads overlapping a cytosine and N denotes the number of samples.
Fig. 4.Receiver operating characteristics curves for LuxUS with HMC and ADVI model estimation, RADMeth, M3D, bsseq, metilene and DSS for the simulated data by Hebestreit . The dashed black line shows the expected ROC curve for random guessing