| Literature DB >> 35053511 |
Angelika Merkel1, Manel Esteller2,3,4,5.
Abstract
DNA methylation is an essential epigenetic mark. Alterations of normal DNA methylation are a defining feature of cancer. Here, we review experimental and bioinformatic approaches to showcase the breadth and depth of information that this epigenetic mark provides for cancer research. First, we describe classical approaches for interrogating bulk DNA from cell populations as well as more recently developed approaches for single cells and multi-Omics. Second, we focus on the computational analysis from primary data processing to the identification of unique methylation signatures. Additionally, we discuss challenges such as sparse data and cellular heterogeneity.Entities:
Keywords: DNA methylation; cancer; computational analysis; methods; software
Year: 2022 PMID: 35053511 PMCID: PMC8773752 DOI: 10.3390/cancers14020349
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Popularity (Pubmed Citations) of DNA methylation approaches. (A) Bulk DNA methylation assays. (B) Experimental and bioinformatic approaches for bisulfite sequencing (WGBS + RRBS) and bisulfite arrays (450K and EPIC). Bismark, popular BS-seq analysis tool; Minfi, popular microarray analysis tool. (C) Single-cell DNA methylation sequencing assays. (D) Single-cell multi-omics sequencing assays.
Data analysis and methylation calling.
| Description | Software | Bulk BS-Seq | scBS-Seq | AE-Seq | BS- | Ref |
|---|---|---|---|---|---|---|
| Quality control | FastQC | yes | yes | yes | [ | |
| Adapter/end-base trimming | TrimGalore | yes | yes | [ | ||
| BS-aware read alignment | BISMARK, BS Seeker2, gemBS, BSMAP | yes | yes | [ | ||
| Remove PCR duplicates | PicardTools | yes | yes | yes | [ | |
| Variant calling | gemBS, Bis-SNP, GATK | yes | [ | |||
| Methylation calling | BISMARK, Bis-SNP, gemBS, MethylExtract | yes | yes | [ | ||
| standard read alignment | bowtie2, BWA | yes | [ | |||
| Normalization | DESeq2, MEDIPS, Diffbind | yes | [ | |||
| Enrichment analysis | QSEA, RaMWAS, Diffbind | yes | [ | |||
| Quality control | minfi, limma, wateRmelon | yes | [ | |||
| Normalization | minfi, limma, wateRmelon | yes | [ | |||
| Methylation calling (bvalues, mvalues) | minfi, wateRmelon | yes | [ |
Methods for data imputation and exploratory analysis.
| Process | Description | Method | Software | BulkBS-Seq | scBS-Seq | AE-Seq | BS-Arrays | Ref |
|---|---|---|---|---|---|---|---|---|
| Visualization | Variance decomposition | PCA | R | yes | yes | yes | yes | [ |
| Dimensionality reduction | MDS, t-SNE, NMF | MASS, stats, Rtsne, NMF | yes | yes | (yes) | (yes) | [ | |
| Clustering | Clustering (nearest neighbour) | k-means | ||||||
| Hierarchical clustering | hclust() | stats, cluster, | yes | yes | yes | yes | [ | |
| Imputation of | Based on local spatial methylation correlation | Local likelihood smoothing | BSmooth | yes | (yes) | [ | ||
| Based on local spatial methylation correlations within and across cells and different genomic regions | glm, Bayesian clustering | Melissa | yes | [ | ||||
| Based on local spatial methylation correlations within and across cells and different genomic regions | Bayesian clustering, hierarchical mixture model | Epiclonal | yes | yes | [ | |||
| Based on neighbouring CpG correlation and sequence composition | Deep neural network | DeepCpG | yes | [ |
Methods for cell-type deconvolution and estimation of tumour purity.
| Task | Class | Method | Software | Bulk BS-Seq | scBS-Seq | BS-Arrays | Ref |
|---|---|---|---|---|---|---|---|
| Remove unwanted variation (including batch effects) | Reference-free | Surrogate and independent surrogate variable analysis | SVA | yes | yes | [ | |
| Remove unwanted variation | RUV, missMethyl | yes | [ | ||||
| Intra-sample cell type deconvolution | Reference-free, semi-reference-free | NMF using recursive QP | RefFreeEWAS | yes | yes | ||
| Reference based | Robust partial correlations, CIBERSORT, Houseman CP, COMBAT | HEpiDISH/EpiDISH | yes | [ | |||
| CIBERSORT | METHYLCIBERSORT | yes | yes | [ | |||
| Reference based using scRNAseq | EPISCORE | yes | |||||
| Estimate immune cell fraction in tumours | Reference based | MethylResolveR | [ | ||||
| Inference of tumour burden and tissue of origin from plasma cfDNA | CancerDetector | yes | |||||
| Estimate tumour purity from plasma cf-DNA | Reference-free | Concordance of neighbouring CpGs | CancerDetector | [ | |||
| Estimate epipolymorphism, methylation entropy, clonal heterogeneity | Reference-free | Epiallele frequency | WSH | yes | (yes) | [ |
Popular methods for differential DNA methylation.
| Type | Method | Distribution | Software | Bulk BS-Seq | scBS-Seq | AE-Seq | BS- | Ref |
|---|---|---|---|---|---|---|---|---|
| DMC, DMR (predefined) | Fisher’s Exact test, logistic regression | Binomial (dispersion) | MethylKit | yes | [ | |||
| DMC, DMR (predefined) | Likelihood ratio | Beta-binomial | MethylSig | yes | [ | |||
| DMC, DMR (defines) | Wald test, linear regression | Beta-binomial (dispersion) | DSS | yes | [ | |||
| DMC, DMR (defines) | local linear regression, smoothing, | Binomial | BSseq (BSmooth), | yes | [ | |||
| DMC, DMR (predefind) | Linear regression, | Linear | RnBeads | yes | yes | yes | [ | |
| DMC, DMR (predefind) | glm, likelihood ratio | Negative-binomial (dispersion) | EdgeR | yes | yes | [ | ||
| DMC, DMR (predefind) | glm, Wald test | Negative-binomial (dispersion) | DEseq2 (Diffbind) | yes | yes | [ | ||
| DMC | non-parametric test, beta-regression | Gauss | limma | yes | [ | |||
| DMC, DMR (defines) | local linear models, smoothing | Gauss | minfi (bump hunter, DMPfinder) | yes | [ | |||
| DMC, DMR (defines) | local linear models, smoothing | DMRcate | yes | [ | ||||
| DMC, DMR (defines) | Linear models, combining subregions | Gauss | dmrff | yes | [ |
Popular approaches for methylome segmentation.
| Type | Method | Model | Software | Bulk DNA BS-Seq | Ref |
|---|---|---|---|---|---|
| UMR, LMR, HMR | Segmentation | 3-State HMM | MethylSeekR | yes | [ |
| PMD | Segmentation | 3-State HMM | MethylSeekR | yes | |
| Hypo/Hypermethylated regions, DMR, PMR, PMD, AMR | Segmentation | 2-State HMM, | methPipe | yes | [ |