| Literature DB >> 27358654 |
Wai-Shin Yong1, Fei-Man Hsu2, Pao-Yang Chen1.
Abstract
DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.Entities:
Keywords: Bisulfite sequencing; DNA methylation; Hydroxymethylation; Methylome; RRBS; Single-cell; WGBS
Year: 2016 PMID: 27358654 PMCID: PMC4926291 DOI: 10.1186/s13072-016-0075-3
Source DB: PubMed Journal: Epigenetics Chromatin ISSN: 1756-8935 Impact factor: 4.954
Fig. 1Commonly used methods for genome-wide DNA methylation analysis. a The procedures may involve fragmentation of genomic DNA by restriction enzyme digestion or sonication. The genomic DNA can be subjected to MBD enrichment, antibody enrichment, bisulfite conversion or TET oxidation before analyzing by microarray or next-generation sequencing platform. b Single-cell DNA methylation analysis that involves the isolation of single cells allows the assessment of methylation heterogeneity in cell populations while other genome-wide DNA methylation profiling methods using pooled heterogeneous cell populations are not capable to dissect the methylation heterogeneity. Blue concrete dots represent 5mC, and hollowed ones represent C. Each track represents 1 read
Experimental approaches for profiling genome-wide DNA methylation
| Experimental approach | Strength | Weakness | Resolution | Quantitative nature | Cost | Examples | References |
|---|---|---|---|---|---|---|---|
| CHARM | -Cost-effective | -Moderate resolution | – | Abundance | Low | CGI shores show alteration DNA methylation in colon cancer [ | [ |
| MBDCap-Seq | -Cost-effective | -Relatively low resolution | ~150 bp | Abundance | Moderate | Confirmed previous known differentially methylated sites and discovered new differentially methylated loci in 3 isogenic colon cancer cell lines [ | [ |
| MeDIP | -Cost-effective | -Biased toward hypermethylated regions | ~100 bp | Abundance | Moderate | MBDCap-seq shows higher genomic coverage than MeDIP-seq along with twice as many DMRs between colon cancer and adjacent normal cells [ | [ |
| Illumina’s Infinium Methylation assay | -Cost-effective | -Human sample only | Single base | Abundance | Low | DNA methylation as a signature to surrogate different cord blood cell types [ | [ |
| WGBS | Evaluate methylation state of almost every CpG sites | -High cost | Single base | Digital | High | Bulk methylation level of CpG/CHG/CHH of wild-type Arabidopsis and methyltransferase-deficient mutants [ | [ |
| RRBS | -High CGI coverage | -May exhibit a lack of coverage at intergenic and distal regulatory elements | Single base | Digital | Moderate | The EWAS study integrating DNA methylation, gene expression, proteomics, metabolomics and clinical traits in 90 mouse inbred strains [ | [ |
| scWGBS | Able to study methylome intra-population distribution | -Low sequencing efficiency (~20 million reads typically required per cell) | Single base | Digital | High | Determining epigenomic cell-state dynamics in mouse pluripotent and differentiating cells [ | [ |
| scRRBS | -Highly sensitive | -Substantial DNA degradation after bisulfite treatment | Single base | Digital | High | Profiling epigenomic dynamics of 1 million CpG sites during early embryonic development in ESCs [ | [ |
| TAB-seq | Can distinguish 5hmC from 5mC | -Substantial DNA degradation after bisulfite treatment | Single base | Digital | High | Profiling 5hmC distribution in 108 days human PGCs to reveal DNA demethylation [ | [ |
Fig. 2Schematic overview of genome-wide DNA methylation profiling methods. a 5mC assays. b 5hmC assays. The actual sample requirement may vary according to the type of sample, genome size and number of PCR cycles
Fig. 3Computational pipeline for genome-wide bisulfite sequencing data analysis. Reads from bisulfite sequencing are first aligned to the reference genome. The alignment data may be visualized in different tracks for comparison. After methylation calling, the bulk methylation level and genome-wide methylation level are calculated and plotted, and DMRs are determined. To perform an integrative analysis, DNA methylation data are coupled with gene expression, e.g., differentially expressed genes (DEGs), to delineate the regulatory role of DNA methylation
Bioinformatics tools for bisulfite sequencing data analysis
| Function(s) | Software | Features | References |
|---|---|---|---|
| Quality trim | Cutadapt | Removes adapter sequences | [ |
| Bisulfite sequencing aligner | Bismark | Three-letter aligner; supporting both Bowtie and Bowtie2 | [ |
| BRAT-BW | Three-letter aligner for mapping and methylation calling | [ | |
| BS Seeker 2 | Three-letter aligner; supporting local alignment, and computational removal of unconversion reads | [ | |
| MethylCoder | Three-letter aligner to be used with Bowtie or GSNAP | [ | |
| GSNAP | Wild card aligner | [ | |
| LAST | Wild card aligner wrapped in a general-purpose alignment tool | [ | |
| Data visualization | UCSC Genome Browser | Web-based genome browser allowing visualizing DNA methylation data ( | [ |
| WBSA | Web service for comprehensive analysis of WGBS and RRBS data and DMR finding ( | [ | |
| Integrative Genome Viewer (IGV) | Graphical genome browser to run locally on the user’s computer | [ | |
| Methylation plotter | Web-based tool that plot up to 100 samples in lollipop or grid style ( | [ | |
| Post-alignment data analysis | BSPAT | Summarizing and visualizing DNA methylation co-occurrence patterns | [ |
| GBSA | Sequencing quality assessment | [ | |
| MethGo | Calculating and plotting global methylation level | [ | |
| SAAP-RRBS | Read quality assessment | [ | |
| SMAP | Read quality assessment | [ | |
| Identifying DMR | BSmooth | A pipeline includes alignment, quality control and data analysis; the DMR finding function adapts bump hunting on smoothed t-like score; supporting multiple testing correction | [ |
| methylKit | R package for clustering, sample quality visualization and DMR finding with logistic regression; supporting multiple testing correction | [ | |
| methylSig | R package for DMR finding with likelihood ratio test; supporting multiple testing correction | [ | |
| methylPipe | R package for DMR finding with Wilcoxon or Kruskal–Wallis paired nonparametric test; supporting multiple testing correction | [ | |
| BiSeq | R package for DMR finding with Wald test; performing comprehensive RRBS data analysis; supporting multiple testing correction | [ |
BRAT-BW Bisulfite-treated Reads Analysis Tool (Burrows–Wheeler transform), UCSC genome browser University of California Santa Cruz Genome Browser, WBSA web service for bisulfite sequencing data analysis, BSPAT bisulfite sequencing pattern analysis tool, GBSA genome bisulfite sequencing analyser, IGV integrative genomics viewer, SAAP-RRBS streamlined analysis and annotation pipeline for RRBS data, SMAP streamlined methylation analysis pipeline