| Literature DB >> 31831061 |
Ieva Rauluseviciute1, Finn Drabløs2, Morten Beck Rye2,3.
Abstract
Sequencing technologies have changed not only our approaches to classical genetics, but also the field of epigenetics. Specific methods allow scientists to identify novel genome-wide epigenetic patterns of DNA methylation down to single-nucleotide resolution. DNA methylation is the most researched epigenetic mark involved in various processes in the human cell, including gene regulation and development of diseases, such as cancer. Increasing numbers of DNA methylation sequencing datasets from human genome are produced using various platforms-from methylated DNA precipitation to the whole genome bisulfite sequencing. Many of those datasets are fully accessible for repeated analyses. Sequencing experiments have become routine in laboratories around the world, while analysis of outcoming data is still a challenge among the majority of scientists, since in many cases it requires advanced computational skills. Even though various tools are being created and published, guidelines for their selection are often not clear, especially to non-bioinformaticians with limited experience in computational analyses. Separate tools are often used for individual steps in the analysis, and these can be challenging to manage and integrate. However, in some instances, tools are combined into pipelines that are capable to complete all the essential steps to achieve the result. In the case of DNA methylation sequencing analysis, the goal of such pipeline is to map sequencing reads, calculate methylation levels, and distinguish differentially methylated positions and/or regions. The objective of this review is to describe basic principles and steps in the analysis of DNA methylation sequencing data that in particular have been used for mammalian genomes, and more importantly to present and discuss the most pronounced computational pipelines that can be used to analyze such data. We aim to provide a good starting point for scientists with limited experience in computational analyses of DNA methylation and hydroxymethylation data, and recommend a few tools that are powerful, but still easy enough to use for their own data analysis.Entities:
Keywords: Bisulfite sequencing; Computational pipelines; DNA methylation; Hydroxymethylation
Year: 2019 PMID: 31831061 PMCID: PMC6909609 DOI: 10.1186/s13148-019-0795-x
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Selected tools and their features. Whether the pipeline is capable of performing an analysis from raw reads to DMRs and DMPs was a crucial criterion for the selection of tools. However, other aspects, such as graphic output and availability of a detailed manual, were also important for the final recommendation
| Selected tool | Experimental approach | From raw reads to DMPs and DMRs | Graphic output | Detailed manual available | Reference | |||
|---|---|---|---|---|---|---|---|---|
| Quality control and 3′ trimming | Alignment | Methylation levels | Differential methylation | |||||
| WGBS | No | Yes | Yes | DMRs only | BED, bedGraph, Tab-del | Yes | [ | |
| WGBS, RRBS, and possibly 5hmC seq | Yes | Yes | Yes, but no beta score | Yes | BED, bedGraph, Tab-del | Yes | [ | |
| WGBS and 5hmC seq | Error estimation only | Yes | Yes | Yes | BED, bedGraph, Tab-del | Yes | [ | |
| WGBS and 5hmC seq | Yes | Yes | Yes | Yes | BED, bedGraph, Tab-del, VCF | Yes | [ | |
| RRBS only | Trimming only | Yes | Yes | Yes, also SNPs | Tab-del | Yes | [ | |
| WGBS and RRBS | Yes | Yes | Yes | No | Yes | Yes | [ | |
| MeDIP-seq | Yes | Yes | Yes | No | No | Yes | [ | |
| MeDIP-seq | Yes | Yes | Yes | Yes | No | Yes | [ | |
| MRE-seq | Yes | No | Yes | Yes | Yes | Yes | [ | |
*Recommended
Tab-del, tab-delimited output