| Literature DB >> 29391490 |
Benjamin T Mayne1,2, Shalem Y Leemaqz3,4, Sam Buckberry5,6, Carlos M Rodriguez Lopez7, Claire T Roberts3,4, Tina Bianco-Miotto3,8, James Breen9,10.
Abstract
Genotyping-by-sequencing (GBS) or restriction-site associated DNA marker sequencing (RAD-seq) is a practical and cost-effective method for analysing large genomes from high diversity species. This method of sequencing, coupled with methylation-sensitive enzymes (often referred to as methylation-sensitive restriction enzyme sequencing or MRE-seq), is an effective tool to study DNA methylation in parts of the genome that are inaccessible in other sequencing techniques or are not annotated in microarray technologies. Current software tools do not fulfil all methylation-sensitive restriction sequencing assays for determining differences in DNA methylation between samples. To fill this computational need, we present msgbsR, an R package that contains tools for the analysis of methylation-sensitive restriction enzyme sequencing experiments. msgbsR can be used to identify and quantify read counts at methylated sites directly from alignment files (BAM files) and enables verification of restriction enzyme cut sites with the correct recognition sequence of the individual enzyme. In addition, msgbsR assesses DNA methylation based on read coverage, similar to RNA sequencing experiments, rather than methylation proportion and is a useful tool in analysing differential methylation on large populations. The package is fully documented and available freely online as a Bioconductor package ( https://bioconductor.org/packages/release/bioc/html/msgbsR.html ).Entities:
Mesh:
Substances:
Year: 2018 PMID: 29391490 PMCID: PMC5794748 DOI: 10.1038/s41598-018-19655-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison between msgbsR and existing DNA methylation pipeline software tools.
| Package | Type of DNA methylation data | Format of files for importing | Filtering outliers and poor quality probes/sequences | Differential methylation analysis |
|---|---|---|---|---|
|
| MRE-seq | BAM | Yes | Yes |
|
| Array | IDAT | Yes | Yes |
|
| Array | IDAT | Yes | Yes |
|
| Array | XYS | Yes | Yes |
|
| WGBS | BAM | Yes | Yes |
|
| WGBS | BAM | Yes | Yes |
|
| RRBS/WGBS | bismarkbed2graph output[ | Yes | Yes |
RRBS: reduced representation bisulfite sequencing, MRE-seq: methylation sensitive restriction enzyme sequencing, WGBS: whole genome bisulfite sequencing.
Figure 1A simplified schematic of methylation-sensitive restriction enzyme sequencing approach and the msgbsR pipeline. (A) An example of MRE-seq/msGBS using the restriction enzyme, MspI, which cleaves DNA at the recognition sequence C^CGG if the internal cytosine is methylated. However, MspI does not cut at the recognition site if both cytosines are methylated or the external cytosine is methylated. (B) The data analysis pipeline represented by a flowchart which highlights the names of the main functions in the msgbsR package.
Figure 2The output of the plotCounts function showing the distribution of the library size compared to the total number of ApeKI cut sites produced for each sample from either the (A) barley or (B) maize data set. Each individual point represents a unique sample.
Figure 3The msgbsR pipeline on our rat prostate MRE-seq data. (A) Output of the plotCounts function showing the distribution of the total number of reads and cut sites per sample. Samples are coloured depending on their diet group. (B) A histogram of reads for a control sample showing a negative distribution. (C) A volcano plot showing differentially methylated sites (FDR < 0.01) between the control diet (blue dots) and the experimental diet (red dots).