| Literature DB >> 24875479 |
Daniel Becker1, Pavlo Lutsik2, Peter Ebert3, Christoph Bock4, Thomas Lengauer3, Jörn Walter5.
Abstract
Recent data suggest important biological roles for oxidative modifications of methylated cytosines, specifically hydroxymethylation, formylation and carboxylation. Several assays are now available for profiling these DNA modifications genome-wide as well as in targeted, locus-specific settings. Here we present BiQ Analyzer HiMod, a user-friendly software tool for sequence alignment, quality control and initial analysis of locus-specific DNA modification data. The software supports four different assay types, and it leads the user from raw sequence reads to DNA modification statistics and publication-quality plots. BiQ Analyzer HiMod combines well-established graphical user interface of its predecessor tool, BiQ Analyzer HT, with new and extended analysis modes. BiQ Analyzer HiMod also includes updates of the analysis workspace, an intuitive interface, a custom vector graphics engine and support of additional input and output data formats. The tool is freely available as a stand-alone installation package from http://biq-analyzer-himod.bioinf.mpi-inf.mpg.de/.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24875479 PMCID: PMC4086109 DOI: 10.1093/nar/gku457
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Principal scheme of oxBS-seq and fCAB-seq methods. (a) oxBS-seq. Oxidative bisulfite treatment followed by PCR leads to conversion of the oxidative modifications to thymines while only 5mC appears as cytosine in the sequencing reads. The cumulative level of 5hmC can be established for each CpG site by comparing to ordinary bisulfite sequencing. (b) In fCAB-seq 5fC is protected from the conversion and together with 5mC and 5hmC appears as cytosine after PCR. The bulk 5fC abundance is calculated by subtracting the cumulative levels of 5mC + 5hmC pair obtained from ordinary bisulfite sequencing.
Information that can be extracted from each DNA modification profiling method supported by BiQ Analyzer HiMod
SM—single-molecule resolution, B—bulk per-CpG measurement.
Figure 2.BiQ Analyzer HiMod visualization features. (a). Global heat map of average bulk (5mC + 5hmC) modification levels at sequenced loci across the samples. The default color code is given in the color legend (see Supplementary Figure S2 for more details). (b) Bar chart displaying stacked levels of two modifications at each CpG position in a single locus (see also Supplementary Figure S3). (c) Diagnostic histogram visualizing the distribution of a quality control metric—sequence identity of each read-reference alignment—simplifying the sequence identity cutoff selection. (d) Integration with the IGV genome browser showing genomic tracks with 5mC and 5hmC levels of three samples (the value range is much smaller for the 5hmC tracks).
Benchmarking of BiQ Analyzer HiMod on artificially generated read batches of variable size
| Number. of reads in each batcha | 1000 | 2000 | 5000 | 10 000 | 20 000 | 50 000 | |
|---|---|---|---|---|---|---|---|
| Running time, min.sec | Normal | 0.15 | 0.32 | 1.52 | 5.20 | 16.48 | 155.15 |
| w/o pattern mapsb | 0.15 | 0.26 | 1.06 | 2.13 | 4.25 | 25.14 |
aGiven are numbers of reads in each of two read batches (oxBS and conventional bisulfite) processed in every benchmarking step. Therefore, the actual number of processed reads was twice the given number in each step.
bThe high computation time for large read sets is based on the size of the produced single read-resolution pattern maps. The user can decide which graphics to generate and by choosing to disable pattern maps the computation time can be significantly reduced.