| Literature DB >> 26798339 |
Ming-An Sun1, Karthik Raja Velmurugan2, David Keimig1, Hehuang Xie3.
Abstract
The emerging genome-wide hairpin bisulfite sequencing (hairpin-BS-Seq) technique enables the determination of the methylation pattern for DNA double strands simultaneously. Compared with traditional bisulfite sequencing (BS-Seq) techniques, hairpin-BS-Seq can determine methylation fidelity and increase mapping efficiency. However, no computational tool has been designed for the analysis of hairpin-BS-Seq data yet. Here we present HBS-tools, a set of command line based tools for the preprocessing, mapping, methylation calling, and summarizing of genome-wide hairpin-BS-Seq data. It accepts paired-end hairpin-BS-Seq reads to recover the original (pre-bisulfite-converted) sequences using global alignment and then calls the methylation statuses for cytosines on both DNA strands after mapping the original sequences to the reference genome. After applying to hairpin-BS-Seq datasets, we found that HBS-tools have a reduced mapping time and improved mapping efficiency compared with state-of-the-art mapping tools. The HBS-tools source scripts, along with user guide and testing data, are freely available for download.Entities:
Year: 2015 PMID: 26798339 PMCID: PMC4698518 DOI: 10.1155/2015/760423
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Summary of the programs included in HBS-tools.
| Module name | Function |
|---|---|
| hbs_process | Preprocessing of raw reads, including bad quality bases trimming, sequencing adaptor, and hairpin adaptor removal |
| hbs_mapper | Original sequence recovery, mapping, and SAM file output |
| hbs_methylation_extractor | Extract and output methylation pattern from the SAM file |
| hbs_cg_mlmf | Summarize the methylation level and fidelity for covered CpG sites |
| hbs_ch_ml | Summarize the methylation level for covered non_CpG sites |
Figure 1Genome-wide hairpin bisulfite sequence generation and processing. The flow chart begins with double stranded genomic DNA ligated to hairpin adapter. Hairpin bisulfite PCR products are sequenced from both ends. HBS-tools accept these pair-end hairpin-BS-Seq reads to recover the original (pre-bisulfite converted) sequences. The recovered sequences are aligned with the reference genome. Methylation calls are obtained based on the sequence alignment between raw sequence reads and corresponding genome sequences.
Figure 2Comparison of the running times and mapping efficiencies between HBS-tools and Bismark. (a) CPU time used by Bismark and hbs_mapper for the mapping of public hairpin-BS-Seq datasets to reference. (b) The percentage of hairpin-BS-Seq read pairs mapped by Bismark and hbs_mapper, respectively.