| Literature DB >> 33966074 |
Colin Farrell1, Michael Thompson2, Anela Tosevska2, Adewale Oyetunde2, Matteo Pellegrini2.
Abstract
BACKGROUND: Bisulfite sequencing is commonly used to measure DNA methylation. Processing bisulfite sequencing data is often challenging owing to the computational demands of mapping a low-complexity, asymmetrical library and the lack of a unified processing toolset to produce an analysis-ready methylation matrix from read alignments. To address these shortcomings, we have developed BiSulfite Bolt (BSBolt), a fast and scalable bisulfite sequencing analysis platform. BSBolt performs a pre-alignment sequencing read assessment step to improve efficiency when handling asymmetrical bisulfite sequencing libraries.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33966074 PMCID: PMC8106542 DOI: 10.1093/gigascience/giab033
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:BSBolt Workflows BSBolt is implemented as a series of discrete modules for read simulation, index generation, read alignment, methylation calling, and matrix aggregation. All BSBolt modules can be run using a command line interface or within a Python (>3.6) environment natively.
Simulated Bisulfite Sequencing Library Parameters: The parameters used to simulate libraries using BSBolt for tool comparisons. All simulations were carried out at read lengths of 50,100 and 150 base pairs.
| Average Read Depth | Mutation Rate | Sequencing error | Sequencing type | Library type |
|---|---|---|---|---|
| 30 | 0.005 | 0.005 | Paired End | Undirectional |
| 30 | 0.005 | 0.005 | Single End | Undirectional |
| 20 | 0.005 | 0.005 | Paired end | Directional |
| 20 | 0.005 | 0.005 | Single End | Directional |
| 8 | 0.005 | 0.005 | Paired end | Directional |
| 8 | 0.005 | 0.005 | Single End | Directional |
| 8 | 0.01 | 0.02 | Paired End | Directional |
Figure 2:Simulated bisulfite-sequencing library performance. (A) Reads aligned per minute for each bisulfite alignment tool. (B) Proportion of simulated reads mapped during alignments. Note, BWA-Meth does not support undirectional library alignment, resulting in low mappability for undirectional libraries. (C) Methylation call time (min) for each alignment tool. (D) Mean absolute error (MAE) observed between the simulated and called methylation value. Box and whisker show distribution quartiles with whisker extending to points lying within 1.5 interquartile ranges past the nearest quartile.
Figure 3:Targeted bisulfite-sequencing library performance. (A) The number of read pairs aligned per minute for each bisulfite alignment tool. (B) Total methylation calling time (min) for each alignment file. (C) The absolute difference between array methylation values and sequencing methylation values for overlapping calls, binned by effective read depth. Box and whisker show distribution quartiles with whisker extending to points lying within 1.5 interquartile ranges past the nearest quartile.