Haifeng Chen1, Andrew D Smith1, Ting Chen1. 1. Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA.
Abstract
Whole-genome bisulfite sequencing (WGBS) has emerged as the gold-standard technique in genome-scale studies of DNA methylation. Mapping reads from WGBS requires unique considerations that make the process more time-consuming than in other sequencing applications. Typical WGBS data sets contain several hundred million reads, adding to this analysis challenge. We present the WALT tool for mapping WGBS reads. WALT uses a strategy of hashing periodic spaced seeds, which leads to significant speedup compared with the most efficient methods currently available. Although many existing WGBS mappers slow down with read length, WALT improves in speed. Importantly, these speed gains do not sacrifice accuracy. AVAILABILITY AND IMPLEMENTATION: WALT is available under the GPL v3 license, and downloadable from https://github.com/smithlabcode/walt. CONTACT: andrewds@usc.edu or tingchen@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Whole-genome bisulfite sequencing (WGBS) has emerged as the gold-standard technique in genome-scale studies of DNA methylation. Mapping reads from WGBS requires unique considerations that make the process more time-consuming than in other sequencing applications. Typical WGBS data sets contain several hundred million reads, adding to this analysis challenge. We present the WALT tool for mapping WGBS reads. WALT uses a strategy of hashing periodic spaced seeds, which leads to significant speedup compared with the most efficient methods currently available. Although many existing WGBS mappers slow down with read length, WALT improves in speed. Importantly, these speed gains do not sacrifice accuracy. AVAILABILITY AND IMPLEMENTATION: WALT is available under the GPL v3 license, and downloadable from https://github.com/smithlabcode/walt. CONTACT: andrewds@usc.edu or tingchen@usc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Jing Liu; Amrita Banerjee; Charles A Herring; Jonathan Attalla; Ruiying Hu; Yanwen Xu; Qiujia Shao; Alan J Simmons; Prasanna K Dadi; Sui Wang; David A Jacobson; Bindong Liu; Emily Hodges; Ken S Lau; Guoqiang Gu Journal: Dev Cell Date: 2019-01-07 Impact factor: 12.270
Authors: Kelly R Barnett; Benjamin E Decato; Timothy J Scott; Tyler J Hansen; Bob Chen; Jonathan Attalla; Andrew D Smith; Emily Hodges Journal: Mol Cell Date: 2020-01-29 Impact factor: 17.970
Authors: Benjamin E Decato; Jorge Lopez-Tello; Amanda N Sferruzzi-Perri; Andrew D Smith; Matthew D Dean Journal: Mol Biol Evol Date: 2017-07-01 Impact factor: 16.240
Authors: Ayala Tovy; Jaime M Reyes; Michael C Gundry; Lorenzo Brunetti; Henry Lee-Six; Mia Petljak; Hyun Jung Park; Anna G Guzman; Carina Rosas; Aaron R Jeffries; Emma Baple; Jonathan Mill; Andrew H Crosby; Valerie Sency; Baozhong Xin; Heather E Machado; Danielle Castillo; Jeffrey N Weitzel; Wei Li; Michael R Stratton; Peter J Campbell; Heng Wang; Mathijs A Sanders; Margaret A Goodell Journal: Cell Stem Cell Date: 2020-07-15 Impact factor: 24.633
Authors: Lauren J Mills; Milcah C Scott; Pankti Shah; Anne R Cunanan; Archana Deshpande; Benjamin Auch; Bridget Curtin; Kenneth B Beckman; Logan G Spector; Aaron L Sarver; Subbaya Subramanian; Todd A Richmond; Jaime F Modiano Journal: Bone Date: 2020-10-27 Impact factor: 4.398