Jeremiah Wala1, Cheng-Zhong Zhang2, Matthew Meyerson3, Rameen Beroukhim3. 1. The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA Bioinformatics and Integrative Genomics, Harvard University, Cambridge, MA 02138, USA Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA. 2. The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA. 3. The Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02115, USA.
Abstract
UNLABELLED: We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. AVAILABILITY AND IMPLEMENTATION: VariantBam and full documentation are available at github.com/jwalabroad/VariantBam CONTACT: rameen@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
UNLABELLED: We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis. AVAILABILITY AND IMPLEMENTATION: VariantBam and full documentation are available at github.com/jwalabroad/VariantBam CONTACT: rameen@broadinstitute.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Derek W Barnett; Erik K Garrison; Aaron R Quinlan; Michael P Strömberg; Gabor T Marth Journal: Bioinformatics Date: 2011-04-14 Impact factor: 6.937
Authors: Kristian Cibulskis; Michael S Lawrence; Scott L Carter; Andrey Sivachenko; David Jaffe; Carrie Sougnez; Stacey Gabriel; Matthew Meyerson; Eric S Lander; Gad Getz Journal: Nat Biotechnol Date: 2013-02-10 Impact factor: 54.908
Authors: Michael G Ross; Carsten Russ; Maura Costello; Andrew Hollinger; Niall J Lennon; Ryan Hegarty; Chad Nusbaum; David B Jaffe Journal: Genome Biol Date: 2013-05-29 Impact factor: 13.583
Authors: Michelle D Noyes; William T Harvey; David Porubsky; Arvis Sulovari; Ruiyang Li; Nicholas R Rose; Peter A Audano; Katherine M Munson; Alexandra P Lewis; Kendra Hoekzema; Tuomo Mantere; Tina A Graves-Lindsay; Ashley D Sanders; Sara Goodwin; Melissa Kramer; Younes Mokrab; Michael C Zody; Alexander Hoischen; Jan O Korbel; W Richard McCombie; Evan E Eichler Journal: Am J Hum Genet Date: 2022-03-14 Impact factor: 11.043
Authors: Madlen Stange; Alfredo Mari; Tim Roloff; Helena Mb Seth-Smith; Michael Schweitzer; Myrta Brunner; Karoline Leuzinger; Kirstine K Søgaard; Alexander Gensch; Sarah Tschudin-Sutter; Simon Fuchs; Julia Bielicki; Hans Pargger; Martin Siegemund; Christian H Nickel; Roland Bingisser; Michael Osthoff; Stefano Bassetti; Rita Schneider-Sliwa; Manuel Battegay; Hans H Hirsch; Adrian Egli Journal: PLoS Pathog Date: 2021-03-19 Impact factor: 6.823
Authors: Maeva A Techer; Rahul V Rane; Miguel L Grau; John M K Roberts; Shawn T Sullivan; Ivan Liachko; Anna K Childers; Jay D Evans; Alexander S Mikheyev Journal: Commun Biol Date: 2019-10-01