
Succinct dynamic de Bruijn graphs.

Bahar Alipanahi, Alan Kuhnle, Simon J Puglisi, Leena Salmela, Christina Boucher

Abstract

MOTIVATION: The de Bruijn graph is one of the fundamental data structures for the analysis of high-throughput sequencing data. To be applicable to population-scale studies, the graph must be built and stored in a space- and time-efficient manner. In addition, because population studies change continually, it has become essential to update the graph after construction, e.g. to add and remove nodes and edges. Although substantial effort has gone into making the construction and storage of the graph efficient, little work addresses building the graph in a manner that is both efficient and mutable. Hence, most space-efficient data structures require complete reconstruction of the graph in order to add or remove edges or nodes.
RESULTS: In this article, we present DynamicBOSS, a succinct representation of the de Bruijn graph that allows for an unlimited number of additions and deletions of nodes and edges. We compare our method with competing methods and demonstrate that DynamicBOSS is the only method that supports both addition and deletion and is applicable to very large samples (e.g. greater than 15 billion k-mers). Competing dynamic methods either cannot be constructed on large-scale datasets (e.g. FDBG) or cannot support both addition and deletion (e.g. BiFrost).
AVAILABILITY AND IMPLEMENTATION: DynamicBOSS is publicly available at https://github.com/baharpan/dynboss.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
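
ILLUSTRATION: The addition and deletion operations mentioned in the abstract can be pictured with a small, non-succinct toy graph. The sketch below is not DynamicBOSS and does not reflect its API or its space usage; it stores the graph in an ordinary hash map. The class name `ToyDynamicDeBruijnGraph` and all of its methods are hypothetical, introduced only for this example. Nodes are assumed to be k-mers and edges (k+1)-mers over the alphabet ACGT.

```python
class ToyDynamicDeBruijnGraph:
    """Hash-map de Bruijn graph supporting updates (hypothetical toy, NOT DynamicBOSS)."""

    ALPHABET = "ACGT"

    def __init__(self, k):
        self.k = k
        # node (k-mer) -> set of bases labelling its outgoing edges
        self.succ = {}

    def add_node(self, node):
        assert len(node) == self.k
        self.succ.setdefault(node, set())

    def add_edge(self, edge):
        # An edge is a (k+1)-mer joining its k-length prefix to its k-length suffix.
        assert len(edge) == self.k + 1
        src, dst = edge[:-1], edge[1:]
        self.add_node(src)
        self.add_node(dst)
        self.succ[src].add(edge[-1])

    def remove_edge(self, edge):
        # Removes the edge only; both endpoint nodes stay in the graph.
        src = edge[:-1]
        if src in self.succ:
            self.succ[src].discard(edge[-1])

    def remove_node(self, node):
        # Drops the node with its outgoing edges, then every incoming edge.
        if node not in self.succ:
            return
        del self.succ[node]
        for base in self.ALPHABET:
            pred = base + node[:-1]
            if pred in self.succ:
                self.succ[pred].discard(node[-1])

    def add_sequence(self, seq):
        # Inserts every (k+1)-mer of the sequence as an edge.
        for i in range(len(seq) - self.k):
            self.add_edge(seq[i:i + self.k + 1])

    def edges(self):
        return sorted(src + base for src, bases in self.succ.items() for base in bases)


if __name__ == "__main__":
    g = ToyDynamicDeBruijnGraph(3)
    g.add_sequence("ACGTAC")
    print(g.edges())
    g.remove_edge("CGTA")
    g.remove_node("GTA")
    print(g.edges())
```

A hash map makes every update cheap but costs far more memory per k-mer than a succinct representation, which is the trade-off the abstract is concerned with.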

Year:  2021        PMID: 32462192      PMCID: PMC8337006          DOI: 10.1093/bioinformatics/btaa546

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931

