TwoPaCo: an efficient algorithm to build the compacted de Bruijn graph from many complete genomes.

Ilia Minkin¹, Son Pham², Paul Medvedev¹,³,⁴

Abstract

MOTIVATION: De Bruijn graphs have been proposed as a data structure to facilitate the analysis of related whole-genome sequences, in both population and comparative genomic settings. However, current approaches do not scale well to many genomes of large size (such as mammalian genomes).
RESULTS: In this article, we present TwoPaCo, a simple and scalable low-memory algorithm for the direct construction of the compacted de Bruijn graph from a set of complete genomes. We demonstrate that it can construct the graph for 100 simulated human genomes in less than a day and for eight real primate genomes in under 2 h, on a typical shared-memory machine. We believe that this progress will enable novel biological analyses of hundreds of mammalian-sized genomes.
AVAILABILITY AND IMPLEMENTATION: Our code and data are available for download from github.com/medvedevgroup/TwoPaCo.
CONTACT: ium125@psu.edu.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
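
For readers unfamiliar with the data structure named in the abstract, the following is a minimal sketch of what a compacted de Bruijn graph of a set of genomes looks like. It is not the TwoPaCo algorithm and makes no attempt at low memory use: it holds every k-mer in ordinary Python dictionaries, which is the kind of cost a scalable method has to avoid. The function name `compacted_de_bruijn` and the toy sequences are hypothetical, and the sketch assumes a forward-strand-only graph (reverse complements are ignored). Vertices are k-mers, edges are (k+1)-mers that occur in the genomes, and a k-mer is treated as a junction if it starts or ends a genome or does not have exactly one incoming and one outgoing edge. Each compacted edge is spelled by the stretch of genome between two consecutive junctions.

```python
from collections import defaultdict


def compacted_de_bruijn(genomes, k):
    """Naive compacted de Bruijn graph, forward strand only (illustrative).

    Returns (junctions, edges): the set of junction k-mers and the set of
    strings spelled by the non-branching paths between consecutive junctions.
    """
    succ = defaultdict(set)  # k-mer -> set of successor k-mers
    pred = defaultdict(set)  # k-mer -> set of predecessor k-mers
    ends = set()             # first and last k-mer of every genome

    # Pass 1: collect the edges of the ordinary (uncompacted) graph.
    for g in genomes:
        if len(g) < k:
            continue
        ends.update((g[:k], g[-k:]))
        for i in range(len(g) - k):
            u, v = g[i:i + k], g[i + 1:i + k + 1]
            succ[u].add(v)
            pred[v].add(u)

    def is_junction(kmer):
        # Genome ends and branching (or dead-end) k-mers are junctions.
        return (kmer in ends
                or len(succ.get(kmer, ())) != 1
                or len(pred.get(kmer, ())) != 1)

    # Pass 2: walk each genome and cut it at junction positions.
    junctions, edges = set(), set()
    for g in genomes:
        if len(g) < k:
            continue
        prev = None  # position of the previous junction in this genome
        for i in range(len(g) - k + 1):
            if is_junction(g[i:i + k]):
                junctions.add(g[i:i + k])
                if prev is not None:
                    edges.add(g[prev:i + k])
                prev = i
    return junctions, edges


junctions, edges = compacted_de_bruijn(["TACGTCA", "TACGACA"], k=3)
print(sorted(junctions))  # ['ACA', 'ACG', 'TAC', 'TCA']
print(sorted(edges))      # ['ACGACA', 'ACGTCA', 'TACG']
```

In this toy input the two sequences share the prefix TACG and then diverge, so the compacted graph has one shared edge and one edge per variant.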

Year: 2017 PMID: 27659452 DOI: 10.1093/bioinformatics/btw609

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

Cited

19 in total

1. The design and construction of reference pangenome graphs with minigraph.

Authors: Heng Li; Xiaowen Feng; Chong Chu
Journal: Genome Biol Date: 2020-10-16 Impact factor: 13.583

2. Pangenome Graphs. (Review)

Authors: Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison
Journal: Annu Rev Genomics Hum Genet Date: 2020-05-26 Impact factor: 8.929

3. The effect of genome graph expressiveness on the discrepancy between genome graph distance and string set distance.

4. Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2.

5. Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads.

6. Faucet: streaming de novo assembly graph construction.

7. A space and time-efficient index for the compacted colored de Bruijn graph.

8. Cuttlefish: fast, parallel and low-memory compaction of de Bruijn graphs from large-scale genome collections.

9. Constructing small genome graphs via string compression.

10. seq-seq-pan: building a computational pan-genome data structure on whole genome alignment.