Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Journaled string tree-a scalable data structure for analyzing thousands of similar genomes on your laptop.

Literature DB >> 25028723

Journaled string tree-a scalable data structure for analyzing thousands of similar genomes on your laptop.

René Rahn¹, David Weese¹, Knut Reinert¹.

Abstract

MOTIVATION: Next-generation sequencing (NGS) has revolutionized biomedical research in the past decade and led to a continuous stream of developments in bioinformatics, addressing the need for fast and space-efficient solutions for analyzing NGS data. Often researchers need to analyze a set of genomic sequences that stem from closely related species or are indeed individuals of the same species. Hence, the analyzed sequences are similar. For analyses where local changes in the examined sequence induce only local changes in the results, it is obviously desirable to examine identical or similar regions not repeatedly.
RESULTS: In this work, we provide a datatype that exploits data parallelism inherent in a set of similar sequences by analyzing shared regions only once. In real-world experiments, we show that algorithms that otherwise would scan each reference sequentially can be speeded up by a factor of 115.

Mesh：

Year: 2014 PMID： 25028723 DOI： 10.1093/bioinformatics/btu438

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

11 in total

Review 1. Pangenome Graphs.

Authors: Jordan M Eizenga; Adam M Novak; Jonas A Sibbesen; Simon Heumos; Ali Ghaffaari; Glenn Hickey; Xian Chang; Josiah D Seaman; Robin Rounthwaite; Jana Ebler; Mikko Rautiainen; Shilpa Garg; Benedict Paten; Tobias Marschall; Jouni Sirén; Erik Garrison
Journal: Annu Rev Genomics Hum Genet Date: 2020-05-26 Impact factor: 8.929

Journaled string tree-a scalable data structure for analyzing thousands of similar genomes on your laptop.

Review 1. Pangenome Graphs.

Review 2. Searching and Indexing Genomic Databases via Kernelization.

3. Sequence Factorization with Multiple References.

4. A representation of a compressed de Bruijn graph for pan-genome analysis that enables search.

Review 5. Computational pan-genomics: status, promises and challenges.

6. Bit-parallel sequence-to-graph alignment.

7. Indexes of large genome collections on a PC.

Review 8. Visual programming for next-generation sequencing data analytics.

9. seq-seq-pan: building a computational pan-genome data structure on whole genome alignment.

10. Founder Reconstruction Enables Scalable and Seamless Pangenomic Analysis.