Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Compression of nucleotide databases for fast searching.

Literature DB >> 9367128

Compression of nucleotide databases for fast searching.

Abstract

MOTIVATION: International sequencing efforts are creating huge nucleotide databases, which are used in searching applications to locate sequences homologous to a query sequence. In such applications, it is desirable that databases are stored compactly, that sequences can be accessed independently of the order in which they were stored, and that data can be rapidly retrieved from secondary storage, since disk costs are often the bottleneck in searching.
RESULTS: We present a purpose-built direct coding scheme for fast retrieval and compression of genomic nucleotide data. The scheme is lossless, readily integrated with sequence search tools, and does not require a model. Direct coding gives good compression and allows faster retrieval than with either uncompressed data or data compressed by other methods, thus yielding significant improvements in search times for high-speed homology search tools.

Mesh：

Year: 1997 PMID： 9367128 DOI： 10.1093/bioinformatics/13.5.549

Source DB: PubMed Journal: Comput Appl Biosci ISSN： 0266-7061

Keyword Cloud
Cited

2 in total

1. Data structures and compression algorithms for genomic sequence data.

Authors: Marty C Brandon; Douglas C Wallace; Pierre Baldi
Journal: Bioinformatics Date: 2009-05-15 Impact factor: 6.937

2. Bitpacking techniques for indexing genomes: I. Hash tables.

Authors: Thomas D Wu
Journal: Algorithms Mol Biol Date: 2016-04-18 Impact factor: 1.405

2 in total