Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Space efficient computation of rare maximal exact matches between multiple sequences.

Literature DB >> 18361760

Space efficient computation of rare maximal exact matches between multiple sequences.

Abstract

In this article, we propose a new method for computing rare maximal exact matches between multiple sequences. A rare match between k sequences S(1), ... , S(k) is a string that occurs at most t(i)-times in the sequence S(i), where the t(i) > 0 are user-defined thresholds. First, the suffix tree of one of the sequences (the reference sequence) is built, and then the other sequences are matched separately against this suffix tree. Second, the resulting pairwise exact matches are combined to multiple exact matches. A clever implementation of this method yields a very fast and space efficient program. This program can be applied in several comparative genomics tasks, such as the identification of synteny blocks between whole genomes.

Mesh：

Year: 2008 PMID： 18361760 DOI： 10.1089/cmb.2007.0105

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
Cited

5 in total

Space efficient computation of rare maximal exact matches between multiple sequences.

1. Separating significant matches from spurious matches in DNA sequences.

2. Adaptive seeds tame genomic sequence comparison.

3. A practical algorithm for finding maximal exact matches in large sequence datasets using sparse suffix arrays.

4. Murasaki: a fast, parallelizable algorithm to find anchors from multiple genomes.

5. CoCoNUT: an efficient system for the comparison and analysis of genomes.