Literature DB >> 18361760

Space efficient computation of rare maximal exact matches between multiple sequences.

Enno Ohlebusch1, Stefan Kurtz.   

Abstract

In this article, we propose a new method for computing rare maximal exact matches between multiple sequences. A rare match between k sequences S(1), ... , S(k) is a string that occurs at most t(i)-times in the sequence S(i), where the t(i) > 0 are user-defined thresholds. First, the suffix tree of one of the sequences (the reference sequence) is built, and then the other sequences are matched separately against this suffix tree. Second, the resulting pairwise exact matches are combined to multiple exact matches. A clever implementation of this method yields a very fast and space efficient program. This program can be applied in several comparative genomics tasks, such as the identification of synteny blocks between whole genomes.

Mesh:

Year:  2008        PMID: 18361760     DOI: 10.1089/cmb.2007.0105

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  Separating significant matches from spurious matches in DNA sequences.

Authors:  Hugo Devillers; Sophie Schbath
Journal:  J Comput Biol       Date:  2011-12-09       Impact factor: 1.479

2.  Adaptive seeds tame genomic sequence comparison.

Authors:  Szymon M Kiełbasa; Raymond Wan; Kengo Sato; Paul Horton; Martin C Frith
Journal:  Genome Res       Date:  2011-01-05       Impact factor: 9.043

3.  A practical algorithm for finding maximal exact matches in large sequence datasets using sparse suffix arrays.

Authors:  Zia Khan; Joshua S Bloom; Leonid Kruglyak; Mona Singh
Journal:  Bioinformatics       Date:  2009-04-23       Impact factor: 6.937

4.  Murasaki: a fast, parallelizable algorithm to find anchors from multiple genomes.

Authors:  Kris Popendorf; Hachiya Tsuyoshi; Yasunori Osana; Yasubumi Sakakibara
Journal:  PLoS One       Date:  2010-09-24       Impact factor: 3.240

5.  CoCoNUT: an efficient system for the comparison and analysis of genomes.

Authors:  Mohamed I Abouelhoda; Stefan Kurtz; Enno Ohlebusch
Journal:  BMC Bioinformatics       Date:  2008-11-12       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.