Gregory Kucherov, Laurent Noé, Mikhail Roytberg.
Abstract
We study a method of seed-based lossless filtration for approximate string matching and related bioinformatics applications. The method is based on a simultaneous use of several spaced seeds rather than a single seed as studied by Burkhardt and Kärkkäinen. We present algorithms to compute several important parameters of seed families, study their combinatorial properties, and describe several techniques to construct efficient families. We also report a large-scale application of the proposed technique to the problem of oligonucleotide selection for an EST sequence database.
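To make the notion of a lossless seed family concrete, here is a minimal brute-force sketch. It is not one of the algorithms from the paper, only an illustration under stated assumptions: a seed is written as a string in which `#` marks a position that must match and `-` a position that is ignored, and a family is taken to be lossless for parameters (m, k) if every alignment of length m with k mismatches contains at least one mismatch-free occurrence of some seed. The function names `hits` and `is_lossless` are hypothetical.

```python
from itertools import combinations


def hits(seed, mismatches, m):
    """Return True if `seed` matches somewhere in a length-m alignment
    whose mismatch positions are given by the set `mismatches`."""
    # Offsets inside the seed that are required to be matches.
    offsets = [j for j, c in enumerate(seed) if c == "#"]
    span = len(seed)
    for start in range(m - span + 1):
        if all((start + j) not in mismatches for j in offsets):
            return True
    return False


def is_lossless(family, m, k):
    """Exhaustively test whether `family` detects every length-m
    alignment with exactly k mismatches (assumed definition).

    Checking exactly k mismatches is enough: removing a mismatch
    can never destroy an existing seed hit.
    """
    for positions in combinations(range(m), k):
        mismatches = set(positions)
        if not any(hits(seed, mismatches, m) for seed in family):
            return False
    return True


if __name__ == "__main__":
    # Neither spaced seed is lossless alone for m=5, k=1,
    # but the two together are.
    print(is_lossless(["##-#"], 5, 1))
    print(is_lossless(["#-##"], 5, 1))
    print(is_lossless(["##-#", "#-##"], 5, 1))
```

The enumeration visits every choice of k mismatch positions among m, so it is only practical for small parameters; it is meant to show why several seeds used together can cover cases that each single seed misses.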
Year: 2005 PMID: 17044164 DOI: 10.1109/TCBB.2005.12
Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN: 1545-5963 Impact factor: 3.710