Multiple spaced seeds for homology search.

Lucian Ilie, Silvana Ilie.

Abstract

MOTIVATION: Homology search finds similar segments between two biological sequences, such as DNA or protein sequences. The introduction of optimal spaced seeds in PatternHunter increased both the sensitivity and the speed of homology search, and spaced seeds have since been adopted by many alignment programs such as BLAST. With the further improvement provided by multiple spaced seeds in PatternHunterII, Smith-Waterman sensitivity is approached at BLASTn speed. However, computing optimal multiple spaced seeds has been proved NP-hard, and all current heuristic algorithms are very slow (exponential time).
RESULTS: We give a simple algorithm which computes good multiple seeds in polynomial time. Due to a completely different approach, the difference with respect to the previous methods is dramatic. The multiple spaced seed of PatternHunterII, with 16 weight 11 seeds, was computed in 12 days. It takes us 17 s to find a better one. Our approach changes the way of looking at multiple spaced seeds.
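
For readers new to the terminology, the following is a minimal sketch of what a spaced seed "hit" and the sensitivity of a multiple spaced seed mean under the standard definitions. It is not the polynomial-time algorithm of the paper, which the abstract does not describe. The function names, the toy seeds and the Bernoulli model with match probability `p` are assumptions chosen for illustration, and the brute-force sensitivity computation is itself exponential, so it is only usable on very short regions.

```python
from itertools import product


def seed_hits(seed: str, alignment: str) -> bool:
    """Return True if the spaced seed hits the alignment at some offset.

    seed:      string over {'1', '0'}; '1' = position must match, '0' = don't care.
    alignment: string over {'1', '0'}; '1' = match, '0' = mismatch.
    """
    care = [i for i, c in enumerate(seed) if c == "1"]
    for offset in range(len(alignment) - len(seed) + 1):
        if all(alignment[offset + i] == "1" for i in care):
            return True
    return False


def multi_seed_hits(seeds, alignment: str) -> bool:
    """A multiple spaced seed hits if at least one of its seeds hits."""
    return any(seed_hits(s, alignment) for s in seeds)


def sensitivity(seeds, length: int, p: float) -> float:
    """Probability that the seed set hits a random region of the given length.

    Assumes each position is a match independently with probability p
    (Bernoulli model). Enumerates all 2**length regions, so this is only
    feasible for toy sizes.
    """
    total = 0.0
    for bits in product("01", repeat=length):
        region = "".join(bits)
        if multi_seed_hits(seeds, region):
            k = region.count("1")
            total += p ** k * (1 - p) ** (length - k)
    return total


if __name__ == "__main__":
    region = "1011011"
    print(seed_hits("111", region))   # contiguous seed of weight 3
    print(seed_hits("1101", region))  # spaced seed of weight 3
    print(sensitivity(["1101"], 8, 0.7))
    print(sensitivity(["1101", "1011"], 8, 0.7))
```

In this toy region the contiguous seed `111` finds no hit while the spaced seed `1101` of the same weight does, and adding a second seed to the set can only keep or raise the sensitivity. Choosing the seed set that maximizes this probability is the optimization problem the abstract refers to.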

Year:  2007        PMID: 17804438     DOI: 10.1093/bioinformatics/btm422

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Inexact Local Alignment Search over Suffix Arrays.

Authors:  Mohammadreza Ghodsi; Mihai Pop
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2009-11-01

2.  BOND: Basic OligoNucleotide Design.

Authors:  Lucian Ilie; Hamid Mohamadi; Geoffrey Brian Golding; William F Smyth
Journal:  BMC Bioinformatics       Date:  2013-02-27       Impact factor: 3.169

3.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

4.  Seeds for effective oligonucleotide design.

Authors:  Lucian Ilie; Silvana Ilie; Shima Khoshraftar; Anahita Mansouri Bigvand
Journal:  BMC Genomics       Date:  2011-06-01       Impact factor: 3.969

5.  Hit integration for identifying optimal spaced seeds.

Authors:  Won-Hyoung Chung; Seong-Bae Park
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

6.  Efficient computation of spaced seeds.

Authors:  Silvana Ilie
Journal:  BMC Res Notes       Date:  2012-02-28

7.  rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison.

Authors:  Lars Hahn; Chris-André Leimeister; Rachid Ounit; Stefano Lonardi; Burkhard Morgenstern
Journal:  PLoS Comput Biol       Date:  2016-10-19       Impact factor: 4.475

8.  SPRINT: ultrafast protein-protein interaction prediction of the entire human interactome.

Authors:  Yiwei Li; Lucian Ilie
Journal:  BMC Bioinformatics       Date:  2017-11-15       Impact factor: 3.169

9.  BFAST: an alignment tool for large scale genome resequencing.

Authors:  Nils Homer; Barry Merriman; Stanley F Nelson
Journal:  PLoS One       Date:  2009-11-11       Impact factor: 3.240

10.  SANS: high-throughput retrieval of protein sequences allowing 50% mismatches.

Authors:  J Patrik Koskinen; Liisa Holm
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937
