Literature DB >> 17044162

Optimizing multiple seeds for protein homology search.

Daniel G Brown1.   

Abstract

We present a framework for improving local protein alignment algorithms. Specifically, we discuss how to extend local protein aligners to use a collection of vector seeds or ungapped alignment seeds to reduce noise hits. We model picking a set of seed models as an integer programming problem and give algorithms to choose such a set of seeds. While the problem is NP-hard, and Quasi-NP-hard to approximate to within a logarithmic factor, it can be solved easily in practice. A good set of seeds we have chosen allows four to five times fewer false positive hits, while preserving essentially identical sensitivity as BLASTP.

Mesh:

Substances:

Year:  2005        PMID: 17044162     DOI: 10.1109/TCBB.2005.13

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  A unifying framework for seed sensitivity and its application to subset seeds.

Authors:  Gregory Kucherov; Laurent Noé; Mikhail Roytberg
Journal:  J Bioinform Comput Biol       Date:  2006-04       Impact factor: 1.122

2.  Best hits of 11110110111: model-free selection and parameter-free sensitivity calculation of spaced seeds.

Authors:  Laurent Noé
Journal:  Algorithms Mol Biol       Date:  2017-02-14       Impact factor: 1.405

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.