Literature DB >> 17037963

Optimizing multiple spaced seeds for homology search.

Jinbo Xu1, Daniel Brown, Ming Li, Bin Ma.   

Abstract

Optimized spaced seeds improve sensitivity and specificity in local homology search. Several authors have shown that multiple seeds can have better sensitivity and specificity than single seeds. We describe a linear programming (LP)-based algorithm to optimize a set of seeds. Theoretically, our algorithm offers a performance guarantee: the sensitivity of a chosen seed set is at least 70% of what can be achieved, in most reasonable models of homologous sequences. In practice, our algorithm generates a solution which is at least 90% of the optimal. Our method not only achieves performance better than or comparable to that of a greedy algorithm, but also gives this area a mathematical foundation.

Mesh:

Year:  2006        PMID: 17037963     DOI: 10.1089/cmb.2006.13.1355

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

2.  Hit integration for identifying optimal spaced seeds.

Authors:  Won-Hyoung Chung; Seong-Bae Park
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

3.  PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds.

Authors:  Yangho Chen; Tade Souaiaia; Ting Chen
Journal:  Bioinformatics       Date:  2009-08-12       Impact factor: 6.937

4.  Accurate multiple alignment of distantly related genome sequences using filtered spaced word matches as anchor points.

Authors:  Chris-André Leimeister; Thomas Dencker; Burkhard Morgenstern
Journal:  Bioinformatics       Date:  2019-01-15       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.