YAKUSA: a fast structural database scanning method.

Mathilde Carpentier, Sophie Brouillet, Joël Pothier.

Abstract

YAKUSA is a program designed for rapid scanning of a structural database with a query protein structure. It searches for the longest common substructures, called SHSPs (structural high-scoring pairs), between a query structure and every structure in the database. It uses protein backbone internal coordinates (alpha angles) to describe protein structures as sequences of symbols. The structural similarities are established in 5 steps, the first 3 being analogous to those used in BLAST: (1) building a deterministic finite automaton describing all patterns identical or similar to those in the query structure; (2) searching for all these patterns in every structure in the database; (3) extending the patterns to longer matching substructures (i.e., SHSPs); (4) selecting compatible SHSPs for each query-database structure pair; and (5) ranking the query-database structure pairs using 3 scores based on SHSP similarity, on SHSP probabilities, and on spatial compatibility of SHSPs. Structural fragment probabilities are estimated according to a mixture transition distribution model, which is an approximation of a high-order Markov chain model. In sensitivity and selectivity of the structural matches, YAKUSA compares well with the best related programs while being far faster: a typical database scan takes about 40 s of CPU time on a desktop personal computer. It has also been implemented on a Web server for real-time searches. © 2005 Wiley-Liss, Inc.
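The encoding and the seed-and-extend idea behind steps (1) to (3) can be illustrated with a minimal Python sketch. It is not the YAKUSA implementation. It assumes that an alpha angle is the dihedral angle defined by four consecutive C-alpha atoms, that angles are discretized into fixed 30-degree bins, and that seeds are exact k-symbol matches found through a hash index. The abstract instead describes a deterministic finite automaton that also accepts similar patterns, and it does not state the discretization, so `bin_width`, `k`, and all function names here are hypothetical.

```python
import math
from collections import defaultdict


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle in degrees defined by four 3D points."""
    b0, b1, b2 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b0, b1), _cross(b1, b2)
    y = math.sqrt(_dot(b1, b1)) * _dot(b0, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def alpha_symbols(ca_coords, bin_width=30):
    """Encode a C-alpha trace as a list of discretized alpha angles.

    Assumption: fixed-width bins; the real alphabet may differ.
    """
    n_bins = 360 // bin_width
    symbols = []
    for i in range(len(ca_coords) - 3):
        angle = dihedral(*ca_coords[i:i + 4])
        symbols.append(int((angle + 180.0) // bin_width) % n_bins)
    return symbols


def seed_and_extend(query, target, k=3):
    """Find exact k-symbol seeds, then extend each one without gaps.

    Returns (query_start, target_start, length) tuples, longest first.
    A hash index stands in for the automaton described in the abstract.
    """
    index = defaultdict(list)
    for j in range(len(target) - k + 1):
        index[tuple(target[j:j + k])].append(j)

    found = set()
    for i in range(len(query) - k + 1):
        for j in index.get(tuple(query[i:i + k]), ()):
            qs, ts = i, j
            while qs > 0 and ts > 0 and query[qs - 1] == target[ts - 1]:
                qs -= 1
                ts -= 1
            qe, te = i + k, j + k
            while qe < len(query) and te < len(target) and query[qe] == target[te]:
                qe += 1
                te += 1
            found.add((qs, ts, qe - qs))
    return sorted(found, key=lambda seg: -seg[2])
```

In this sketch, `seed_and_extend(alpha_symbols(query_ca), alpha_symbols(target_ca))` would list the shared symbol runs between two C-alpha traces. Steps (4) and (5), which select compatible SHSPs and score each structure pair, are not covered.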

Year:  2005        PMID: 16049912     DOI: 10.1002/prot.20517

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  34 in total

1.  The power of detecting enriched patterns: an HMM approach.

Authors:  Zhiyuan Zhai; Shih-Yen Ku; Yihui Luan; Gesine Reinert; Michael S Waterman; Fengzhu Sun
Journal:  J Comput Biol       Date:  2010-04       Impact factor: 1.479

2.  GOSSIP: a method for fast and accurate global alignment of protein structures.

Authors:  I Kifer; R Nussinov; H J Wolfson
Journal:  Bioinformatics       Date:  2011-02-03       Impact factor: 6.937

3.  The SALAMI protein structure search server.

Authors:  Thomas Margraf; Gundolf Schenk; Andrew E Torda
Journal:  Nucleic Acids Res       Date:  2009-05-22       Impact factor: 16.971

4.  deconSTRUCT: general purpose protein database search on the substructure level.

Authors:  Zong Hong Zhang; Kavitha Bharatham; Westley A Sherman; Ivana Mihalek
Journal:  Nucleic Acids Res       Date:  2010-06-03       Impact factor: 16.971

5.  ProteinDBS v2.0: a web server for global and local protein structure search.

Authors:  Chi-Ren Shyu; Bin Pang; Pin-Hao Chi; Nan Zhao; Dmitry Korkin; Dong Xu
Journal:  Nucleic Acids Res       Date:  2010-06-10       Impact factor: 16.971

6.  Automatic classification of protein structures relying on similarities between alignments.

Authors:  Guillaume Santini; Henry Soldano; Joël Pothier
Journal:  BMC Bioinformatics       Date:  2012-09-14       Impact factor: 3.169

7.  Multiple structure alignment with msTALI.

Authors:  Paul Shealy; Homayoun Valafar
Journal:  BMC Bioinformatics       Date:  2012-05-20       Impact factor: 3.169

8.  iSARST: an integrated SARST web server for rapid protein structural similarity searches.

Authors:  Wei-Cheng Lo; Che-Yu Lee; Chi-Ching Lee; Ping-Chiang Lyu
Journal:  Nucleic Acids Res       Date:  2009-05-06       Impact factor: 16.971

9.  Swelfe: a detector of internal repeats in sequences and structures.

Authors:  Anne-Laure Abraham; Eduardo P C Rocha; Joël Pothier
Journal:  Bioinformatics       Date:  2008-05-16       Impact factor: 6.937

10.  Multiple graph regularized protein domain ranking.

Authors:  Jim Jing-Yan Wang; Halima Bensmail; Xin Gao
Journal:  BMC Bioinformatics       Date:  2012-11-19       Impact factor: 3.169
