Literature DB >> 12855441

PSI: indexing protein structures for fast similarity search.

Orhan Camoglu1, Tamer Kahveci, Ambuj K Singh.   

Abstract

MOTIVATION: We consider the problem of finding similarities in protein structure databases. Current techniques sequentially compare the given query protein to all of the proteins in the database to find similarities. Therefore, the cost of similarity queries increases linearly as the volume of the protein databases increase. As the sizes of experimentally determined and theoretically estimated protein structure databases grow, there is a need for scalable searching techniques.
RESULTS: Our techniques extract feature vectors on triplets of SSEs (Secondary Structure Elements). Later, these feature vectors are indexed using a multidimensional index structure. For a given query protein, this index structure is used to quickly prune away unpromising proteins in the database. The remaining proteins are then aligned using a popular alignment tool such as VAST. We also develop a novel statistical model to estimate the goodness of a match using the SSEs. Experimental results show that our techniques improve the pruning time of VAST 3 to 3.5 times while maintaining similar sensitivity.

Mesh:

Substances:

Year:  2003        PMID: 12855441     DOI: 10.1093/bioinformatics/btg1009

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  ProteinDBS: a real-time retrieval system for protein structure comparison.

Authors:  Chi-Ren Shyu; Pin-Hao Chi; Grant Scott; Dong Xu
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Efficient protein alignment algorithm for protein search.

Authors:  Zaixin Lu; Zhiyu Zhao; Bin Fu
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

3.  Secondary structure spatial conformation footprint: a novel method for fast protein structure comparison and classification.

Authors:  Elena Zotenko; Dianne P O'Leary; Teresa M Przytycka
Journal:  BMC Struct Biol       Date:  2006-06-08

4.  A method of protein model classification and retrieval using bag-of-visual-features.

Authors:  Jinlin Ma; Ziping Ma; Baosheng Kang; Ke Lu
Journal:  Comput Math Methods Med       Date:  2014-09-01       Impact factor: 2.238

5.  Fold classification based on secondary structure--how much is gained by including loop topology?

Authors:  Jieun Jeong; Piotr Berman; Teresa Przytycka
Journal:  BMC Struct Biol       Date:  2006-03-08
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.