| Literature DB >> 20538653 |
Chi-Ren Shyu1, Bin Pang, Pin-Hao Chi, Nan Zhao, Dmitry Korkin, Dong Xu.
Abstract
ProteinDBS v2.0 is a web server designed for efficient and accurate comparisons and searches of structurally similar proteins from a large-scale database. It provides two comparison methods, global-to-global and local-to-local, to facilitate the searches of protein structures or substructures. ProteinDBS v2.0 applies advanced feature extraction algorithms and scalable indexing techniques to achieve a high-running speed while preserving reasonably high precision of structural comparison. The experimental results show that our system is able to return results of global comparisons in seconds from a complete Protein Data Bank (PDB) database of 152,959 protein chains and that it takes much less time to complete local comparisons from a non-redundant database of 3276 proteins than other accurate comparison methods. ProteinDBS v2.0 supports query by PDB protein ID and by new structures uploaded by users. To our knowledge, this is the only search engine that can simultaneously support global and local comparisons. ProteinDBS v2.0 is a useful tool to investigate functional or evolutional relationships among proteins. Moreover, the common substructures identified by local comparison can be potentially used to assist the human curation process in discovering new domains or folds from the ever-growing protein structure databases. The system is hosted at http://ProteinDBS.rnet.missouri.edu.Entities:
Mesh:
Year: 2010 PMID: 20538653 PMCID: PMC2896110 DOI: 10.1093/nar/gkq522
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.ProteinDBS v2.0 system architecture which has five modules (i) query interface; (ii) protein structure data management; (iii) data pre-processing; (iv) search engines; and (v) retrieval result visualization.
Figure 2.ProteinDBS retrieval results visualization for global-to-global comparison. The top-left panel shows a superimposed view of a query protein chain and a selected result chain from the ranked list in the top-right panel (the third-ranked protein chain in this figure). Users can check ‘Result Protein’ and/or ‘Query Protein’ to view the retrieved chain and/or the query chain. By clicking on a hyperlink below the top-left panel, users can view different display themes. The lower panel displays the structure alignment results represented by sequences with the RMSD and alignment length.
Figure 3.ProteinDBS retrieval results visualization for local-to-local comparison. The top-left panel shows a superimposed view of a query protein and a selected result protein from the ranked list in the top-right panel (the first-ranked protein chain in second-ranked SCOP fold). Users can check ‘Result Protein’ and/or ‘Query Protein’ to view the retrieved chain and/or the query chain. By clicking on a hyperlink below the top-left panel, users can view different display themes. The lower panel shows the sequence alignment. Clicking on ‘residue checkbox’ below a pair of residue will highlight residues in the 3D superimposition view. To show one specific substructure, users first click hyperlink ‘Clear Display’ to hide all the substructures and then click substructure bar to show the designed substructure.