| Literature DB >> 22689645 |
Bin Pang1, Xingyan Kuang, Nan Zhao, Dmitry Korkin, Chi-Ren Shyu.
Abstract
PBSword is a web server designed for efficient and accurate comparisons and searches of geometrically similar protein-protein binding sites from a large-scale database. The basic idea of PBSword is that each protein binding site is first represented by a high-dimensional vector of 'visual words', which characterizes both the global and local shape features of the binding site. It then uses a scalable indexing technique to search for those binding sites whose visual words representations are similar to that of the query binding site. Our system is able to return ranked results of binding sites in short time from a database of 194 322 domain-domain binding sites. PBSword supports query by protein ID and by new structures uploaded by users. PBSword is a useful tool to investigate functional connections among proteins based on the local structures of binding site and has potential applications to protein-protein docking and drug discovery. The system is hosted at http://pbs.rnet.missouri.edu.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22689645 PMCID: PMC3394332 DOI: 10.1093/nar/gks527
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PBSword system architecture. (a) The database management and preprocessing module is responsible for feature extraction, visual vocabulary construction and word representation of the database binding sites, which can be performed offline. (b) The query interface modules provide friendly interfaces in an Internet browser to allow users to input protein ID or upload protein structure. (c) The search engine module organizes the word representation of the database binding site into indexing tree and returns n nearest neighbors for a query binding site in real-time. (d) The retrieval visualization module shows 3D structure/surface view, sequence and properties of retrieved binding sites.
Figure 2.PBSword retrieval results visualization. (a) The top-left panel shows a 3D structure and surface view of a selected result protein-binding site from the ranked list in the top-right panel. Users can click on the buttons and checkboxes in the top-left panel to select binding site and its partner as well as display modes of surface. (b) The sequence panel shows sequence information of subunit pairs. Each column in the panel corresponds to an amino acid of protein subunit, which consists of three rows. First row represents residue sequence number. For binding site residue, its number is in red font. For the residue with intermolecular hydrogen bonds, it numbers is underlined. Second row is the residue name. The third row is residue check box. By clicking a checkbox, corresponding residues will be shown in the top-left structure view.
Figure 3.PBSword retrieval results of binding site properties. (a) The SCOP classification panel shows the subunit’s classification, including class, fold, superfamily, family and species. (b) The site properties panel shows values of various physicochemical properties of binding sites, including number of residues, ASA, percentage of ASA, percentage of polarity, percentage of hydrophobicity, planarity, number of hydrogen bonds and gap index. In addition, users can click on the hyperlink of SCOP family at the row “Statistics of family” to view the statistics summary of these properties for those binding sites belonging to same SCOP family. (c) The residue properties panel shows detailed properties for each binding site residue, including ASA, percentage of ASA, ASA of polar atoms, percentage of polar residue and number of hydrogen bonds. (d) The family statistics panel shows the statistics summary of binding sites from a SCOP family, including number of binding sites, amino acids compositions, as well as the mean (standard deviation) and histogram of binding site properties. The properties include ASA, percentage of polarity, percentage of hydrophobicity, planarity, hydrogen bonding and gap index. Here, hydrogen bonding is defined as the number of hydrogen bonds per 100 Å2 ASA. Owing to the page limitation, we have only shown subset of each panel.