
ProBiS-2012: web server and web services for detection of structurally similar binding sites in proteins.

Abstract

ProBiS is a web server for detection of structurally similar binding sites in the PDB and for local pairwise alignment of protein structures. In this article, we present a new version of the ProBiS web server that is 10 times faster than earlier versions, owing to efficient parallelization of the ProBiS algorithm; this reduces the calculation time for comparing a query protein against the entire PDB from hours to minutes. It also features new web services and an improved user interface. In addition, the new web server is united with the ProBiS-Database and thus provides instant access to pre-calculated protein similarity profiles for over 29 000 non-redundant protein structures. The ProBiS web server is particularly adept at detection of secondary binding sites in proteins. The earlier version is freely available at http://probis.cmm.ki.si/old-version, and the new ProBiS web server is at http://probis.cmm.ki.si.

Year: 2012 PMID: 22600737 PMCID: PMC3394329 DOI: 10.1093/nar/gks435

Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971

INTRODUCTION

Detection of structural similarities in proteins can be applied to many open questions. These include elucidation of the biochemical functions of newly characterized proteins (1–4), identification of novel indications for existing drugs, i.e. drug repositioning (5–7), and prediction of interactions of known drugs with secondary targets, or off-targets, that may lead to undesirable side effects (8,9). However, comparison of only the sequence and folding of proteins fails to address these problems because protein binding sites, rather than protein folding, control interaction with ligands and hence biochemical function (10). Web servers that allow the detection of local similarities in proteins have been developed (11–21), and there is an increasing number of approaches that deal with drug repositioning and off-target prediction from different perspectives (22–24).

Two programs developed at the National Institute of Chemistry in Ljubljana, the ProBiS web server (25,26) and the ProBiS-Database (27), enable the detection of structurally similar protein binding sites and local pairwise alignment of crystallographically or NMR determined protein structures from the PDB. ProBiS-Database holds pre-calculated structural similarity profiles for over 29 000 non-redundant proteins in the PDB, and allows access to these results in seconds. The ProBiS web server allows de novo similarity calculations, using the ProBiS algorithm, for any protein in the PDB. ProBiS differs from most other structural alignment algorithms in that it can align proteins having different folds, if they share similar binding sites. ProBiS conducts searches for similar three-dimensional structural regions in proteins without reference to known binding sites or co-crystallized ligands, and takes into account entire protein surfaces. It accepts a protein structure as a query, and compares it against each protein in the non-redundant PDB (nr-PDB).

The nr-PDB, updated weekly, is derived from the entire PDB, which currently has ∼182 000 single-chain protein structures (28). All these single-chain structures are clustered at >95% sequence identity, and a representative of each cluster is chosen. The ∼29 000 representatives identified in this way constitute the nr-PDB; each is identified by a PDB ID and chain, as in 2q4u.A.

The ProBiS algorithm represents the surfaces of compared proteins as protein graphs, i.e. structures of vertices and edges, the vertices corresponding to functional groups of surface amino acid residues and the edges determined by distances between pairs of adjacent vertices. This representation captures both geometric and physicochemical characteristics of protein surfaces. ProBiS compares the query protein to each of the database proteins, using the maximum clique algorithm (29), which allows it to efficiently detect the largest similar subgraphs of compared protein graphs. After the comparison of the query protein to every nr-PDB protein structure is complete, the degrees of structural conservation are calculated for all amino acid residues in the query protein. These are analogous to degrees of sequence conservation, represented, e.g. in sequence logos with amino acid letters of different sizes, and reveal the frequency of occurrence of a particular residue in the local structural alignments that were found in the nr-PDB. These degrees of structural conservation are represented as different colors on the query protein structure as in Figure 3A, and often indicate the position of binding or other functionally important sites.
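The idea of finding the largest similar subgraphs through a maximum clique search can be illustrated with a small sketch. It is not the ProBiS implementation: the Bron-Kerbosch routine below is swapped in for the maximum clique algorithm of reference (29), and the vertex labels, the coordinates and the distance tolerance `tol` are all hypothetical. Vertices of two labelled graphs are paired when their labels match, two pairs are connected when the corresponding distances agree, and the largest clique in this product graph gives the largest common substructure.

```python
import math
from itertools import combinations


def distance_matrix(points):
    """All pairwise Euclidean distances between 3D points."""
    return [[math.dist(p, q) for q in points] for p in points]


def product_graph(labels_a, dist_a, labels_b, dist_b, tol=1.0):
    """Pair up equally labelled vertices of graphs A and B.

    Node (i, j) means 'vertex i of A corresponds to vertex j of B'.
    Two nodes are adjacent when the A-side and B-side distances agree
    within `tol` (a hypothetical tolerance, in the units of the input).
    """
    nodes = [(i, j) for i, la in enumerate(labels_a)
             for j, lb in enumerate(labels_b) if la == lb]
    adj = {n: set() for n in nodes}
    for (i, j), (k, l) in combinations(nodes, 2):
        if i != k and j != l and abs(dist_a[i][k] - dist_b[j][l]) <= tol:
            adj[(i, j)].add((k, l))
            adj[(k, l)].add((i, j))
    return adj


def max_clique(adj):
    """Bron-Kerbosch with pivoting; returns one largest clique as a set."""
    best = set()

    def expand(r, p, x):
        nonlocal best
        if not p and not x:
            if len(r) > len(best):
                best = set(r)
            return
        if len(r) + len(p) <= len(best):
            return  # this branch cannot beat the best clique found so far
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return best


# Illustrative input: B contains a translated copy of A plus one extra vertex.
labels_a = ["donor", "acceptor", "aromatic"]
points_a = [(0, 0, 0), (3, 0, 0), (0, 4, 0)]
labels_b = ["aromatic", "donor", "acceptor", "donor"]
points_b = [(10, 14, 0), (10, 10, 0), (13, 10, 0), (30, 30, 30)]

adj = product_graph(labels_a, distance_matrix(points_a),
                    labels_b, distance_matrix(points_b))
print(sorted(max_clique(adj)))
```

Each pair in the printed clique is a vertex-to-vertex correspondence, which is the kind of information a local structural alignment is built from.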

Figure 3.

ProBiS output page for query protein (PDB/Chain ID: 2q4u.A) of unknown function. (A) Structurally conserved binding sites. (B) An interactive table of similar proteins, filtered to show only calcium binding proteins. (C) Local structural superimposition of the blue query (2q4u.A) and the violet similar protein (2ehb.A) that contains a co-crystallized Ca2+ ion shown as green sphere. (D) The alignments presented as tables of residue-residue correspondences. (E) Integrated search tool. (F) Columns reordering tool.

The latest ProBiS web server, shown in Figure 1, has a number of new features: computation that is an order of magnitude faster, access to pre-calculated results and an improved user interface. All the functions of the ProBiS web server can now be accessed fully automatically from user scripts through the new ProBiS web services based on RESTful (Representational State Transfer) technology (30). The new additions to the ProBiS web server invite users to explore similarities among binding sites in proteins, and support annotation of uncharacterized protein structures, drug repositioning and off-target prediction. Details regarding the new ProBiS are available at http://probis.cmm.ki.si.

Figure 1.

ProBiS web server page provides access to the Protein Binding Sites Tools.

ProBiS WEB SERVER

Input

An example of the input to ProBiS is shown in Figure 2. The user first defines the query protein by either providing PDB/Chain ID(s) or uploading a PDB file. In the first case, a yellow pop-up window appears, containing a link to the ‘Local Structural Similarity’ web page with the pre-calculated results for a homologous member of the nr-PDB. Following this link, the results are taken directly from the ProBiS-Database, and no computations will be performed; in the other case, one continues with de novo search for structural similarities. This can optionally be limited to a specific region on the query protein surface, for example, a binding site, by clicking the ‘Select Motif’ button, shown in Figure 2. This opens a new browser window with the 3D query protein model. A binding site, or any other part of the protein, can then be selected by clicking on the query protein 3D model; the surface atoms within a radius of 15 Å around this selected point are highlighted in yellow. This default value of 15 Å can be changed in the text box above the ‘Select & Close Window’ button. When the window is closed, the selected, highlighted surface is converted to a text format in the ‘Residue Motif’ text box, and can be immediately used as the input to the ProBiS web server. This enables selection of binding sites to be used as queries. The comparison database can also be changed in the new ProBiS web server and the query can now be compared against either the non-redundant PDB (default) or a custom list of any protein structures. The latter are represented by the PDB/Chain IDs, e.g. 1all.A, 3dbj.C, 2vjt.A, which can be entered into the text box that opens when the user selects the ‘List of PDB/Chain IDs’ option in the ‘Comparison Database’ drop-down list.
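The radius-based selection described above can be sketched as follows. This is a minimal illustration, not the server's code: the atom records with `residue` and `xyz` keys are a hypothetical layout, and treating the 15 Å boundary as inclusive is an assumption.

```python
import math


def select_within_radius(atoms, center, radius=15.0):
    """Return the atoms whose coordinates lie within `radius` angstroms of `center`."""
    return [a for a in atoms if math.dist(a["xyz"], center) <= radius]


def selected_residues(atoms, center, radius=15.0):
    """Sorted, de-duplicated residue identifiers of the selected atoms."""
    return sorted({a["residue"] for a in select_within_radius(atoms, center, radius)})


# Hypothetical surface atoms (residue identifier and coordinates in angstroms).
atoms = [
    {"residue": "ASP 20", "xyz": (1.0, 0.0, 0.0)},
    {"residue": "ASP 20", "xyz": (2.0, 1.0, 0.0)},
    {"residue": "GLU 31", "xyz": (0.0, 14.9, 0.0)},
    {"residue": "LYS 75", "xyz": (0.0, 0.0, 15.1)},
]

print(selected_residues(atoms, center=(0.0, 0.0, 0.0)))
```

Passing a smaller `radius` narrows the selection in the same way as changing the default value in the text box.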

Figure 2.

Detection of Structurally Similar Binding Sites. ProBiS input page. Detailed instructions are provided in the User’s Guide at http://probis.cmm.ki.si.

Access to the pre-calculated similarity profiles in the ‘ProBiS-Database’ is provided through the search text box located at the top of the ProBiS web server page as shown in Figure 2, and through the ProBiS-Database widget or ProBiS-Database web services, which have been described previously (27).

Output

The ProBiS output page shown in Figure 3 contains the query protein cartoon model 2q4u.A colored according to degrees of structural conservation from unconserved (blue) to structurally conserved (red) and visualized in an integrated Jmol molecular viewer (see panel A). The structurally conserved residues are shown as red spheres and indicate the location of the putative binding sites. There is also an interactive table of available similar proteins (see panel B). Clicking on the ‘View’ link in the ‘Alignments’ column shows the superimposition of the query (2q4u.A) and the similar protein (2ehb.A), and opens the ‘Details’ tab of residue–residue correspondences of all alignments between the query and the similar database protein as shown in panels C and D. Clicking on any protein chain link in the ‘Chain’ column opens a new output page with the pre-calculated structural similarity profile for a homologous member of the nr-PDB. The ‘Name’ column presents the names of the similar proteins; the ‘Pfam’, ‘SCOP’ and ‘UniProt’ columns provide links, where available (31), to the corresponding external protein annotation resources. The similar proteins in the interactive table are ranked by the standard Z-Scores; Z-Scores >2.0 are colored green and those between 1.0 and 2.0 are yellow. The table can be downloaded in a CSV format, and thus directly exported to a spreadsheet program, such as Excel.
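One way to picture the degrees of structural conservation behind this coloring is to count how often each query residue appears in the local alignments. The sketch below rests on stated assumptions: the normalization by the most frequent residue and the linear blue-to-red ramp are illustrative choices, not the scheme ProBiS necessarily uses, and the residue numbers are made up.

```python
from collections import Counter


def conservation_degrees(query_residues, alignments):
    """Fraction-of-maximum occurrence of each query residue across alignments.

    `alignments` is a list of collections, each holding the query residue
    numbers that took part in one local structural alignment.
    """
    counts = Counter()
    for aligned in alignments:
        counts.update(set(aligned))  # count a residue once per alignment
    top = max(counts.values(), default=0)
    return {r: (counts[r] / top if top else 0.0) for r in query_residues}


def conservation_color(degree):
    """Linear RGB ramp from blue (degree 0.0) to red (degree 1.0)."""
    return (degree, 0.0, 1.0 - degree)


# Hypothetical query residues and three alignments found in a database scan.
degrees = conservation_degrees([10, 11, 12, 13], [{10, 11}, {10}, {10, 12}])
for residue, degree in degrees.items():
    print(residue, round(degree, 2), conservation_color(degree))
```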
Local structural superimposition of a query protein, 2q4u.A in blue, and a similar protein 2ehb.A in violet, is shown in Figure 3C. The calcium ion was co-crystallized in 2ehb.A. Alignments of 2q4u.A with 2ehb.A are also shown in tables of residue-residue correspondences in panel D; the alignment 1 is that seen in panel C. Where available (31), the PDB, Pfam, SCOP and UniProt accession numbers are at the top, and are links to external databases for structural and functional protein annotation. Below, the pairwise local alignments are presented in tabular form, ranked according to their Z-Scores. A continuous green dash connecting a pair of aligned residues indicates a good structural correspondence; an interrupted green dash indicates a poorer correspondence between the residues. The ‘Download’ buttons allow downloading the alignment in various formats, and the ‘View in Jmol’ button loads the alignment as shown in Figure 3C. Alignment scores for each pairwise alignment are shown in a yellow box, and are explained in detail in references (26,27). Filtering of the table by different search conditions is accomplished by the integrated search tool, shown in panel E. Here, the table was filtered, so that the protein names in ‘Name’ column must contain a ‘Ca’ keyword, which filters out all but calcium-related proteins. The table columns can also be reordered, shown or hidden, using the ‘Reorder Columns’ tool in panel F.
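Because the table of similar proteins can be downloaded as CSV, the Z-Score color bands and the keyword filter described above are easy to reproduce in a script. The following is a sketch under assumptions: the column headers `Chain`, `Name` and `Z-Score`, the placeholder rows, the inclusive handling of the 1.0 and 2.0 boundaries and the case-sensitive substring match are all illustrative and may differ from the real export.

```python
import csv
import io

# Placeholder rows; chain IDs, names and scores are invented for illustration.
SAMPLE_CSV = """Chain,Name,Z-Score
0aaa.A,Ca2+-binding placeholder protein,2.50
0bbb.B,Placeholder kinase,1.40
0ccc.A,Placeholder Ca sensor,0.70
"""


def load_rows(text):
    """Parse CSV text into a list of dictionaries keyed by column header."""
    return list(csv.DictReader(io.StringIO(text)))


def zscore_color(z):
    """Color band for a Z-Score: green above 2.0, yellow from 1.0 to 2.0."""
    if z > 2.0:
        return "green"
    if z >= 1.0:
        return "yellow"
    return None  # no highlight below 1.0


def filter_by_name(rows, keyword):
    """Keep rows whose 'Name' field contains `keyword` (case-sensitive)."""
    return [r for r in rows if keyword in r["Name"]]


rows = load_rows(SAMPLE_CSV)
for row in filter_by_name(rows, "Ca"):
    print(row["Chain"], row["Name"], zscore_color(float(row["Z-Score"])))
```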

ProBiS web services

The new ProBiS web server uses RESTful web services to provide ready access from user scripts to the binding site similarities and local pairwise alignments for any PDB protein structure (30). A specification of the web services interface and input data, a full set of commands and useful examples can be found at the ‘ProBiS-Web server RESTful Web Services’ instructions page at http://probis.cmm.ki.si/?what=webservices. The results of the web services calculations are returned in XML, JSON or PDB formats, which are well supported in modern programming languages.

NEW FEATURES IN THE 2012 ProBiS WEB SERVER

Faster calculation

In 2010, a batch script ran the non-parallel version of the ProBiS algorithm on 16 processors of a single computer (25). In 2012, the ProBiS algorithm (26) was parallelized using the Open-MPI library, and now runs on ∼250 processors of the ProBiS web server, which has shortened search time from hours to minutes.
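A database scan of this kind parallelizes naturally because every query-versus-target comparison is independent. How ProBiS distributes the work over its processors is not described here, so the sketch below only illustrates a common static scheme, round-robin partitioning of the database, and emulates the workers serially rather than using Open-MPI. The `compare` function is a hypothetical stand-in for one pairwise comparison.

```python
def partition(items, n_workers):
    """Round-robin split: worker r receives items r, r + n_workers, ..."""
    return [items[r::n_workers] for r in range(n_workers)]


def compare(query, target):
    """Hypothetical stand-in for one pairwise comparison.

    Returns (target, score), where the score is simply the number of
    positions at which the two strings carry the same character.
    """
    return target, sum(a == b for a, b in zip(query, target))


def scan(query, database, n_workers):
    """Compare `query` against every database entry, chunk by chunk.

    In a real parallel run each chunk would be handled by its own
    processor; here the chunks are processed one after another.
    """
    results = []
    for chunk in partition(database, n_workers):
        results.extend(compare(query, target) for target in chunk)
    return sorted(results, key=lambda r: r[1], reverse=True)


database = ["ABCD", "ABXX", "XXXX", "ABCX", "XBCD"]
print(scan("ABCD", database, n_workers=3)[:2])
```

The merged result does not depend on the number of workers, which is what makes this kind of decomposition safe for ranking hits.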

Integration with the ProBiS-Database

The new ProBiS web server features integrated access to the ProBiS-Database (27), which is a searchable repository of local pairwise alignments of non-redundant protein structures (nr-PDB) generated by the ProBiS algorithm. This database consists of ∼420 million pre-calculated pair-wise alignments and presents a faster alternative to the standard de novo protein similarity detection used by the ProBiS web server: structural similarity results are obtained in seconds from the ProBiS-Database. However, the database holds results only for the ∼29 000 nr-PDB proteins; for non-nr-PDB proteins, results are for the closest homologue in the nr-PDB.

Improved user interface

New features in ProBiS include: (i) submission of a binding site as a query (previously only complete proteins could be used as queries); (ii) comparison of the query protein against a user-provided list of PDB/Chain IDs (previously, comparison was possible only against proteins in the nr-PDB); (iii) links to other protein annotation resources, such as Pfam, SCOP or UniProt; (iv) extended download options, with results available as CSV, XML, JSON or PDB files; and (v) searching within the table of similar proteins. A complete list of new features is available in the User's Guide at http://probis.cmm.ki.si. RESTful web services have been implemented to allow access to the ProBiS web server by user-written programs or scripts. A list of available commands is on the ProBiS web server home page.

Methodological improvements

Statistically meaningful ranking of the identified locally similar proteins or binding sites is achieved through the use of standard scores (Z-Scores) (27), which replace the various alignment scores described previously (26).
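A standard score expresses each raw score as its distance from the mean in units of the standard deviation, z = (x - mean) / sd. The sketch below shows only this general definition; which raw alignment score ProBiS standardizes and over which population of alignments is detailed in reference (27), not here, and the use of the population standard deviation is an assumption.

```python
from statistics import mean, pstdev


def z_scores(scores):
    """Standardize raw scores: subtract the mean, divide by the population SD."""
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        return [0.0 for _ in scores]  # all scores identical: nothing stands out
    return [(s - mu) / sigma for s in scores]


# Hypothetical raw alignment scores for five database proteins.
raw = [12.0, 15.0, 11.0, 30.0, 13.0]
for score, z in zip(raw, z_scores(raw)):
    print(score, round(z, 2))
```

A hit whose raw score lies far above the bulk of the distribution receives a large positive Z-Score, which is what allows hits from different queries to be ranked on a common scale.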

EXAMPLES

Binding sites detection

For the query protein (PDB/Chain ID: 2q4u.A), ProBiS accurately detects three conserved binding sites, as shown in Figure 3A. The Ca2+ ion, which is co-crystallized in the similar protein calcineurin B (2ehb.A), is shown in Figure 3C, and reveals a probable binding pose of the Ca2+ in the query protein. Proteins 2q4u.A and 2ehb.A have similar binding sites despite their low amino acid identity, i.e. between 2 and 10%, as judged by different pairwise structural alignment methods in the ‘3D Similarity’ tab at the RCSB PDB web page (28). ProBiS also predicts binding of five other Ca2+ ions to the query protein; these are not shown in Figure 3 but can be seen by viewing alignments 1–4 in Jmol for the similar protein calmodulin (1fw4.A).

Drug repositioning

ProBiS can detect weak binding site similarities in proteins with different protein folds. Here, we present an example of drug repositioning. Protein kinase inhibitors were considered for inhibition of bacterial enzyme d-alanine–d-alanine ligase (Ddl), based on the similarities between ATP binding sites in protein kinases and in Ddl (32,33). However, repositioning was only done for a small proportion of all available protein kinases, which manifested this similarity. Using the ATP binding site of Ddl, 1iov.A, shown in Figure 2, as the query, ProBiS detects a previously unrecognized similarity between this binding site and an ATP binding site in protein kinase C (PKC; 1xjd.A); the superimposition of these two binding sites is shown in Figure 4. Here, the ADP ligand of Ddl is superimposed upon the PKC inhibitor, staurosporine, as a consequence of the alignment of the two similar binding sites. This supports the suggestion that staurosporine may bind to Ddl, and that it should be experimentally tested for inhibition of Ddl.

Figure 4.

Superimposition of similar binding sites in Ddl (1iov.A; blue) and PKC (1xjd.A; violet). The superimposed residues from Ddl and protein kinase C are shown as thin wireframe models. The co-crystallized ligands, ADP (blue) and staurosporine (violet), are shown as thick wireframe models.

Off-target prediction

Many inhibitors have been developed to compete for the ATP binding sites of protein kinases, but their poor selectivity usually eliminates them from consideration as clinical agents (34). The kinase 1z57.A is important in the control of alternative splicing and selective inhibitors targeting its ATP binding site have been developed. Debrohymenialdisine, for example, co-crystallized with human dual-specificity kinase, has been reported as 1z57.A (35), and we found that the ATP binding site of this protein kinase is similar to an ATP binding site in human ATP-citrate synthase (3mwd.A) as shown in Figure 5. Sequence identity of 1z57.A and 3mwd.A proteins is between 4 and 8% as computed by the FATCAT (36) and DaliLite (37) structural alignment algorithms. However, these two algorithms are not able to detect the binding site similarity. This result thus suggests that debrohymenialdisine, inhibiting the ATP binding site in this human dual-specificity protein kinase, would probably have undesirable side effects in human patients, due to off-target binding to ATP-citrate synthase.

Figure 5.

Superimposition of similar binding sites in dual-specificity kinase (1z57.A; blue) and ATP-citrate synthase (3mwd.A; violet). The superimposed residues are thin wireframe models. The inhibitor debrohymenialdisine is colored blue and is a thick wireframe model.

SOFTWARE REQUIREMENTS

The ProBiS web server requires Sun Java plugin Version 6 Update 26 or higher (http://www.java.com), and has been shown to function correctly with Firefox, IE8, Chrome 14.0, Safari 5.1 and Opera 11.5 web browsers. It also works with OpenJDK (IcedTea-Web 1.1.1) plugin on Firefox.

CONCLUSION

ProBiS is a web server for detection of local structural similarities in proteins. It allows detection of similar three-dimensional patterns of residues in protein structures irrespective of protein folds and with no prior knowledge of binding sites. ProBiS enables the detection of similar binding sites in differently folded proteins, and can suggest protein targets amenable to drug repositioning. It can also be used to generate hypotheses for protein functions and for the prediction of off-target effects. To our knowledge, there is no such comprehensive, freely available web server that would allow these functions in this automated and intuitive manner. ProBiS can provide useful insights to experimentalists, and can directly suggest molecules that have a potential value in pharmaceutical applications.

FUNDING

Ministry of Higher Education, Science and Technology of Slovenia; Slovenian Research Agency [P1-0002, Z1-3666]. Funding for open access charge: National Institute of Chemistry, Ljubljana, Slovenia.

Conflict of interest statement. None declared.

35 in total

1. The Protein Data Bank.

Authors: H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal: Nucleic Acids Res Date: 2000-01-01 Impact factor: 16.971

2. DaliLite workbench for protein structure comparison.

Authors: L Holm; J Park
Journal: Bioinformatics Date: 2000-06 Impact factor: 6.937

3. Annotation in three dimensions. PINTS: Patterns in Non-homologous Tertiary Structures.

Authors: Alexander Stark; Robert B Russell
Journal: Nucleic Acids Res Date: 2003-07-01 Impact factor: 16.971

4. Flexible structure alignment by chaining aligned fragment pairs allowing twists.

Authors: Yuzhen Ye; Adam Godzik
Journal: Bioinformatics Date: 2003-10 Impact factor: 6.937

Review 5. Drug repositioning: identifying and developing new uses for existing drugs.

Authors: Ted T Ashburn; Karl B Thor
Journal: Nat Rev Drug Discov Date: 2004-08 Impact factor: 84.694

6. Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution.

Authors: R B Russell
Journal: J Mol Biol Date: 1998-06-26 Impact factor: 5.469

Review 7. Old friends in new guise: repositioning of known drugs with structural bioinformatics.

Authors: V Joachim Haupt; Michael Schroeder
Journal: Brief Bioinform Date: 2011-03-26 Impact factor: 11.622

8. PAR-3D: a server to predict protein active site residues.

Authors: Kshama Goyal; Debasisa Mohanty; Shekhar C Mande
Journal: Nucleic Acids Res Date: 2007-05-03 Impact factor: 16.971

9. E-MSD: an integrated data resource for bioinformatics.

Authors: S Velankar; P McNeil; V Mittard-Runte; A Suarez; D Barrell; R Apweiler; K Henrick
Journal: Nucleic Acids Res Date: 2005-01-01 Impact factor: 16.971

10. eF-seek: prediction of the functional sites of proteins by searching for similar electrostatic potential and molecular surface shape.

Authors: Kengo Kinoshita; Yoichi Murakami; Haruki Nakamura
Journal: Nucleic Acids Res Date: 2007-06-12 Impact factor: 16.971
