| Literature DB >> 17488838 |
Hani Neuvirth1, Uri Heinemann, David Birnbaum, Naftali Tishby, Gideon Schreiber.
Abstract
The development of bioinformatic tools by individual labs results in the abundance of parallel programs for the same task. For example, identification of binding site regions between interacting proteins is done using: ProMate, WHISCY, PPI-Pred, PINUP and others. All servers first identify unique properties of binding sites and then incorporate them into a predictor. Obviously, the resulting prediction would improve if the most suitable parameters from each of those predictors would be incorporated into one server. However, because of the variation in methods and databases, this is currently not feasible. Here, the protein-binding site prediction server is extended into a general protein-binding sites research tool, ProMateus. This web tool, based on ProMate's infrastructure enables the easy exploration and incorporation of new features and databases by the user, providing an evaluation of the benefit of individual features and their combination within a set framework. This transforms the individual research into a community exercise, bringing out the best from all users for optimized predictions. The analysis is demonstrated on a database of protein protein and protein-DNA interactions. This approach is basically different from that used in generating meta-servers. The implications of the open-research approach are discussed. ProMateus is available at http://bip.weizmann.ac.il/promate.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17488838 PMCID: PMC1933218 DOI: 10.1093/nar/gkm301
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Success rate of proMate with new features
| Features combination | Number of predictions | Coverage | Number of successful predictions | Success rate |
|---|---|---|---|---|
| ProMate | 51 | 0.89 | 36 | 0.71 |
| Re-optimized ProMate | 55 | 0.96 | 38 | 0.69 |
| ProMate + ConSurf ( | 49 | 0.86 | 35 | 0.71 |
| ProMate + WHISCY ( | 44 | 0.77 | 33 | 0.75 |
| ProMate + CMDist ( | 42 | 0.74 | 26 | 0.62 |
Figure 1.An overview of the protein–protein interactions database induced by comparing the different methods. The pie chart shows the fraction of proteins where the binding site was predicted by all 4 methods (20 proteins), 3 methods (8), 2 methods (12), 1 method (4) or where all the predictors failed (13 proteins).
Figure 2.Comparison of secondary structure distribution at protein versus amino- acid level. The amino acid secondary structure categories at the upper rows were extracted using PROMOTIF, with H representing helices, G–3–10 helices; E–strands; S–bends; T–turns; B–beta bridges; Small letters stand for edges of the relevant structure. The lower row was extracted using the method by Raveh et al. (26) that is based on contact map clustering. The classification as described by the authors: 1: sheet-like loops; 2: parallel sheets; 3: anti-parallel sheets; 4–6: loops; 7: short and flexible helices; 8: long helices.