| Literature DB >> 27802573 |
A Vangone1, J P G L M Rodrigues1, L C Xue1, G C P van Zundert1, C Geng1, Z Kurkcuoglu1, M Nellen1, S Narasimhan1, E Karaca1, M van Dijk1, A S J Melquiond1, K M Visscher1, M Trellet1, P L Kastritis1, A M J J Bonvin1.
Abstract
Our information-driven docking approach HADDOCK is a consistent top predictor and scorer since the start of its participation in the CAPRI community-wide experiment. This sustained performance is due, in part, to its ability to integrate experimental data and/or bioinformatics information into the modelling process, and also to the overall robustness of the scoring function used to assess and rank the predictions. In the CASP-CAPRI Round 1 scoring experiment we successfully selected acceptable/medium quality models for 18/14 of the 25 targets - a top-ranking performance among all scorers. Considering that for only 20 targets acceptable models were generated by the community, our effective success rate reaches as high as 90% (18/20). This was achieved using the standard HADDOCK scoring function, which, thirteen years after its original publication, still consists of a simple linear combination of intermolecular van der Waals and Coulomb electrostatics energies and an empirically derived desolvation energy term. Despite its simplicity, this scoring function makes sense from a physico-chemical perspective, encoding key aspects of biomolecular recognition. In addition to its success in the scoring experiment, the HADDOCK server takes the first place in the server prediction category, with 16 successful predictions. Much like our scoring protocol, because of the limited time per target, the predictions relied mainly on either an ab initio center-of-mass and symmetry restrained protocol, or on a template-based approach whenever applicable. These results underline the success of our simple but sensible prediction and scoring scheme. Proteins 2017; 85:417-423.Entities:
Keywords: biomolecular complexes; data-driven docking; desolvation energy; docking; electrostatics; ranking; scoring functions; van der Waals energy
Mesh:
Substances:
Year: 2016 PMID: 27802573 PMCID: PMC5324763 DOI: 10.1002/prot.25198
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Summary of HADDOCK Prediction and Scoring Performance in CASP‐CAPRI
| Target name | Oligomeric state | HADDOCK server prediction | HADDOCK scoring prediction |
|---|---|---|---|
| T68§ | Homo‐dimer | 0 | 0 |
| T69 | Homo‐dimer | 10 | 4/3** |
| T70 | Homo‐tetramer | 8/7** | 8/6** |
| T71 | Homo‐tetramer | 4 | 2 |
| T72 | Homo‐dimer | 1 | 2 |
| T73 | Homo‐tetramer | 0 | 0 |
| T74§ | Homo‐tetramer | 0 | 0 |
| T75 | Homo‐dimer | 2 | 4/2** |
| T77§ | Homo‐dimer | 0 | 0 |
| T78§ | Homo‐tetramer | 0 | 0 |
| T79 | Homo‐dimer | 0 | 2/1** |
| T80 | Homo‐dimer | 10 | 2/2** |
| T81 | Hetero‐dimer | 0 | 2 |
| T82 | Homo‐dimer | 10/10** | 10** |
| T84 | Homo‐dimer | 10/10** | 7/4** |
| T85 | Homo‐dimer | 10/10** | 10/6** |
| T86 | Homo‐dimer | 0 | 0 |
| T87 | Homo‐dimer | 10/10** | 7/5** |
| T88§ | Hetero‐dimer | 0 | 0 |
| T89 | Hetero‐dimer | 10/8** | 4/1** |
| T90 | Homo‐dimer | 10/10** | 5/4** |
| T91 | Homo‐dimer | 10/10** | 8/6** |
| T92 | Homo‐dimer | 6 | 6/5** |
| T93 | Homo‐dimer | 10/10** | 3/2** |
| T94 | Homo‐dimer | 10 | 3 |
| #Targets | 16/9** | 18/14** |
The overall performance is reported at the bottom in terms of number of successful targets, that is, the ones having at least one acceptable or better solution. The numbers without stars indicate the number of models of acceptable or better quality; numbers with “**” indicate the number of medium quality models.
For targets labeled with “§,” no single acceptable solution was present in the pool of models provided for scoring. See also Supporting Information Table S1 for details about how the server predictions were made.
Oligomeric state provided to the CAPRI participants by the time of the challenge. For further information see Lensink et al. (2016).11
Figure 1Comparison of the scoring performance of HADDOCK versus the CAPRI community in terms of i‐RMSD, l‐RMSD and Fnat. For each scored interface, the (A) backbone i‐RMSD, (B) l‐RMSD and (C) Fnat values over all the submitted models are reported. Data are shown only for those targets for which at least one acceptable or better solution has been submitted by the community. The number after the target ID refers to the specific interface evaluated in case of multimers with more than two subunits. The i‐RMSD/l‐RMS/Fnat of the HADDOCK models are indicated in black‐empty dots with the best one highlighted with a black‐filled circle and the i‐RMSD/l‐RMSD/Fnat of the best model submitted among the CAPRI scorers except HADDOCK (CAPRI‐community) is indicated by a filled‐gray circle. The box plots report the i‐RMSD/l‐RMSD/Fnat distributions for all models submitted by “others,” each scorer group being allowed to submit 10 models. Background‐gray regions correspond to the thresholds of 4Å/10Å/0.1 for i‐RMSD/l‐RMSD/Fnat, respectively, defining for defining acceptable or better models according to the CAPRI criteria. Increasingly shades of gray have been used for acceptable/medium/acceptable modes thresholds. The median is indication by a thick gray line; the left and right edges of the box indicate the 25th and 75th percentiles, respectively. The whiskers represent all data lying within 1.5 times the interquartile range. Values that falls outside this range are considered outliers and are not shown (plot generated with default values in MATLAB29).
Figure 2Comparison of the quality of the docked and starting homology models for the unsuccessfully predicted targets. The i‐RMSD of the best model predicted by HADDOCK (grey bars) and the overall backbone RMSD of the unbound homology model used as input structure for docking (light gray bars) are shown for those targets for which no successful HADDOCK predictions were submitted. The dashed line indicates the CAPRI 4.0 Å acceptable i‐RMSD cut‐off. For heterodimers targets (T81 and T89), the backbone‐RMSD of the worst model among the two unbound modeled proteins is reported.