| Literature DB >> 28831657 |
Zeynep Kurkcuoglu1, Panagiotis I Koukos1, Nevia Citro1, Mikael E Trellet1, J P G L M Rodrigues2, Irina S Moreira1,3, Jorge Roel-Touris1, Adrien S J Melquiond1, Cunliang Geng1, Jörg Schaarschmidt1, Li C Xue1, Anna Vangone1, A M J J Bonvin4.
Abstract
We present the performance of HADDOCK, our information-driven docking software, in the second edition of the D3R Grand Challenge. In this blind experiment, participants were requested to predict the structures and binding affinities of complexes between the Farnesoid X nuclear receptor and 102 different ligands. The models obtained in Stage1 with HADDOCK and ligand-specific protocol show an average ligand RMSD of 5.1 Å from the crystal structure. Only 6/35 targets were within 2.5 Å RMSD from the reference, which prompted us to investigate the limiting factors and revise our protocol for Stage2. The choice of the receptor conformation appeared to have the strongest influence on the results. Our Stage2 models were of higher quality (13 out of 35 were within 2.5 Å), with an average RMSD of 4.1 Å. The docking protocol was applied to all 102 ligands to generate poses for binding affinity prediction. We developed a modified version of our contact-based binding affinity predictor PRODIGY, using the number of interatomic contacts classified by their type and the intermolecular electrostatic energy. This simple structure-based binding affinity predictor shows a Kendall's Tau correlation of 0.37 in ranking the ligands (7th best out of 77 methods, 5th/25 groups). Those results were obtained from the average prediction over the top10 poses, irrespective of their similarity/correctness, underscoring the robustness of our simple predictor. This results in an enrichment factor of 2.5 compared to a random predictor for ranking ligands within the top 25%, making it a promising approach to identify lead compounds in virtual screening.Entities:
Keywords: Binding affinity; D3R; Docking; Drug design data resource; Intermolecular contacts; Ranking
Mesh:
Substances:
Year: 2017 PMID: 28831657 PMCID: PMC5767195 DOI: 10.1007/s10822-017-0049-y
Source DB: PubMed Journal: J Comput Aided Mol Des ISSN: 0920-654X Impact factor: 3.686
Comparison of the prediction performance of atom-pair and maximum common substructure predictors on the training dataset using 10 repeats of 5-fold cross-validation
| Atom-pair | Maximum common substructure | |
|---|---|---|
| Kendall’s Tau | 0.52 ± 0.01 | 0.50 ± 0.01 |
| Pearson’s correlation coefficient | 0.70 ± 0.01 | 0.68 ± 0.02 |
Fig. 1Examples of successfully predicted ligand poses in Stage1 for (a) FXR-27 (b) FXR-34 with a l-RMSD of 1.27 and 1.94 Å, respectively. The receptor conformations are shown as cartoon and the ligands as stick representation. The reference crystal structure is colored grey and the model as slate
Fig. 2Comparison of the l-RMSDs of the top5 scoring poses between stages 1 and 2. l-RMSD values of the top5 poses are drawn as boxplots with the values of Stage1 colored dark gray and those of Stage2 light gray. The black line in the middle of the boxes corresponds to the median, the lower and upper hinges correspond to the 25th and 75th percentile respectively, the whiskers extend to no longer than 1.5 times the IQR from the hinge. Any point beyond that range is considered an outlier and drawn as a filled black point. The circles correspond to the overall minimum l-RMSD obtained in it1 for that target. In the cases where the circle overlaps with an outlier or a boxplot, the minimum l-RMSD structure is part of the top5 scoring poses. The dotted line represents the l-RMSD cutoff of 2.5 Å. The number of successful predictions increases from 6/35 in Stage1 to 13/35 in Stage2
Fig. 3Successful prediction (l-RMSD < 2.5 Å) rates for top1, top5 and top10 in different docking runs for 35 targets. Bound-ligand docking refers to runs with bound ligand conformer and the ensemble of receptors used in Stage1. Bound-receptor is the one with bound receptor and the ensemble of ligands used in Stage1. Finally, bound–bound is the bound receptor-bound ligand docking runs
Fig. 4Comparison of the top100 models for the protocols used for stages 1 and 2. Each bar corresponds to structures belonging to runs for the indicated target. The coloring of the bars separates the structures in 3 classes. Structures colored black have a l-RMSD smaller than 2.5 Å, structures colored dark gray have a l-RMSD between 2.5 and 3.5 Å and structures with a l-RMSD of greater than 3.5 Å are colored light gray. The top-ranked structures are the ones close to zero on the y-axis
Fig. 5Comparison of predicted ln(IC50) with experimental ln(IC50) using our ligand-based binding affinity predictor
Fig. 6Ranking of binding affinity correlation per group for stages 1 and 2. The top panel reports the results of Stage1 and the bottom one of Stage2. Bars colored light gray correspond to groups which did not provide submissions for all targets. The bars colored dark gray correspond to the HADDOCK group submission
Fig. 7Positive predictive value (bottom) and enrichment factor (top) for 102 targets, using structure-based binding affinity predictor. Taking top 20–25% is associated with 2.5 enrichment factor