| Literature DB >> 29632488 |
Sami T Kurkinen1, Sanna Niinivehmas1, Mira Ahinko1, Sakari Lätti1, Olli T Pentikäinen1,2, Pekka A Postila1.
Abstract
Despite the large computational costs of molecular docking, the default scoring functions are often unable to recognize the active hits from the inactive molecules in large-scale virtual screening experiments. Thus, even though a correct binding pose might be sampled during the docking, the active compound or its biologically relevant pose is not necessarily given high enough score to arouse the attention. Various rescoring and post-processing approaches have emerged for improving the docking performance. Here, it is shown that the very early enrichment (number of actives scored higher than 1% of the highest ranked decoys) can be improved on average 2.5-fold or even 8.7-fold by comparing the docking-based ligand conformers directly against the target protein's cavity shape and electrostatics. The similarity comparison of the conformers is performed without geometry optimization against the negative image of the target protein's ligand-binding cavity using the negative image-based (NIB) screening protocol. The viability of the NIB rescoring or the R-NiB, pioneered in this study, was tested with 11 target proteins using benchmark libraries. By focusing on the shape/electrostatics complementarity of the ligand-receptor association, the R-NiB is able to improve the early enrichment of docking essentially without adding to the computing cost. By implementing consensus scoring, in which the R-NiB and the original docking scoring are weighted for optimal outcome, the early enrichment is improved to a level that facilitates effective drug discovery. Moreover, the use of equal weight from the original docking scoring and the R-NiB scoring improves the yield in most cases.Entities:
Keywords: benchmarking; consensus scoring; docking rescoring; molecular docking; negative image-based rescoring (R-NiB)
Year: 2018 PMID: 29632488 PMCID: PMC5879118 DOI: 10.3389/fphar.2018.00260
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Figure 1Negative image-based rescoring workflow. Firstly, the protein 3D structure (neuraminidase; gray cartoon; PDB: 1B9V) (Finley et al., 1999) and ligand 3D structures for molecular docking are prepared (e.g., protonation). Secondly, the ligand-binding cavity is outlined using a detection radius for docking (yellow transparent circle above) and NIB model generation (yellow transparent surface below). If there exist a bound ligand in the PDB entry (BANA206 as a stick model with cyan backbone in the close-up below), it can be used in defining the cavity center and/or dimensions. Thirdly, the docking of ligands into the cavity is performed using a standard docking software and multiple docking solutions or conformers are outputted for rescoring. Fourthly, a cavity-based NIB model, composed of explicit cavity points (white neutral; blue positive; red negative) is generated with PANTHER (Niinivehmas et al., 2015) for the same cavity. Fifthly, the NIB model shape/electrostatics (transparent surface with charge potential) are compared directly against the docking solutions using a similarity comparison algorithm ShaEP (Vainio et al., 2009) without geometry optimization. Those solutions matching the cavity information are given higher scores than the ones that differ.
Target protein 3D structures used in the virtual screening.
| ER-agonist | 1L2I | 1.95 | 67 | 2,352 | – | – | – | – |
| ER-antagonist | 3ERT | 1.9 | 39 | 1,394 | – | – | – | – |
| ER-mixed | – | – | 106 | 3,746 | 1SJ0 | 1.9 | 383 | 20,663 |
| AR | 2AO6 | 1.89 | 74 | 2,628 | 2AM9 | 1.64 | 269 | 14,343 |
| GR | 1M2Z | 2.5 | 78 | 2,797 | 3BQD | 2.5 | 258 | 14,986 |
| MR | 2AA2 | 1.95 | 15 | 535 | 2AA2 | 1.95 | 94 | 5,146 |
| PPARγ | 1FM9 | 2.1 | 81 | 2,906 | 2GTK | 2.1 | 484 | 25,256 |
| RXRα | 1MVC | 1.9 | 20 | 706 | 1MV9 | 1.9 | 131 | 6,935 |
| COX2 | 1CX2 | 3.0 | 348 | 12,462 | 3LN1 | 2.4 | 435 | 23,136 |
| PDE5 | 1XP0 | 1.79 | 51 | 1,808 | 1UDT | 2.3 | 398 | 27,520 |
| 1UDT | 2.3 | – | – | 1XOZ | 1.37 | – | – | |
| PR | 1SR7 | 1.46 | 27 | 967 | 3KBA | 2.0 | 293 | 15,642 |
| NEU | – | – | – | – | 1B9V | 2.35 | 98 | 6,197 |
| CYP3A4 | – | – | – | – | 3NXU | 2.0 | 170 | 11,797 |
AR, androgen receptor; COX2, cyclo-oxygenase 2; CYP3A4, cytochrome P450 3A4; ER, estrogen receptor alpha; GR, glucocorticoid receptor; MR, mineralocorticoid receptor; NEU, neuraminidase; PPARγ, peroxisome proliferator activated receptor gamma; PR, progesterone receptor; RXRα, retinoid X receptor alpha; PDE5, phosphodiesterase type 5.
ER-agonist, ER-antagonist and ER-mixed refer to ligand sets containing ER-specific agonists, antagonists or both, respectively.
Number of active ligands (Ligs) and decoy (Decs) molecules after preprocessing with LIGPREP.
In the DUD database, ER agonists and antagonists are separated into two separate datasets, but in the case of the DUD-E the ligands are mixed. For comparison, the ER datasets in the DUD were also mixed.
Used in the NIB model generation.
The AUC values for the DUD datasets.
| ER-agonist | 0.81 ± 0.03 | 0.78 ± 0.03 | 0.76 ± 0.03 | 0.79 ± 0.03 | 0.82 ± 0.03 | 0.78 ± 0.03 |
| ER-antagonist | 0.81 ± 0.04 | 0.85 ± 0.04 | 0.77 ± 0.04 | 0.82 ± 0.04 | 0.71 ± 0.05 | 0.83 ± 0.04 |
| ER-mixed | 0.64 ± 0.03 | 0.66 ± 0.03 | 0.61 ± 0.03 | |||
| AR | 0.80 ± 0.03 | 0.81 ± 0.03 | – | 0.79 ± 0.03 | 0.78 ± 0.03 | |
| GR | 0.60 ± 0.03 | 0.53 ± 0.03 | ||||
| MR | 0.80 ± 0.07 | 0.82 ± 0.07 | 0.78 ± 0.07 | |||
| PPARγ | 0.95 ± 0.02 | 0.92 ± 0.02 | 0.87 ± 0.03 | – | 0.81 ± 0.03 | 0.94 ± 0.02 |
| PR | 0.63 ± 0.06 | 0.52 ± 0.06 | 0.50 ± 0.06 | 0.50 ± 0.06 | 0.51 ± 0.06 | 0.58 ± 0.06 |
| RXRα | 0.78 ± 0.06 | 0.84 ± 0.06 | 0.76 ± 0.06 | |||
| COX2 | 0.81 ± 0.01 | 0.65 ± 0.02 | ||||
| PDE5 | 0.71 ± 0.04 | 0.67 ± 0.04 | 0.67 ± 0.04 | 0.72 ± 0.04 | 0.54 ± 0.04 | 0.66 ± 0.04 |
If the rescoring produced higher AUC value in comparison to the initial docking (no overlapping standard error ranges), those numbers are shown in bold.
The ligand distance limit used in PANTHER varied between the targets due to the size/shape differences of the binding cavities and the screened ligand sets. Limits included 1.5 Å (ER, AR, MR, PPARγ, PR RXRα, and COX2), 2.0 Å (GR), and 3.0 Å (PDE5).
The box radius varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. The radiuses included 6.0 Å (GR, PR and COX2), 7.0 Å (ER-mixed, MR and RXRα), and 8.0 Å (ER-agonist, ER-antagonist, AR, PPARγ and PDE5).
The previously published PANTHER models, optimized for regular NIB screening, were taken from a prior study (Niinivehmas et al., .
The AUC values for the DUD-E datasets.
| ER-mixed | 0.74 ± 0.01 | 0.66 ± 0.02 | 0.65 ± 0.02 | – | 0.71 ± 0.01 | 0.70 ± 0.02 |
| AR | 0.54 ± 0.02 | 0.53 ± 0.02 | ||||
| GR | 0.54 ± 0.02 | 0.51 ± 0.02 | ||||
| MR | 0.55 ± 0.03 | 0.53 ± 0.03 | ||||
| PPARγ | 0.85 ± 0.01 | 0.77 ± 0.01 | 0.75 ± 0.01 | – | 0.66 ± 0.01 | 0.84 ± 0.01 |
| PR | 0.63 ± 0.02 | 0.63 ± 0.02 | 0.61 ± 0.02 | |||
| RXRα | 0.77 ± 0.02 | 0.70 ± 0.03 | ||||
| COX2 | 0.66 ± 0.01 | 0.65 ± 0.01 | – | 0.62 ± 0.01 | 0.67 ± 0.01 | |
| PDE5 | 0.78 ± 0.01 | 0.72 ± 0.02 | 0.70 ± 0.02 | – | 0.58 ± 0.02 | 0.74 ± 0.01 |
| NEU | 0.85 ± 0.02 | – | 0.68 ± 0.03 | 0.56 ± 0.03 | ||
| CYP3A4 | 0.61 ± 0.02 | 0.60 ± 0.02 | 0.60 ± 0.02 | – | 0.53 ± 0.02 | 0.60 ± 0.02 |
If the rescoring produced higher AUC value in comparison to the initial docking (no overlapping standard error ranges), those numbers are shown in bold.
The ligand distance limit used in PANTHER varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. Limits included 1.5 Å (ER-mixed, AR, PPARγ, PR, and COX2), 2.0 Å (MR, RXRα, NEU, PDE5, and CYP3A4) and 3.0 Å (GR).
The box radius varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. The radiuses included 6.0 Å (AR, GR, MR, COX2, NEU, and PR), 7.0 Å (PDE5, RXRα, and CYP3A4) and 9.0 Å (PPARγ) and 10.0 Å (ER-mixed).
The previously published PANTHER models, optimized for regular NIB screening, were taken from a prior study (Niinivehmas et al., .
The enrichment given as true positive rates for the DUD datasets.
| ER-agonist | 1% | 17.9 | 23.9 | 19.4 | 10.4 | ||
| 5% | 44.8 | 52.2 | 58.2 | 59.7 | 52.2 | 26.9 | |
| ER-antagonist | 1% | 15.4 | 7.7 | 12.8 | 15.4 | 12.8 | |
| 5% | 33.3 | 43.6 | 25.6 | 38.5 | 25.6 | 35.9 | |
| ER-mixed | 1% | 0.0 | 0.0 | ||||
| 5% | 20.8 | 23.6 | 5.7 | 8.5 | 6.6 | 7.5 | |
| AR | 1% | 17.6 | 27.0 | 12.2 | – | 9.5 | 14.9 |
| 5% | 40.5 | 45.9 | 45.9 | – | 31.1 | 39.2 | |
| GR | 1% | 6.4 | 29.5 | 3.8 | |||
| 5% | 15.4 | 50.0 | 14.1 | ||||
| MR | 1% | 26.7 | 33.3 | 13.3 | 0.0 | 0.0 | 33.3 |
| 5% | 60.0 | 73.3 | 40.0 | 26.7 | 40.0 | 60.0 | |
| PPARγ | 1% | 69.1 | 79.0 | 22.2 | – | 21.0 | 66.7 |
| 5% | 84.0 | 86.4 | 65.4 | – | 48.1 | 85.2 | |
| PR | 1% | 3.7 | 3.7 | ||||
| 5% | 11.1 | 7.4 | |||||
| RXRα | 1% | 5.0 | 0.0 | ||||
| 5% | 30.0 | 30.0 | |||||
| COX2 | 1% | 13.5 | 9.2 | ||||
| 5% | 35.3 | 20.1 | 44.8 | ||||
| PDE5 | 1% | 13.7 | 13.7 | 3.9 | 9.8 | ||
| 5% | 25.5 | 23.5 | 5.9 | 25.5 | |||
Those EF%.
The ligand distance limit used in PANTHER varied between the targets due to the size/shape differences of the binding cavities and the screened ligand sets. Limits included 1.5 Å (ER-agonist, ER-mixed, AR, MR, PPARγ, RXRα, and COX2) and 2.0 Å (GR and PR), 3.0 Å (ER-antagonist) and 4.0 Å (PDE5).
The box radius varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. The radiuses included 6.0 Å (MR and COX2), 7.0 Å (AR and PR) and 8.0 Å (ER's, GR, PPARγ and RXRα) and 9.0 Å (PDE5).
The previously published PANTHER models, optimized for regular NIB screening, were taken from a prior study (Niinivehmas et al., .
The enrichment given as true positive rates for the DUD-E datasets.
| ER-mixed | 1% | 21.7 | 18.3 | 5.5 | – | 6.3 | 12.8 |
| 5% | 36.6 | 32.6 | 20.1 | – | 24.8 | 28.7 | |
| AR | 1% | 1.5 | 1.9 | 0.4 | |||
| 5% | 7.1 | 7.8 | 5.2 | ||||
| GR | 1% | 1.2 | 1.2 | 1.2 | |||
| 5% | 12.0 | 12.8 | 10.5 | 10.1 | |||
| MR | 1% | 3.2 | 3.2 | 1.1 | 1.1 | ||
| 5% | 19.1 | 25.5 | 19.1 | 18.1 | 8.5 | 11.7 | |
| PPARγ | 1% | 24.2 | 4.5 | 10.3 | – | 5.0 | 19.6 |
| 5% | 57.0 | 24.4 | 32.4 | – | 13.8 | 48.3 | |
| PR | 1% | 2.0 | 2.0 | 2.4 | |||
| 5% | 17.1 | 17.1 | 11.6 | 17.4 | 11.6 | 15.0 | |
| RXRα | 1% | 11.5 | 6.9 | 1.5 | 10.7 | 15.3 | 1.5 |
| 5% | 37.4 | 25.2 | 12.2 | 23.9 | 45.8 | 19.8 | |
| COX2 | 1% | 5.7 | 2.3 | 0.5 | – | 2.1 | |
| 5% | 21.6 | 19.1 | 4.1 | – | 6.4 | 25.1 | |
| PDE5 | 1% | 11.3 | 10.6 | 3.8 | – | 1.5 | 8.8 |
| 5% | 28.1 | 25.9 | 14.1 | – | 7.0 | 24.4 | |
| NEU | 1% | 4.1 | – | 1.0 | 0.0 | ||
| 5% | 32.7 | 42.9 | 35.7 | – | 4.1 | 4.1 | |
| CYP3A4 | 1% | 7.1 | 7.6 | 5.3 | – | 2.4 | 6.5 |
| 5% | 12.9 | 15.3 | – | 6.5 | 13.5 | ||
Those EF%.
The ligand distance limit used in PANTHER varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. Limits included 1.5 Å (ER-mixed, AR, PDE5, GR, MR, PR and COX2), 2.0 Å (RXRα, NEU and CYP3A4) and 3.0 Å (PPARγ).
The box radius varied between the targets due to the size/shape differences of the binding cavities and screened ligand sets. The radiuses included 6.0 Å (AR, GR, MR and NEU), 7.0 Å (RXRα, PR, PDE5 and CYP3A4), 8.0 Å (COX2), 9.0 Å (PPARγ) and 11.0 Å (ER-mixed).
The previously published PANTHER models, optimized for regular NIB screening, were taken from a prior study (Niinivehmas et al., .
Figure 2The semi-logarithmic receiver operating characteristics plots for the docking and negative image-based rescoring with the DUD dataset. Only those R-NiB results with the highest early enrichment were plotted (EF1%DEC in Table 5). The red line shows the original docking enrichment by PLANTS, the blue line gives the result after PANTHER/ShaEP-based rescoring, and the black line gives the result from consensus scoring where both of them are given equal weight (50/50%). The dashed line outlines the random selection (AUC = 0.50). The semi-log10 scale is used only for the x axis to highlight the very early enrichment or lack thereof.
Figure 3The semi-logarithmic receiver operating characteristics plots for the docking and negative image-based rescoring with the DUD-E dataset. Only those R-NiB results with the highest very early enrichment were plotted (EF1%DEC in Table 6). With retinoid X receptor alpha (RXRα), the results are shown for the model (ligand exclusion of 2.0 Å; Table 6) producing the highest very early enrichment, which is visible in the plotted curve. For interpretation see Figure 2.
The consensus scoring of the DUD Datasets.
| ER-agonist | 0.70 | 0.81 ± 0.03 (↔) | 41.8 | 4.5 | 56.7 | 4.5 | 40.3 | 3.0 | 53.7 | 1.5 |
| ER-antagonist | 0.55 | 0.78 ± 0.04 (↓) | 35.9 | 7.7 | 43.6 | 0.0 | 35.9 | 7.7 | 43.6 | 0.0 |
| ER-mixed | 0.90 | 0.77 ± 0.03 (↑) | 11.3 | 0.0 | 26.4 | 2.8 | 7.5 | −3.8 | 29.2 | 5.6 |
| AR | 0.25 | 0.85 ± 0.03 (↑) | 32.4 | 5.4 | 47.3 | 1.4 | 28.4 | 1.4 | 50.0 | 4.1 |
| GR | 0.60 | 0.76 ± 0.03 (↑) | 19.2 | 2.5 | 26.9 | −1.3 | 19.2 | 2.5 | 25.6 | −2.6 |
| MR | 1.0 | 0.93 ± 0.05 (↑) | 33.3 | 0.0 | 73.3 | 0.0 | 33.3 | 0.0 | 73.3 | 0.0 |
| PPARγ | 0.35 | 0.93 ± 0.02 (↓) | 84.0 | 5.0 | 87.7 | 1.3 | 81.5 | 2.5 | 87.7 | 1.3 |
| PR | 0.60 | 0.53 ± 0.06 (↓) | 33.3 | 0.0 | 40.7 | 0.0 | 22.2 | −11.1 | 40.7 | 0.0 |
| RXRα | 1.0 | 0.89 ± 0.05 (↑) | 35.0 | 0.0 | 80.0 | 0.0 | 25.0 | −10.0 | 80.0 | 0.0 |
| COX2 | 0.80 | 0.95 ± 0.01 (↑) | 65.2 | 2.6 | 82.8 | −0.2 | 59.8 | −2.8 | 77.6 | −5.4 |
| PDE5 | 0.85 | 0.64 ± 0.04 (↓) | 31.4 | 0.0 | 43.1 | 3.8 | 23.5 | −7.9 | 33.3 | −5.9 |
The NIB model producing the highest EF1%.
If the ShaEP weight is 1.0, the consensus score comes entirely from ShaEP rescoring, and, vice versa, if the weight is 0, only the PLANTS score is used. The value of 0.50 corresponds to the situation in which PLANTS docking and ShaEP rescoring effect have equal weight in the results. Both the ShaEP and PLANTS scores were normalized to fit the scale from 0 to 1 before combining them. The consensus scoring was not done to acquire the best AUC enrichment possible and, accordingly, upon a rare occasion the value could decrease (downward arrow) instead improving it (upward arrow).
ΔEF%.
Figure 4The cavity-based NIB models and the docking solutions are aligned. The protein 3D structures of (A) cytochrome P450 3A4 (CYP3A4; lime; PDB: 3NXU) (Sevrioukova and Poulos, 2010), (E) glucocorticoid receptor (GR; white; PDB: 1M2Z) (Bledsoe et al., 2005) and (I) neuraminidase (NEU; yellow; PDB: 1B9V) (Finley et al., 1999) are shown as opaque surfaces on the far left. With CYP3A4 and GR, the X-ray crystal structures are shown in two sections to highlight the buried locations of their active sites (mauve opaque surfaces) at the center. The dotted lines indicate the cutting planes for the cross-sections chosen for the illustration. The prosthetic heme group is shown as a CPK model (black backbone) for CYP3A4. With NEU, the enzyme's active which opens directly from the protein surface, is only partially buried and, thus, no cross-sectioning was done. The contours of the active sites of (B,C) CYP3A4, (F,G) GR, and (J,K) NEU are shown both as opaque surfaces and finalized NIB models (transparent surfaces with charge potential) in the cross-section close-ups. The red, blue, and white dots in the NIB model indicate the negative, positive and neutral cavity dots (or filler atoms) constituting the negative image. The docked poses of five known active compounds (stick models with orange backbone) for (D) CYP3A4, (H) GR, and (L) NEU from PLANTS are shown stacked in the far right.
Figure 5A negative image-based rescoring example with mineralocorticoid receptor. (A) The X-ray crystal structure of mineralocorticoid receptor (MR; silver cartoon model; PDB: 2AA2) (Bledsoe et al., 2005) and the amino acid residues (stick models) making hydrogen bonds (magenta dotted lines) with the inhibitor aldosterone (stick model with cyan backbone) are shown. (B) The negative image or NIB model (transparent surface) of the MR active site was build using the same PDB entry (Bledsoe et al., 2005) and the 1.5 Å ligand distance limit option in PANTHER. The red and blue dots depict the negatively and positively charged cavity points, respectively, whereas the white dots are neutral. (C) The rescored pose (rank #13) of hydrocortisone (stick model with orange backbone) reminds closely the experimentally verified pose of its structural analog aldosterone (A vs. C). (D) Hence, the pose of hydrocortisone given the highest score by PLANTS (rank #17), showing a reversed pose in comparison to the aldosterone (A vs. D), is likely erroneous (D).
The consensus scoring of the DUD-E datasets.
| ER-mixed | 0.35 | 0.69 ± 0.02 (↓) | 24.5 | 6.2 | 37.9 | 5.3 | 23.0 | 4.7 | 36.8 | 4.2 |
| AR | 1.0 | 0.76 ± 0.02 (↑) | 13.0 | 0.0 | 23.0 | 0.0 | 9.3 | −3.7 | 19.0 | −4.0 |
| GR | 1.0 | 0.70 ± 0.02 (↑) | 5.8 | 0.0 | 17.4 | 0.0 | 2.3 | −3.5 | 16.7 | −0.7 |
| MR | 1.0 | 0.70 ± 0.03 (↑) | 11.7 | 0.0 | 25.5 | 0.0 | 9.6 | −2.1 | 21.3 | −4.2 |
| PPARy | 0.20 | 0.85 ± 0.01 (↔) | 27.7 | 17.4 | 58.1 | 25.7 | 21.9 | 11.2 | 46.7 | 14.3 |
| PR | 0.55 | 0.72 ± 0.02 (↑) | 6.8 | 2.4 | 18.4 | 1.3 | 6.8 | 2.4 | 18.1 | 1.3 |
| RXRa | 0.25 | 0.82 ± 0.02 (↑) | 19.1 | 8.4 | 46.6 | 22.7 | 14.5 | 3.8 | 29.0 | 5.1 |
| COX2 | 0.10 | 0.69 ± 0.01 (↑) | 7.6 | 5.3 | 25.5 | 6.4 | 6.0 | 3.7 | 23.4 | 4.3 |
| PDE5 | 0.25 | 0.82 ± 0.01 (↑) | 17.6 | 7.0 | 36.4 | 10.5 | 13.8 | 3.2 | 31.7 | 5.8 |
| NEU | 0.50 | 0.91 ± 0.02 (↑) | 16.3 | 3.0 | 52.0 | 9.1 | 16.3 | 3.0 | 52.0 | 9.1 |
| CYP3A4 | 0.50 | 0.61 ± 0.02 (↔) | 10.6 | 3.0 | 21.2 | 2.4 | 10.6 | 3.0 | 21.2 | 2.4 |
The NIB model producing the highest EF1%.