| Literature DB >> 31290660 |
Sami T Kurkinen1, Sakari Lätti1, Olli T Pentikäinen1,2, Pekka A Postila3.
Abstract
The failure of default scoring functions to ensure virtual screening enrichment is a persistent problem for the molecular docking algorithms used in structure-based drug discovery. To remedy this problem, elaborate rescoring and postprocessing schemes have been developed with a varying degree of success, specificity, and cost. The negative image-based rescoring (R-NiB) has been shown to improve the flexible docking performance markedly with a variety of drug targets. The yield improvement is achieved by comparing the alternative docking poses against the negative image of the target protein's ligand-binding cavity. In other words, the shape and electrostatics of the binding pocket is directly used in the similarity comparison to rank the explicit docking poses. Here, the PANTHER/ShaEP-based R-NiB methodology is tested with six popular docking softwares, including GLIDE, PLANTS, GOLD, DOCK, AUTODOCK, and AUTODOCK VINA, using five validated benchmark sets. Overall, the results indicate that R-NiB outperforms the default docking scoring consistently and inexpensively, demonstrating that the methodology is ready for wide-scale virtual screening usage.Entities:
Year: 2019 PMID: 31290660 PMCID: PMC6750746 DOI: 10.1021/acs.jcim.9b00383
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1Negative image-based rescoring step-by-step. The negative image-based rescoring (R-NiB) protocol follows five steps: (1) Ligand-binding cavity and its centroid are selected from the protein 3D structure (a cartoon model of RXRα with the bound docosa hexaenoate or HXA; PDB: 1MV9).[33] (2) Negative image or NIB (negative image-based) model (transparent surface), composed of neutral filler atoms (cyan spheres) and negative cavity point (red sphere), is generated using PANTHER.[22] (3) Flexible molecular docking (e.g., VINA[34]) is performed for the ligands (e.g., lig. #1 and lig. #2 or C44184559 and CHEMBL2085503 in the DUD-E set for the RXRα[18]), and several (e.g., N = 3) alternative docking poses (stick models with white backbone) are outputted for each compound. (4) Cavity-based rescoring or the shape/charge comparison of docking poses (one at a time!) is used with the NIB model without geometry optimization using ShaEP.[32] (5) Comparison produces similarity scores (from 1 to 0) for each docked pose, and this information is used to rank the individual docking poses and the ligands. Based on the R-NiB ranking, compounds can be categorized or predicted as inactive (red stick model; e.g., lig. #1) or active (green stick model; e.g., lig #2). Note that the steps involved in the protein or ligand preparation for NIB model generation or docking, visual inspection of the best-ranked poses, or potential benchmarking efforts are omitted for brevity. The figure was created using BODIL[35] and Visual Molecular Dynamics or VMD 1.9.2.[36]
Figure 2Negative images of target proteins’ ligand-binding cavities. On the left, 3D structures of target proteins (cartoon models) with co-crystallized ligands (CPK models) at binding cavities. In the middle, cross sections of binding cavities (opaque surfaces) in close ups (red boxes). PANTHER[22]-generated negative images or NIB (negative image-based) models are composed of neutral filler atoms and negatively or positively charged cavity points (cyan/red/blue spheres). On the right, NIB models are shown with space-filling transparent surfaces either with cavity points or an active ligand from PLANTS[37] docking (sticks with a green backbone). Both the shape and volume (N = 44–79) of NIB models vary substantially between (A) mineralocorticoid receptor (MR; PDB: 2AA2),[49] (B) neuraminidase (NEU; PDB: 1B9V),[50] (C) retinoid X receptor alpha (RXRα; PDB: 1MV9; different angle shown in Figure ),[33] (D) cyclooxygenase-2 (COX-2; PDB: 3LN1),[46] and (E) phosphodiesterase-5 (PDE5; PDB: 1UDT).[29] NIB models aim to encompass only those cavity sections needed for ligand binding instead of filling the cavities to the brim. The figure was created using BODIL[35] and Visual Molecular Dynamics or VMD 1.9.2.[36]
Benchmark Ligand Sets and Protein 3D Structures
| Target protein | RXRα | COX-2 | PDE5 | MR | NEU | |
|---|---|---|---|---|---|---|
| PDB code | 1MV9 | 3LN1 | 1UDT | 1XOZ | 2AA2 | 1B9V |
| Resolution (Å) | 1.9 | 2.4 | 2.3 | 1.37 | 1.95 | 2.35 |
| 131 | 435 | 398 | 398 | 94 | 98 | |
| 6935 | 23,136 | 27,520 | 27,520 | 5146 | 6197 | |
Retinoid X receptor alpha (RXRα),[33] cyclooxygenase-2 (COX-2),[46] phosphodiesterase-5 (PDE5),[47,48] mineralocorticoid receptor (MR),[49] neuraminidase (NEU).[50]
Only PDB entry 1UDT(48) was used for docking. Both 1UDT(48) and 1XOZ(47) were used in NIB (negative image-based) model generation.
Number of active ligands (ligs) and decoy molecules (decs) after performing ligand preparation with LIGPREP in MAESTRO (status before docking screening).
Performance of Molecular Docking Algorithms in Benchmarkinga
| GLIDE | DOCK | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| PLANTS | HTVS | SP | GOLD | Def. | Opt. | VINA | AUTODOCK | ||
| RXRα | AUC | 0.77 ± 0.02 | 0.68 ± 0.03 | 0.83 ± 0.02 | 0.76 ± 0.02 | 0.41 ± 0.02 | 0.52 ± 0.03 | 0.82 ± 0.02 | |
| EFd 1% | 11.5 | 44.3 | 16.8 | 4.5 | 8.4 | 40.5 | 54.2 | ||
| EFd 5% | 37.4 | 50.4 | 35.1 | 6.8 | 13.0 | 55.7 | 72.5 | ||
| docked | 100/100 | 51/31 | 82/67 | 100/100 | 26/45 | 73/69 | 100/100 | 100/100 | |
| COX-2 | AUC | 0.66 ± 0.01 | 0.66 ± 0.01 | 0.71 ± 0.01 | 0.61 ± 0.01 | 0.64 ± 0.01 | 0.61 ± 0.01 | ||
| EFd 1% | 5.7 | 24.1 | 37.2 | 12.8 | 11.3 | 11.3 | 33.8 | 3.0 | |
| EFd 5% | 21.6 | 35.2 | 46.7 | 38.1 | 20.2 | 22.1 | 11.7 | ||
| docked | 100/100 | 58/40 | 83/74 | 100/100 | 52/35 | 83/69 | 100/100 | 100/100 | |
| PDE5 | AUC | 0.67 ± 0.01 | 0.50 ± 0.01 | 0.54 ± 0.01 | 0.64 ± 0.02 | 0.61 ± 0.02 | |||
| EFd 1% | 7.0 | 10.3 | 9.0 | 0.3 | 1.5 | 5.8 | |||
| EFd 5% | 28.1 | 23.6 | 32.4 | 4.8 | 6.0 | 22.1 | 14.1 | ||
| docked | 100/100 | 83/84 | 94/99 | 100/100 | 89/94 | 96/97 | 100/100 | 100/100 | |
| MR | AUC | 0.48 ± 0.03 | 0.50 ± 0.03 | 0.47 ± 0.03 | 0.41 ± 0.03 | 0.42 ± 0.03 | 0.53 ± 0.03 | ||
| EFd 1% | 3.2 | 7.4 | 2.1 | 0.0 | 0.0 | 7.4 | |||
| EFd 5% | 19.1 | 9.6 | 19.1 | 7.4 | 1.1 | 0.0 | 9.6 | ||
| docked | 100/100 | 13/19 | 34/41 | 100/100 | 32/47 | 60/69 | 100/100 | 100/100 | |
| NEU | AUC | 0.61 ± 0.03 | 0.82 ± 0.03 | 0.69 ± 0.03 | 0.56 ± 0.03 | 0.62 ± 0.03 | |||
| EFd 1% | 4.1 | 18.4 | 2.0 | 12.2 | 14.3 | 0.0 | 0.0 | ||
| EFd 5% | 32.7 | 31.6 | 43.9 | 9.2 | 34.7 | 4.1 | 4.1 | ||
| docked | 100/100 | 96/97 | 100/100 | 100/100 | 99/98 | 100/99 | 100/100 | 100/100 | |
Best docking results are bolded and in italics, if the values are within the error margin. Only those AUC values that are within the error margin are highlighted.
Figure 3Docking and rescoring performance as receiver operator characteristic curves. The linear receiver operator characteristic (ROC) curves are plotted for the original docking and rescoring results of negative image-based rescoring (R-NiB; Figure )[21] or SMINA[45] rescoring. Benchmarking is shown for a selected assortment of results, but the full set of data is given in the Supporting Information for each software and test set in the form of linear (Figures S1–S5) and semilogarithmic ROC curves (Figures S6–S10). The figure was created using ROCKER0.1.4.[52]
Performance of Negative Image-Based Rescoringa
| GLIDE | DOCK | VINA | AUTODOCK | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PLANTS | HTVS | SP | GOLD | Def. | Opt. | Polar H’s | All H’s | Polar H’s | All H’s | ||
| RXRα | AUC | 0.68 ± 0.03 | 0.83 ± 0.02 | 0.45 ± 0.02 | 0.78 ± 0.02 | 0.80 ± 0.02 | |||||
| EFd 1% | 61.1 | 29.0 | 15.3 | 29.0 | 16.8 | ||||||
| EFd 5% | 49.6 | 74.0 | 26.7 | 32.8 | |||||||
| COX-2 | AUC | 0.66 ± 0.01 | 0.73 ± 0.01 | 0.74 ± 0.01 | 0.75 ± 0.01 | ||||||
| EFd 1% | 24.1 | 31.0 | 11.3 | 11.0 | |||||||
| EFd 5% | 34.5 | 32.0 | |||||||||
| PDE5 | AUC | 0.76 ± 0.01 | 0.62 ± 0.02 | 0.63 ± 0.02 | 0.70 ± 0.01 | ||||||
| EFd 1% | 10.6 | 4.0 | 6.8 | 8.8 | |||||||
| EFd 5% | 27.1 | 15.3 | 15.3 | 20.4 | |||||||
| MR | AUC | 0.48 ± 0.03 | 0.50 ± 0.03 | 0.47 ± 0.03 | |||||||
| EFd 1% | 7.4 | 6.4 | 0.0 | 0.0 | 2.1 | 3.2 | 8.5 | ||||
| EFd 5% | 18.1 | 13.8 | 22.3 | ||||||||
| NEU | AUC | ||||||||||
| EFd 1% | |||||||||||
| EFd 5% | |||||||||||
Bolded and in italics if better than the original docking results.
Figure 4Negative image-based rescoring refocuses neuraminidase docking. (A) Best poses of 10 top-ranked docked compounds (stick models), sampled and selected by docking algorithm PLANTS,[37] are relatively scattered in the partially open surface pocket of neuraminidase (NEU; opaque magenta surface). The cavity center (green sphere), which is the geometric centroid of the co-crystallized ligand BANA206 (PDB: 1B9V),[50] was used to center PLANTS docking and NIB model generation with PANTHER.[22] (B) The NIB model (transparent yellow surface), which was used in the cavity-based rescoring with ShaEP,[32] is shown with the centroid. (C) Best poses of 10 top-ranked docked compounds selected by negative image-based rescoring (R-NiB; Figure )[21] form a much tighter cluster than scattered ligands/poses selected at the top by default docking scoring of PLANTS. The figure was created using BODIL[35] and Visual Molecular Dynamics or VMD 1.9.2.[36] See Figure for more information.
Performance of SMINA Rescoringa
| GLIDE | DOCK | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| PLANTS | HTVS | SP | GOLD | Def. | Opt. | VINA | AUTODOCK | ||
| RXRα | AUC | 0.66 ± 0.03 | 0.77 ± 0.02 | 0.82 ± 0.01 | 0.82 ± 0.02 | ||||
| EFd 1% | 32.8 | 41.2 | 38.9 | 35.9 | |||||
| EFd 5% | 41.2 | 55.7 | 54.2 | 54.2 | |||||
| COX-2 | AUC | 0.66 ± 0.01 | 0.73 ± 0.01 | 0.77 ± 0.01 | |||||
| EFd 1% | 22.3 | 21.4 | 33.3 | ||||||
| EFd 5% | 43.0 | 47.8 | |||||||
| PDE5 | AUC | 0.70 ± 0.01 | 0.62 ± 0.02 | 0.66 ± 0.02 | 0.73 ± 0.01 | 0.66 ± 0.02 | |||
| EFd 1% | |||||||||
| EFd 5% | 25.6 | 20.6 | 21.4 | 30.4 | |||||
| MR | AUC | 0.51 ± 0.03 | 0.48 ± 0.03 | 0.48 ± 0.03 | 0.51 ± 0.03 | 0.46 ± 0.03 | 0.53 ± 0.03 | 0.53 ± 0.03 | |
| EFd 1% | 6.4 | 9.6 | 7.4 | 6.4 | |||||
| EFd 5% | 9.6 | 6.4 | 11.7 | 8.5 | 9.6 | 9.6 | |||
| NEU | AUC | 0.52 ± 0.03 | 0.46 ± 0.03 | 0.57 ± 0.03 | 0.60 ± 0.03 | 0.58 ± 0.03 | 0.60 ± 0.03 | 0.52 ± 0.03 | 0.47 ± 0.03 |
| EFd 1% | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| EFd 5% | 1.0 | 0.0 | 0.0 | 3.1 | 2.0 | 3.1 | 1.0 | 2.0 | |
Bolded and in italics if better than the original docking results. Underlined if also better than the negative image-based rescoring (R-NIB; Figure ) protocol.
Root-Mean-Square Deviation: Docking and Rescoring vs X-ray Crystallography
| RMSd < 1.0 Å | RMSd < 2.0 Å | RMSd < 3.0 Å | ||||||
|---|---|---|---|---|---|---|---|---|
| Docking software | Scoring method | Docked/co-crystallized | Best scored | All | Best scored | All | Best scored | All |
| PLANTS | default | 31/31 | 8 | 15 | 15 | 27 | 29 | |
| R-NiB | ||||||||
| SMINA | 5 | 8 | 12 | |||||
| GLIDE HTVS | default | 26/31 | 9 | 11 | 12 | 17 | 17 | 20 |
| R-NiB | ||||||||
| SMINA | 8 | 12 | 16 | |||||
| GLIDE SP | default | 29/31 | 16 | 17 | 21 | 21 | 24 | |
| R-NiB | 11 | |||||||
| SMINA | 11 | 16 | ||||||
| GOLD | default | 31/31 | 12 | 15 | 17 | 23 | 19 | 24 |
| R-NiB | 11 | |||||||
| SMINA | 17 | 21 | ||||||
| DOCK def. | default | 24/31 | 7 | 13 | 11 | 14 | 12 | 15 |
| R-NiB | 7 | 10 | 10 | |||||
| SMINA | 7 | |||||||
| DOCK opt. | default | 31/31 | 12 | 19 | 22 | |||
| R-NiB | 14 | |||||||
| SMINA | 6 | 13 | ||||||
| VINA | default | 31/31 | 12 | 26 | 28 | |||
| R-NiB | 17 | 18 | ||||||
| SMINA | 7 | 9 | 12 | |||||
| AUTODOCK | default | 31/31 | 13 | 21 | 19 | 24 | ||
| R-NiB | 9 | |||||||
| SMINA | 8 | 13 | 16 | |||||
Number of docked active ligands with known poses from the X-ray crystallographic studies.
Root-Mean-Square deviation (RMSd) value of the best pose suggested by the docking software, negative image-based rescoring (R-NiB), or SMINA.
Best RMSd value, if all docking poses are compared against the co-crystallized ligand conformer. The scoring results with most matches with the verified poses are bolded. The representative PDB codes for the used X-ray crystal structures were the following: 4K6I, 1FM9, 1RDT, 1MVC, 4K4J, and 3A9E for the retinoid X receptor alpha (RXRα); 4PH9, 4M11, 3QMO, 5KIR, 1PXX, 3NT1, 5JVZ, and 4COX for cyclooxygenase-2 (COX-2); 3TGE, 3HC8, 1XOZ, and 1TBF for phosphodiesterase-5 (PDE5); 5MWY, 3VHU, 2AA5, 2AA2, and 4UDA for the mineralocorticoid receptor (MR), and 1B9V, 1XOG, 2QWE, 1A4Q, 2QWG, 1LTF, and 6HCX for neuraminidase (NEU).