| Literature DB >> 28235019 |
Akshay Sridhar1, Gregory A Ross1, Philip C Biggin1.
Abstract
Water is often found to mediate interactions between a ligand and a protein. It can play a significant role in orientating the ligand within a binding pocket and contribute to the free energy of binding. It would thus be extremely useful to be able to accurately predict the position and orientation of water molecules within a binding pocket. Recently, we developed the WaterDock protocol that was able to predict 97% of the water molecules in a test set. However, this approach generated false positives at a rate of over 20% in most cases and whilst this might be acceptable for some applications, in high throughput scenarios this is not desirable. Here we tackle this problem via the inclusion of knowledge regarding the solvation structure of ligand functional groups. We call this new protocol WaterDock2 and demonstrate that this protocol maintains a similar true positive rate to the original implementation but is capable of reducing the false-positive rate by over 50%. To improve the usability of the method, we have also developed a plugin for the popular graphics program PyMOL. The plugin also contains an implementation of the original WaterDock.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28235019 PMCID: PMC5325533 DOI: 10.1371/journal.pone.0172743
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The H-bond saturation limit enforced on the various functional groups implemented in the WaterDock 2.0 pipeline.
| Carbonyl | 2 |
| Carboxyl | 2 |
| Cyano | 1 |
| Imine | 1 |
| Nitro | 2 |
| Amine | No. of H |
| Sulfonyl | 2 |
| Phosphoryl | 2 |
| Hydroxyl | 3 |
| Ether | 2 |
| Halogen | 1 |
The dataset used to validate the ligand-directed Waterdock 2.0 algorithm.
| HIV-1 | 3FXS, 2ZYE | KNI-272 | 0.93, 1.9 | 8 |
| GluR2 | 1FTM | AMPA | 1.7, 1.8 | 18 |
| Trypsin | 2AH4, 3RJX | GBS | 1.13, 1.7 | 8 |
| GST | 1K3Y | GTX | 1.3, 1.5 | 22 |
| HSP90 | 2BRC, 2BT0 | CT5 | 1.6, 1.9 | 6 |
| PIM1 | 1XWS, 2BIK | RBT205 | 1.8, 1.8 | 4 |
| Bromodomain | 3ZYU | I-BET | 1.5, 1.6 | 6 |
| Androgen Receptor | 4OHA, 2AX6 | Hydroxyflutamide | 1.42, 1.5 | 4 |
| Casein Kinase II | 3BQC, 3Q9W | Emodin | 1.5, 1.7 | 4 |
| Thrombin | 4CH2 | 0G6 | 1.6, 1.69 | 14 |
| Carbonic Anhydrase | 3HS4, 3V2M, 3DC3 | Acetazolamide | 1.1, 1.47, 1.7 | 3 |
a—Structures where multiple chains were overlaid to validated waters.
The results of the validation dataset of eleven protein holo-structures.
| HIV-1 | 8 | 8 | 8 | 0 |
| GluR2 | 18 | 18 | 18 | 0 |
| Trypsin | 8 | 5 | 5 | 0 |
| GST | 22 | 22 | 18 | 4 |
| HSP90 | 6 | 7 | 6 | 1 |
| PIM1 | 4 | 5 | 4 | 1 |
| Bromodomain | 6 | 6 | 6 | 0 |
| Androgen Receptor | 4 | 4 | 4 | 0 |
| CK II | 4 | 5 | 3 | 2 |
| Thrombin | 14 | 13 | 11 | 2 |
| Carbonic Anhydrase | 3 | 3 | 3 | 0 |
| TOTAL | 97 | 96 | 86 | 10 |
The results of the OppA dataset of structures used to allow comparison of new prediction protocol to AcquaAlta and the original WaterDock methodologies.
| 1JET | 7 | 5 | 6 | 0 |
| 1JEU | 9 | 7 | 8 | 1 |
| 1JEV | 6 | 5 | 5 | 0 |
| 1B4Z | 10 | 7 | 9 | 1 |
| 1B5I | 7 | 5 | 7 | 1 |
| 1B32 | 7 | 5 | 6 | 0 |
| 1B3F | 7 | 5 | 6 | 1 |
| 1B46 | 6 | 4 | 6 | 0 |
| 1B51 | 9 | 7 | 8 | 0 |
| 1B58 | 7 | 5 | 6 | 0 |
| 1B5J | 10 | 8 | 8 | 1 |
| 1B9J | 6 | 5 | 6 | 1 |
| 1QKA | 6 | 6 | 6 | 1 |
| 1QKB | 6 | 4 | 5 | 1 |
| Total | 103 | 78 | 92 | 8 |