| Literature DB >> 20404926 |
Niu Huang1, Matthew P Jacobson.
Abstract
The accurate prediction of protein druggability (propensity to bind high-affinity drug-like small molecules) would greatly benefit the fields of chemical genomics and drug discovery. We have developed a novel approach to quantitatively assess protein druggability by computationally screening a fragment-like compound library. In analogy to NMR-based fragment screening, we dock approximately 11,000 fragments against a given binding site and compute a computational hit rate based on the fraction of molecules that exceed an empirically chosen score cutoff. We perform a large-scale evaluation of the approach on four datasets, totaling 152 binding sites. We demonstrate that computed hit rates correlate with hit rates measured experimentally in a previously published NMR-based screening method. Secondly, we show that the in silico fragment screening method can be used to distinguish known druggable and non-druggable targets, including both enzymes and protein-protein interaction sites. Finally, we explore the sensitivity of the results to different receptor conformations, including flexible protein-protein interaction sites. Besides its original aim to assess druggability of different protein targets, this method could be used to identifying druggable conformations of flexible binding site for lead discovery, and suggesting strategies for growing or joining initial fragment hits to obtain more potent inhibitors.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20404926 PMCID: PMC2852417 DOI: 10.1371/journal.pone.0010109
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Targets, binding sites, and hit rate data from NMR-based fragment screening and two different computational models.
| Target | Binding site | PDB ID | NMR-based Screening | Model of Hajduk et al. | Virtual Fragment ScreeningLog (Hit Rate) |
| AK | Adenosine | 1lii | −0.66 | −0.42 | 0.82 |
| Akt-PH | IP3 | 1h10 | −1.91 | −1.98 | −0.51 |
| Bcl-XL | Bak | 1bxl (1ysn) | −0.11 | −0.64 | 0.86 (0.88) |
| Bir3 | Peptide | 1g3f | −1.03 | −0.72 | −0.62 |
| CMPK | CMP | 1q3t | −1.13 | −0.72 | 0.12 |
| E2-31 | DNA | 1dhm | −0.71 | −0.72 | −1.05 |
| ErmAM | SAH | 1qam | −1.01 | −0.87 | 1.04 |
| FBP | DNA | 1j4w | −1.61 | −1.04 | −0.28 |
| FKBP | FK506 | 1fkj | −0.03 | −0.24 | 0.66 |
| FKBP | 2nd site | 1fkj | −1.24 | −1.22 | −0.22 |
| HI-0065 | ADP | 1fl9 | −0.82 | −1.28 | 0.59 |
| LCK | pTyr | 1lkl | −0.21 | −1.07 | −0.67 |
| LFA | IDAS | 1rd4 | −0.40 | −0.35 | 0.74 |
| MDM2 | P53 | 1rv1 (1ycr) | −0.49 | −0.35 | 0.92 (0.45) |
| MurI | Glu | 1zuw | −1.93 | −2.00 | −0.12 |
| PAK4 | ATP | 2cdz | −0.78 | −0.63 | 0.85 |
| PDZ-PSD95 | Peptide | 1iu0 | −2.00 | −1.99 | −0.60 |
| Pin1 | Peptide | 1i8h | −0.94 | −1.49 | −0.05 |
| PTP1B | Catalytic pTyr | 1ph0 | −0.68 | −1.15 | 0.72 |
| PTP1B | Noncatalytic pTyr | 1ph0 | −1.77 | −1.66 | −0.31 |
| SARS N-term | RNA | 1ssk | −1.93 | −1.92 | −0.03 |
| SCD | Substrate | 1g4k | −0.09 | −0.55 | 0.51 |
| Survivin | Bir3 | 1e31 | −1.97 | −1.99 | −0.37 |
| UK | Peptide | 1fv9 | −0.40 | −0.81 | 1.46 |
For the NMR-based screening results and the predictive model of Hajduk et al., druggable is defined as log (Hit Rate) >−1.0 and non-druggable as log (Hit Rate) ≤−1.0; the corresponding cut-off in our virtual fragment screening model is 0.36.
from Reference [27].
Three outliners (Bir3, E2–31 DNA site and LCK pTyr binding site) identified in our study.
Structures were not reported in the Hajduk et al. dataset.
For Bcl-xl and MDM2, we used two structures, peptide-bound (1bxl and 1ycr, respectively) and small ligand bound (1ysn and 1rv1, respectively).
Figure 1Correlation between the experimental NMR hit rate and our calculated druggability score (red line) for 21 binding sites as described in the text, comparing to the Hajduk et al. predictive model for these 21 sites (blue line).
Note that three outliers in our druggability calculation were excluded in regression analysis, and are only labelled here for visualization purpose.
Figure 2ROC curve plotting the false positive rate vs the true positive rate as a function of the score used for differentiating druggable vs non-druggable binding sites in the external dataset of Hajduk et al.
The values for the default cutoff scores are marked with a solid circle and shown in parentheses. For this analysis, true positives were defined as the 35 binding sites with known high affinity ligands, while true negatives were the remaining 37 binding sites.
Druggability score calculated on 15 well-known drug targets.
| Drug Target | PDB ID | RMSDave (Å) | RMSDmax (Å) | Log (Hit Rate) | Drug Target | PDB ID | RMSDave (Å) | RMSDmax (Å) | Log (Hit Rate) |
| ACE | 1uze | 0.60 | HIVRT | 1vrt | 1.66 | ||||
| 1o86 | 0.17 | 0.37 | 0.45 | 1rt1 | 1.51 | 2.45 | 1.75 | ||
| 1uzf | 0.35 | 0.79 | 0.69 | 1c1c | 1.88 | 3.12 | 1.61 | ||
| Alr2 | 1ah0 | 1.42 | 1rth | 1.62 | 2.28 | 1.61 | |||
| 1ah3 | 1.06 | 3.19 | 1.27 | HMGR | 1hw8 | 1.39 | |||
| 2acr | 0.88 | 1.72 | 1.10 | 1hwk | 0.61 | 1.49 | 1.31 | ||
| CDK2 | 1aq1 | 1.32 | NA | 1a4g | 0.57 | ||||
| 1buh | 1.77 | 3.20 | 1.44 | 1a4q | 0.48 | 2.11 | 0.52 | ||
| 1dm2 | 1.75 | 4.49 | 1.62 | 1nsc | 0.34 | 1.49 | 0.52 | ||
| COX-2 | 1cvu | 1.51 | P38 MAPK | 1a9u | 1.00 | ||||
| 1cx2 | 1.24 | 3.78 | 1.53 | 1kv1 | 3.84 | 10.41 | 1.16 | ||
| 3pgh | 1.11 | 3.96 | 1.64 | 1kv2 | 3.54 | 11.26 | 1.61 | ||
| DHFR | 3dfr | 1.01 | PDE5 | 1xoz | 1.18 | ||||
| 6dfr | 1.47 | 1.96 | 1.02 | 1xp0 | 0.79 | 2.23 | 1.24 | ||
| ER | 1l2i | 1.69 | PPARg | 1fm6 | 1.46 | ||||
| 3ert | 2.61 | 4.47 | 1.55 | 1fm9 | 1.47 | 4.64 | 1.62 | ||
| 1err | 2.01 | 4.39 | 1.61 | 2prg | 0.71 | 1.27 | 1.43 | ||
| Fxa | 1f0r | 1.64 | Thrombin | 1ba8 | 1.53 | ||||
| 1fjs | 1.09 | 2.57 | 1.59 | 1hgt | 0.69 | 1.85 | 1.55 | ||
| 1ksn | 0.67 | 1.65 | 1.59 | TK | 1kim | 1.58 | |||
| 1xka | 1.27 | 2.46 | 1.56 | 1ki4 | 1.78 | 2.90 | 1.40 |
RMSDave was defined as the sidechain RMSD based on binding site residues within a cutoff distance of 4.5 Å from crystallographic ligands; RMSDmax is defined as the largest sidechain RMSD value among all the binding site residues.
Structures used in the induced fit docking dataset of Sherman et al. [35].
Apo structure, the rest are all holo structures.
ACE, angiotensin-converting enzyme; ALR2, aldose reductase; CDK2, cyclin-dependent kinase 2; COX-2, cyclooxygenase-2; DHFR, dihydrofolate reductase; ER, estrogen receptor; FXa, factor Xa; HIVRT, HIV reverse transcriptase; HMGR, hydroxymethylglutaryl-CoA reductase; NA, neuraminidase; P38 MAPK, P38 mitogen activated protein kinase; PDE5, phosphodiesterase 5; PPARg, peroxisome proliferator activated receptor gamma; TK, thymidine kinase.
Druggability scores calculated for 6 targets involved in protein-protein interactions.
| PPI Target | PDB ID | RMSDave. (Å ) | RMSDmax. (Å ) | Log (Hit Rate) | PPI Target | PDB ID | RMSDave. (Å ) | RMSDmax. (Å ) | Log (Hit Rate) |
| IL-2 | 1z92 | 0.13 | MDM2 | 1ycr | 0.45 | ||||
| 1py2 | 2.59 | 5.80 | 0.62 | 1rv1 | 1.82 | 3.32 | 0.92 | ||
| 1m48 | 2.51 | 4.57 | 0.62 | 1t4e | 1.57 | 2.91 | 0.66 | ||
| BCL-XL | 2bzw | 1.04 | HPV E2 | 1tue | −0.24 | ||||
| 2yxj | 2.54 | 6.16 | 0.84 | 1r6n | 2.80 | 4.32 | 1.02 | ||
| TNF | 1tnf | 0.95 | ZipA | 1f47 | −0.02 | ||||
| 2az5 | 2.90 | 5.65 | 0.96 | 1y2f | 0.59 | 1.26 | −0.10 |
Reference structure bound with protein or peptide substrate; the remaining structures contain small molecule ligands.
Figure 3Conformational changes in IL-2.
(A) IL-2 holo conformation bound with the co-crystallized ligand FRH (1py2). (B) The same ligand is superimposed on the apo conformation of the protein (1z92), highlighting the conformational changes. Molecular images were generated with UCSF Chimera [42].
Figure 4Top ranked fragments from the virtual screen mimic portions of a known potent PTP1B inhibitor.
(A) A co-crystallized ligand (stick) is shown bound to PTP1B (1ph0), and extends across both the catalytic and non-catalytic sites. The key hydrogen bonding interactions between the ligands and the binding site residues are illustrated with yellow lines. Different portions of the ligand are colored for comparison with the fragments in (B). (B) Two high-ranking fragments from virtual screening. One predicted heterocyclic carboxylic acid (carbon atoms colored green, rank 49) is shown bound to the PTP1B catalytic site, and one neutral methyl salicylate hit (carbon atoms colored magenta, rank 59) is shown bound to the non-catalytic site. Molecular images were generated with UCSF Chimera [42].
Figure 5Top ranked fragments from the virtual screen mimic portions of known inhibitors of p38 MAP kinase.
(A) The high affinity inhibitor BIRB796 (stick) is shown bound to p38 MAP kinase (1kv2). Key hydrogen bonding interactions—between the morpholino group and the main chain amide of residue Met109, the urea group and the side chain carboxylate group of conserved residue Glu71, and the main chain amide of residue Asp168—are illustrated with yellow lines. Portions of the ligand are colored for comparison with fragments in panels (C) and (D). (B) A low-affinity ligand BMU (stick) is shown bound to the allosteric pocket (1kv1). (C) Three partially overlapping top fragment hits (stick) identified from virtual screening against the 1kv2 structure are shown: a pyridinyl-imidazole type of fragment (carbon atoms colored green, rank 155 in the virtual screen) bound to the ATP binding pocket; a urea-like moiety on a substituted naphthyl ring (carbon atoms colored cyan, rank 55) interacting with the Glu71 sidechain and the lipophilic pocket; and a substituted heterocyclic ring (carbon atoms colored magenta, rank 165) deeply buried into the allosteric binding pocket. (D) The overlap of three top scored fragment hits identified from virtual screening against the 1kv1 structure (ranks 6, 136 and 210). Molecular images were generated with UCSF Chimera [42].
Figure 6A virtual fragment screening protocol for druggability assessment.