| Literature DB >> 25941465 |
Zuojun Guo1, Bo Li, Li-Tien Cheng, Shenggao Zhou, J Andrew McCammon, Jianwei Che.
Abstract
Protein–ligand binding is a key biological process at the molecular level. The identification and characterization of small-molecule binding sites on therapeutically relevant proteins have tremendous implications for target evaluation and rational drug design. In this work, we used the recently developed level-set variational implicit-solvent model (VISM) with the Coulomb field approximation (CFA) to locate and characterize potential protein–small-molecule binding sites. We applied our method to a data set of 515 protein–ligand complexes and found that 96.9% of the cocrystallized ligands bind to the VISM-CFA-identified pockets and that 71.8% of the identified pockets are occupied by cocrystallized ligands. For 228 tight-binding protein–ligand complexes (i.e, complexes with experimental pKd values larger than 6), 99.1% of the cocrystallized ligands are in the VISM-CFA-identified pockets. In addition, it was found that the ligand binding orientations are consistent with the hydrophilic and hydrophobic descriptions provided by VISM. Quantitative characterization of binding pockets with topological and physicochemical parameters was used to assess the “ligandability” of the pockets. The results illustrate the key interactions between ligands and receptors and can be very informative for rational drug design.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25941465 PMCID: PMC4410907 DOI: 10.1021/ct500867u
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.006
Top 10 Functional Types of Proteins Studied in the Data Set Containing 515 Complexes
| functional type | count (PDB) |
|---|---|
| hydrolase | 298 |
| transferase | 26 |
| chaperone | 21 |
| transporter | 20 |
| isomerase | 17 |
| transcription factor | 15 |
| oxidoreductase | 12 |
| aspartyl protease | 11 |
| hormone receptor | 7 |
| lyase | 4 |
Figure 1Definition of the binding pocket region by the differences between the molecular surface (gray) and VISM surfaces obtained using a loose initial surface.
Figure 2(a) Identified pockets (PDB entry 1sqn), in which the target protein is shown as a gray-colored cartoon, the ligand as sticks colored by atom name, and the identified pockets as transparent surfaces. The primary pocket is shown in red, the secondary pocket in blue, the tertiary pocket in green, and the last pocket in yellow. The cocrystallized ligand binds to the primary pocket. (b) Hydrophilic regions for the identified primary binding pocket and overlapping with the cocrystallized ligand are shown in blue wireframe. The top-right region is occupied by the ligand hydroxyl group (−OH), and the bottom one is occupied by a carbonyl group (C=O). (c) Topological and energetic parameters for the identified protein pockets.
Binding Site Identification Performance of VISM-CFA in 515 Target Proteins
| result | set of 515
proteins | set of 228 tight binders |
|---|---|---|
| deepest pocket (primary) | 92.4 | 96.9 |
| largest pocket | 90.7 | 94.2 |
| best optimal binding affinity (OBA) pocket | 91.1 | 97.0 |
| both deepest and largest pocket | 88.3 | 93.0 |
| identified pocket site | 96.9 | 99.1 |
All 515 proteins contained a cocrystallized ligand.
Subset of 228 proteins that contain a ligand with experimental pKd larger than 6.
Data for the 16 Ligand–Protein Complexes in Which the Cocrystallized Ligand Does Not Bind to the VISM-CFA-Identified Pocket
| case | PDB ID | target | ligand binding mode |
|---|---|---|---|
| 1 | 1l83 | T4 lysozyme | small ligand bound to deep buried cavity |
| 2 | 220l | T4 lysozyme | small ligand bound to deep buried cavity |
| 3 | 1oss | small ligand bound to deep buried cavity | |
| 4 | 1gtb | glutathione S-transferase (GST) | charged ligand bound to shallow pocket |
| 5 | 2r5a | MBT repeats of sex comb on midleg (Scm) | charged ligand bound to shallow pocket |
| 6 | 2vyt | MBT repeats of sex comb on midleg (Scm) | charged ligand bound to shallow pocket |
| 7 | 2bt9 | lectin from | charged ligand bound to shallow pocket |
| 8 | 2jkj | adhesin subunit (DraE/AfaE) | charged ligand bound to shallow pocket |
| 9 | 1hwi | HMG-COA reductase | charged ligand bound to shallow pocket |
| 10 | 1fao | pleckstrin homology domains | Tetrakis(phosphate) as the ligand (strong charge) |
| 11 | 1o9d | 14-3-3 protein | non-natural peptide as the ligand |
| 12 | 2itk | human pin1 | non-natural peptide as the ligand |
| 13 | 1m48 | interleukin-2 | non-natural peptide as the ligand |
| 14 | 1nlj | human cathepsin K | non-natural peptide as the ligand |
| 15 | 1fzq | ADP-ribosylation factor-like protein 3 | nucleic acid base with phosphate as the ligand |
| 16 | 1t03 | HIV-1 nucleotide RT | nucleic acid base with phosphate as the ligand |
Figure 3Comparison between the topological characters of ligand-occupied and unoccupied pockets.
Figure 4Comparison between physicochemical characters of ligand-occupied and unoccupied pockets.
Figure 5(a, b) VISM-CFA-identified pockets for “closed” and “open” conformations of HSP90. The target protein is shown as a gray-colored molecular surface, and the ligand is shown in the ball-and-stick representation. The identified pockets are enclosed by transparent red surfaces. (c, d) Hydrophilic regions are depicted with blue wireframe for the identified primary binding pocket together with the cocrystallized ligand.
Parameters for the “Closed” and “Open” Binding Pockets of HSP90
| pocket
characteristic parameters obtained from VISM-CFA analysis | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| geometrical
characters | physiochemical
characters | ||||||||
| PDB ID | conformation | exptl | depth (Å) | SASA (Å2) | volume (Å3) | shape (principal moments of inertia) | hydrophobic fraction (%) | optimal binding affinity (kcal/mol) | dehydration penalties (vdW, surf, elec, tot) (kcal/mol) |
| 2qfo | closed | 18 ± 1 | 5 | 100.0 | 348.2 | (0.43, 0.72, 1) | 94.7 | –13.5 | (3.3, −16.0, 10.7, −2.1) |
| 2qg2 | open | 4 ± 1 | 9 | 173.5 | 640 | (0.51, 0.95, 1) | 81.6 | –18.5 | (6.1, −29.1, 13.6, −9.4) |
Topological and Energetic Information for VISM-CFA-Identified Pockets for the 27 Targets Classified As “Druggable”, “Difficult”, and “Undruggable” by Cheng et al.[30]
| pocket
characteristic parameters obtained from VISM-CFA analysis | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| knowledge-based | geometrical
characters | physiochemical
characters | |||||||
| PDB ID | target | druggability | depth (Å) | SASA (Å2) | volume (Å3) | shape (principal moments of inertia) | hydrophobic fraction (%) | optimal binding affinity (kcal/mol) | dehydration penalties (vdW, surf, elec, tot) (kcal/mol) |
| 1rth | HIV RT (NNRTI) | druggable | 6.8 | 575.3 | 1799.2 | (0.21, 0.92, 1) | 81.9 | –58.7 | (20.8, −85.7, 40.9, −23.9) |
| 1hvr | HIV-1 protease | druggable | 6.4 | 211.5 | 571.4 | (0.33, 0.85, 1) | 92.7 | –30.7 | (6.5, −34.5, 12.3, −15.7) |
| 1m17 | EGFR kinase | druggable | 8.8 | 427.8 | 1416.7 | (0.54, 0.75, 1) | 68.7 | –29.9 | (17.0, −52.9, 76.3, 40.4) |
| 1hwi | HMG CoA reductase | druggable | 3.8 | 270.7 | 787.5 | (0.32, 0.85, 1) | 88.6 | –25.9 | (11.9, −36.5, 10.5, −14.1) |
| 4cox | cyclooxygenase 2 | druggable | 12.0 | 337.1 | 1097.2 | (0.53, 0.82, 1) | 74.1 | –25.0 | (14.6, −47.1, 22.8, −9.7) |
| 1udt | PDE 5A | druggable | 10.2 | 260.6 | 1009.2 | (0.56, 0.63, 1) | 50.8 | –24.4 | (11.1, −46.2, 56.3, 21.2) |
| 1c14 | enoyl reductase | druggable | 8.6 | 243.0 | 817.2 | (0.39, 0.86, 1) | 70.9 | –22.6 | (11.1, −35.5, 22.8, −1.6) |
| 1ke9 | CDK2 | druggable | 7.6 | 193.6 | 801.3 | (0.47, 0.85, 1) | 74.9 | –21.7 | (7.3, −34.6, 36.5, 9.2) |
| 1oyn | PDE 4D | druggable | 9.2 | 341.8 | 1023.0 | (0.55, 0.76, 1) | 51.8 | –18.9 | (14.071, −46.0, 63.1, 31.2) |
| 1kv1 | P38 kinase | druggable | 8.8 | 335.2 | 1230.3 | (0.37, 0.98, 1) | 32.9 | –17.9 | (13.9, −58.6, 89.0, 44.4) |
| 1kij | DNA gyrase B | druggable | 8.0 | 200.8 | 728.6 | (0.42, 0.81, 1) | 52.4 | –13.0 | (8.1, −37.9, 36.9, 7.1) |
| 1iep | cAbl kinase | druggable | 9.6 | 302.4 | 1207.3 | (0.63, 0.74, 1) | 32.3 | –10.1 | (11.7, −51.6, 81.6, 41.6) |
| 1pwl | aldose reductase | druggable | 8.8 | 139.3 | 587.8 | (0.18, 0.96, 1) | 42.2 | –8.8 | (6.7, −29.8, 15.5, −7.6) |
| 1rv1 | MDM2 | druggable | 3.8 | 66.8 | 218.6 | (0.31, 0.94, 1) | 90.2 | –8.5 | (2.4, −10.6, 1.8, −6.4) |
| 1ea1 | fungal Cyp51 | druggable | 10.2 | 223.5 | 973.8 | (0.60, 0.80, 1) | 33.9 | –7.7 | (9.9, −40.1, 37.8, 7.6) |
| 1gpk | acetylcholinesterase | druggable | 11.0 | 162.4 | 583.7 | (0.32, 0.89, 1) | 42.4 | –7.7 | (7.9, −27.9, 16.0, −3.9) |
| 1ezq | factor Xa | druggable | 5.5 | 93.9 | 353.8 | (0.34, 0.88, 1) | 49.7 | –5.4 | (3.9, −19.5, 13.5, −2.1) |
| 1o86 | ACE-1 | difficult | 14.0 | 1345.4 | 4328.5 | (0.19, 0.88, 1) | 41.1 | –52.2 | (62.5, −188.9, 261.4, 135.0) |
| 1nf7 | IMPDH | difficult | 9.2 | 425.0 | 1276.4 | (0.31, 0.92, 1) | 68.9 | –26.4 | (20.9, −49.0, 44.4, 16.4) |
| 1t03 | HIV RT (nucleotide) | difficult | 2.2 | 156.4 | 499.7 | (0.21, 0.98, 1) | 93.0 | –18.3 | (6.5, −20.9, 8.9, −5.5) |
| 1ktt | thrombin | difficult | 3.8 | 106.4 | 417.3 | (0.21, 0.87, 1) | 65.8 | –12.0 | (4.4, −21.4, 12.4, −4.6) |
| 1qmf | penicillin binding protein | difficult | 9.4 | 206.1 | 789.5 | (0.25, 0.85, 1) | 43.2 | –11.6 | (9.0, −16.8, 43.8, 35.9) |
| 1a4q | neuraminidase | difficult | 2.4 | 66.3 | 225.3 | (0.41, 0.98, 1) | 19.4 | –1.6 | (3.4, −9.9, 14.3, 7.9) |
| 1g1f | PTP-1B | undruggable | 4.0 | 31.0 | 138.8 | (0.41, 0.78, 1) | 5.5 | –0.4 | (1.2, −8.5, 9.5, 2.2) |
| 1nlj | cathepsin K | undruggable | 1.0 | 23.0 | 87.0 | (0.43, 0.88, 1) | 28.7 | –0.6 | (0.1, −0.8, 10.0, 9.1) |
| 1bmq | caspase 1 (ICE-1) | undruggable | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
| 1qs4 | HIV integrase | undruggable | N/A | N/A | N/A | N/A | N/A | N/A | N/A |
Figure 6VISM-CFA pocket prediction (red transparent isosurface) vs small-molecule inhibition of the p53–MDM2 PPI. The p53–MDM2 protein–protein complex (gray and orange, PDB entry 1ycr) is superimposed on the protein–inhibitor complex (tan for protein, red for oxygen, blue for nitrogen, cyan for carbon, and pink for bromine; PDB entry 1rv1).