| Literature DB >> 29800865 |
Sandor Vajda1, Dmitri Beglov2, Amanda E Wakefield3, Megan Egbert2, Adrian Whitty4.
Abstract
Many proteins in their unbound structures lack surface pockets appropriately sized for drug binding. Hence, a variety of experimental and computational tools have been developed for the identification of cryptic sites that are not evident in the unbound protein but form upon ligand binding, and can provide tractable drug target sites. The goal of this review is to discuss the definition, detection, and druggability of such sites, and their potential value for drug discovery. Novel methods based on molecular dynamics simulations are particularly promising and yield a large number of transient pockets, but it has been shown that only a minority of such sites are generally capable of binding ligands with substantial affinity. Based on recent studies, current methodology can be improved by combining molecular dynamics with fragment docking and machine learning approaches.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29800865 PMCID: PMC6088748 DOI: 10.1016/j.cbpa.2018.05.003
Source DB: PubMed Journal: Curr Opin Chem Biol ISSN: 1367-5931 Impact factor: 8.822
Figure 1Types of cryptic sites. In all cases the bound structure is colored orange, the bound ligand is shown as yellow sticks, and the unbound structure superimposed over the bound one is colored cyan. PDB codes are shown in parenthesis. (a) Unbound (1W50) and ligand-bound (3IXJ) structures of beta-secretase 1 (BACE-1) protease, the latter co-crystallized with a isophthalamide inhibitor. In the unbound structure the binding pocket is too open, and the loop is closing down on the ligand upon binding. (b) Unbound (2GFC) and ligand-bound (2JDS) structures of cAMP-dependent protein kinase, the latter co-crystallized with the ATP-competitive inhibitor A-443654. In the unbound structure the activation loop protrudes into the binding site and would clash with the inhibitor superimposed from the bound structure. (c) Distributions of druggability scores in the unliganded structures of BACE-1 protease (homologs of 1W50) and cAMP-dependent protein kinase (homologs of 2GFC). Based on the druggability score, in the BACE-1 protease the pockets are almost evenly distributed between conformations resembling the unbound and bound forms. In contrast, in the cAMP-dependent protein kinase unbound-like conformations dominate. (d) Unbound (1JWP) and ligand-bound (1PZO) structures of TEM β-lactamase. The bound structure 1PZO includes two small inhibitor molecules bound between helices 11 and 12.
Proteins in the CryptoSite set with validated high affinity cryptic sites.
| Name | Unbound | Bound | Lig. | Affinity, nM | R-L distance | Type | |
|---|---|---|---|---|---|---|---|
| Bound | Unb. | ||||||
| mRNA-decapping enzyme DcpS | 3BL9B | 3BL7A | DD1 | IC50 = 7.6 | 2.68 | 0.57 | Allo |
| Hepatitis C virus polymerase | 3CJ0A | 2BRLA | POO | IC50 = 18 | 2.57 | 0.89 | Allo |
| Hepatitis C virus polymerase | 3CJ0A | 3FQKB | 79Z | IC50 = 81 | 2.63 | 1.15 | Allo |
| Tyrosine kinase domain of C-MET | 1R1WA | 3F82A | 353 | IC50 = 4.6 | 2.81 | 0.62 | Allo |
| TetR-like transcriptional regulator | 2WGBA | 2V57A | PRL | KD = 79 | 2.84 | 0.48 | Allo |
| Angiopoietin-1 receptor | 1FVRA | 2OO8X | RAJ | IC50 = 1 | 2.93 | 0.45 | Allo |
| Nicotinic acetylcholine receptor | 3PEOG | 2BYSJ | LOB | KD = 0.3 | 2.41 | 2.04 | Allo |
| Biotin carboxylase | 1BNCB | 2V5AA | LZL | IC50 = 150 | 2.73 | 2.55 | Allo |
| Staphylococcal nuclease | 1TQOA | 1TR5A | THP | Ki ∼ 100 | 2.84 | 2.33 | Ortho |
| DXP reductoisomerase | 1K5HC | 2EGHB | FOM | Ki = 38 | 2.49 | 1.69 | Ortho |
| Glutamate Racemase (GluR) | 2OHGA | 2OHVA | NHL | Ki = 16 | 2.46 | 0.70 | Ortho |
| SARS-CoV main protease | 1UK2A | 2GZ7A | D3F | IC50 = 300 | 2.71 | 1.56 | Ortho |
| Serotonin N-acetyltransferase | 1B6BA | 1KUVA | CA5 | Ki = 22 | 2.47 | 0.64 | Ortho |
| Coagulation factor VII zymogen | 1JBUH | 1WUNH | 5B | IC50 = 62 | 2.74 | 2.24 | Ortho |
| NPC2 lysosomal protein | 1NEPA | 2HKAC | C3S | KD = 30–50 | 3.47 | 0.59 | Ortho |
| Hsp90 (heat shock protein 90) | 2QFOB | 2WI7A | 2 K L | IC50 = 58 | 2.83 | 2.23 | Ortho |
| Integrin alpha-L | 3F74C | 3BQMC | BQM | IC50 = 2 | 2.75 | 1.94 | Ortho |
| Interleukin-2 | 1Z92A | 1PY2A | FRH | IC50 = 60 | 2.81 | 1.39 | PPI |
| Bcl-xL | 3FDLA | 2YXJA | N3C | Ki = 0.5 | 3.07 | 2.27 | PPI |
Protein Data Bank ID of the unbound structure with chain identifier added to the 4-letter PDB ID.
Protein Data Bank ID of the bound structure with chain identifier added to the 4-letter PDB ID.
3-letter code of the ligand in the bound structure.
Affinity as reported.
Shortest distance between any atom of the protein and any atom of the ligand in the bound structure.
The maximum of the shortest distances between any atom of the protein and any atom of the ligand superimposed from the bound structure, maximized over all unbound structures. If this distance is less than the distance in the previous column, then the ligand would clash with the protein in all unbound structures, thus the pocket is not formed without ligand binding.
Type of binding site: Allo – allosteric, ortho – orthosteric (primary binding), PPI – protein–protein interaction inhibitor.