| Literature DB >> 23672464 |
Sarah R Langdon1, Isaac M Westwood, Rob L M van Montfort, Nathan Brown, Julian Blagg.
Abstract
We describe and apply a scaffold-focused virtual screen based upon scaffold trees to the mitotic kinase TTK (MPS1). Using level 1 of the scaffold tree, we perform both 2D and 3D similarity searches between a query scaffold and a level 1 scaffold library derived from a 2 million compound library; 98 compounds from 27 unique top-ranked level 1 scaffolds are selected for biochemical screening. We show that this scaffold-focused virtual screen prospectively identifies eight confirmed active compounds that are structurally differentiated from the query compound. In comparison, 100 compounds were selected for biochemical screening using a virtual screen based upon whole molecule similarity resulting in 12 confirmed active compounds that are structurally similar to the query compound. We elucidated the binding mode for four of the eight confirmed scaffold hops to TTK by determining their protein-ligand crystal structures; each represents a ligand-efficient scaffold for inhibitor design.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23672464 PMCID: PMC3665241 DOI: 10.1021/ci400100c
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1(a) Query compound 1 and its scaffold tree fragmentation including the level 1 scaffold (A). (b) Tautomers/rotamers (1a and 1b) of compound 1.
Figure 2Scaffold-focused virtual screen (solid arrows) and whole molecule-based virtual screen (dashed arrows) as described in the Methods section.
Biochemical Assay Results, IC50, and Ligand Efficiencies (LE) for Compounds Derived from the Scaffold-Focused Virtual Screen, with ≥50% Inhibition at 40 or 400 μMa
ND = not determined.
Single point assays were performed in duplicate; percent inhibition values shown are the mean of the two results reported with the standard deviation.
IC50 experiments were performed in duplicate on the same assay plate at 1% and 10% DMSO. An asterisk indicates compounds tested at 10% DMSO; all other compounds were tested at 1% DMSO.
Percent inhibition and IC50 values do not correlate for these compounds due to poor solubility at higher concentrations. The level 1 scaffold for each compound is highlighted in red.
Biochemical Assay Results IC50 and Ligand Efficiencies (LE) for Compounds Derived from the Whole Molecule Virtual Screen, with ≥50% Inhibition at 40 or 400 μMa
ND = not determined.
Single point assays were performed in duplicate; percent inhibition values shown are the mean of the two results reported with the standard deviation.
IC50 experiments were performed in duplicate on the same assay plate at 1% and 10% DMSO. An asterisk indicates compounds tested at 10% DMSO; all other compounds were tested at 1% DMSO.
Percent inhibition and IC50 values do not correlate for these compounds due to poor solubility at higher concentrations.
Number of Compounds and Unique Scaffolds Discovered Using the Scaffold-Focused and Whole Molecule Virtual Screena
| Hit Rate | MPS molecule | MPS scaffold | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 2D ECFP_4 | 70 | 17 | 0.24 | 8 | 6 | 0.75 | 11.4% | 0.60 ± 0.19 | 0.76 ± 0.13 |
| 3D ROCS | 50 | 16 | 0.32 | 1 | 1 | 1 | 2.0% | N/A | N/A |
| 2D ECFP_4 | 50 | 22 | 0.44 | 10 | 6 | 0.60 | 20% | 0.77 ± 0.08 | 0.61 ± 0.15 |
| 3D ROCS tautomer | 37 | 22 | 0.59 | 2 | 2 | 1 | 5.4% | 0.61 ± 0.00 | 0.37 ± 0.00 |
| 3D ROCS tautomer | 35 | 19 | 0.54 | 1 | 1 | 1 | 2.9% | N/A | N/A |
M = number of compounds tested, N = number of unique scaffolds present in the compounds tested, MA = number of active compounds, NA = number of unique active scaffolds. Actives include only compounds confirmed by IC50 determination. MPS molecule = mean pairwise similarity of active compounds with the standard deviation. MPS scaffold = mean pairwise similarity of active scaffolds with the standard deviation.
Figure 3Box and whisker plots depicting the pairwise similarity for confirmed active compounds. SF molecule = pairwise similarity of active molecules from the scaffold-focused virtual screen. WM molecule = pairwise similarity of active molecules from the whole molecule virtual screen. SF scaffold = pairwise similarity of the level 1 scaffolds of active molecules from the scaffold-focused virtual screen. WM scaffold = pairwise similarity of the level 1 scaffolds of active molecules from the whole molecule virtual screen. All similarities are calculated using the Tanimoto coefficient and MDL public keys. Box plots for the scaffold-focused virtual screen are shown in blue and box plots for the whole molecule virtual screen are shown in red.
Figure 4Box and whisker plots depicting the similarity to the query compound 1 of compounds selected by virtual screening. SF molecule = similarity of compounds selected using the scaffold-focused virtual screen to the query molecule. WM molecule = similarity of compounds selected using the whole molecule virtual screen to the query molecule. SF scaffold = similarity of the level 1 scaffolds of compounds selected using the scaffold focused virtual screen to the query molecule. WM scaffold = similarity of the level 1 scaffolds of compounds selected using the whole molecule virtual screen to the query molecule. Box plots for the scaffold-focused virtual screen are shown in blue and box plots for the whole molecule virtual screen are shown in red.
Figure 5Activity of all compounds selected with scaffold-focused and whole molecule-based virtual screening against TTK at 40 μM versus their similarity to the query compound 1 calculated using the Tanimoto coefficient and MDL public keys. Numbers indicate which compound the respective point represents. Hit compounds located in the upper right quadrant have high structural similarity to query compound 1 and high activity. Hit compounds in the upper left quadrant have low structural similarity to query compound 1 and high activity.
Figure 6Activity of fragment-like compounds selected with scaffold-focused and whole molecule fragment virtual screening against TTK at 400 μM versus their similarity to the query compound 1 calculated using the Tanimoto coefficient and MDL public keys. Numbers indicate which compound the respective point represents. Hit compounds located in the upper right quadrant have high structural similarity to query compound 1 and high activity. Hit compounds in the upper left quadrant have low structural similarity to query compound 1 and high activity.
Data Collection and Refinement Statisticsa
| 4BHZ | 4BI0 | 4BI1 | 4BI2 | |
| space group | ||||
| 71.16 | 71.23 | 70.91 | 70.69 | |
| 111.68 | 111.97 | 111.82 | 111.68 | |
| 113.28 | 113.78 | 113.57 | 112.16 | |
| resolution range (Å) | 56.64–2.85 | 56.89–2.84 | 56.78–2.70 | 56.08–3.11 |
| highest resolution shell | (3.00–2.85) | (2.99–2.84) | (2.85–2.70) | (3.28–3.11) |
| unique reflections | 10886 (1573) | 11081 (1582) | 12760 (1827) | 8291 (1191) |
| completeness (%) | 100.0 (100.0) | 100.0 (100.0) | 100.0 (100.0) | 100.0 (100.0) |
| multiplicity | 4.2 (4.3) | 4.1 (4.1) | 4.0 (4.1) | 4.3 (4.4) |
| 6.7 (52.4) | 7.9 (48.8) | 7.2 (52.1) | 7.9 (51.5) | |
| 9.3 (1.5) | 7.8 (1.5) | 7.2 (1.5) | 8.2 (1.5) | |
| mean ( | 13.9 (2.8) | 12.2 (2.8) | 10.6 (2.4) | 11.5 (2.7) |
| mean mosaicity (deg) | 0.71 | 0.49 | 0.78 | 0.89 |
| R factor (%) | 18.7 | 18.4 | 18.8 | 19.4 |
| R free (%) | 23.0 | 22.36 | 22.8 | 22.6 |
| no. amino acids | 251 | 254 | 252 | 253 |
| no. waters | 7 | 15 | 21 | 3 |
| no. ligands | 1 | 1 | 1 | 1 |
| no. PEG | 2 | 2 | 4 | 1 |
| no. ethylene glycol | 1 | 1 | 4 | 2 |
| bond length (Å) | 0.010 | 0.010 | 0.010 | 0.010 |
| bond angles (deg) | 1.15 | 1.18 | 1.20 | 1.17 |
| favored (%) | 96.7 | 96.4 | 96.7 | 95.6 |
| forbidden (%) | 0.0 | 0.4 | 0.0 | 0.0 |
Values in parentheses are highest shell values. The wavelength for data collection was 0.8726 Å. All data was collected on April 13, 2011, on the ID23eh2 beamline at ESRF.
Figure 7(a) X-ray structure of 3 bound to TTK at 2.85 Å. PDB: 4BHZ. (b) X-ray structure of 4 bound to TTK at 2.84 Å. PDB: 4BI0. (c) X-ray structure of 5 bound to TTK at 2.70 Å. PDB: 4BI1. (d) X-ray structure of 7 bound to TTK at 3.11 Å. PDB: 4BI2. The electron density shown in green wire mesh is the Fo–Fc electron density omit map contoured at 3σ. TTK is shown in purple and residues near to the ligand are shown in cylinder representation with carbon atoms in purple and are labeled. Hydrogen bonds are indicated by black dashed lines.