| Literature DB >> 27924269 |
Tatsuya Okuno1, Koya Kato2, Shintaro Minami3, Tomoki P Terada2, Masaki Sasai2, George Chikenji2.
Abstract
We discuss methods and ideas of virtual screening (VS) for drug discovery by examining the performance of VS-APPLE, a recently developed VS method, which extensively utilizes the tendency of single binding pockets to bind diversely different ligands, i.e. promiscuity of binding pockets. In VS-APPLE, multiple ligands bound to a pocket are spatially arranged by maximizing structural overlap of the protein while keeping their relative position and orientation with respect to the pocket surface, which are then combined into a multiple-ligand template for screening test compounds. To greatly reduce the computational cost, comparison of test compound structures are made only with limited regions of the multiple-ligand template. Even when we use the narrow regions with most densely populated atoms for the comparison, VSAPPLE outperforms other conventional VS methods in terms of Area Under the Curve (AUC) measure. This region with densely populated atoms corresponds to the consensus region among multiple ligands. It is typically observed that expansion of the sampled region including more atoms improves screening efficiency. However, for some target proteins, considering only a small consensus region is enough for the effective screening of test compounds. These results suggest that the performance test of VS methods sheds light on the mechanisms of protein-ligand interactions, and elucidation of the protein-ligand interactions should further help improvement of VS methods.Entities:
Keywords: computational speed; drug discovery; flexibility; promiscuity
Year: 2016 PMID: 27924269 PMCID: PMC5042167 DOI: 10.2142/biophysico.13.0_149
Source DB: PubMed Journal: Biophys Physicobiol ISSN: 2189-4779
Figure 1An example of multiple-ligand template for ace (yellow thin lines) and an active compound detected by the multiple-ligand template (CPK colored thick lines). The multiple-ligand template comprises ten different ligands. The active compound was superposed so that the structural overlap between the active compound and the multiple-ligand template was maximized.
Dataset used for the performance test
| Target protein (abbrev.) | PDB code | # of actives | # of decoys |
|---|---|---|---|
| Angiotensin converting enzyme (ace) | 1o86 | 46 | 1797 |
| Acetylcholinesterase (ache) | 1eve | 100 | 3892 |
| Cyclin-dependent kinase 2 (cdk2) | 1ckp | 47 | 2074 |
| Cyclooxygenase 2 (cox2) | 1cx2 | 212 | 13289 |
| Epidermal growth factor receptor (egfr) | 1m17 | 365 | 15996 |
| Factor Xa (fxa) | 1f0r | 64 | 5745 |
| HIV reverse transcriptase (hivrt) | 1rt1 | 34 | 1519 |
| Enoyl ACP reductase InhA (inha) | 1p44 | 57 | 3266 |
| p38 mitogen activated protein (p38) | 1kv2 | 137 | 9141 |
| Phosphodiesterase (pde5) | 1xp0 | 26 | 1978 |
| Platelet derived growth factor receptor kinase (pdgfrb) | 1t46 | 124 | 5980 |
| Tyrosine kinase Src (src) | 2src | 98 | 6319 |
| Vascular endothelial growth factor receptor (vegfr2) | 1fgi | 48 | 2906 |
Figure 2Dependence of computational time on percentage of used coordinate systems for each compound. CPU time was measured on a PC with AMD Opteron 2.4 GHz processor. Calculated values are fitted by a linear function.
Figure 3Dependence of AUC on the percentage x of most crowded coordinates used in the performance test. The number in a parenthesis shown on the right hand side of each target name represents the total number of the coordinate systems of the multiple-ligand template for each target.
Figure 4Dependence of spread of densely populated regions of multiple-ligand templates on the percentage x of used coordinate systems for pde5 (A), ace (B), fxa (C), src (D) and p38 (E). The red colored atoms are ones that are assigned as the origin of reference frame system ranked in top x-percent of the crowdedness defined in Eq. 4.