| Literature DB >> 30552524 |
Hio Kuan Tai1, Siti Azma Jusoh2, Shirley W I Siu3.
Abstract
BACKGROUND: Protein-ligand docking programs are routinely used in structure-based drug design to find the optimal binding pose of a ligand in the protein's active site. These programs are also used to identify potential drug candidates by ranking large sets of compounds. As more accurate and efficient docking programs are always desirable, constant efforts focus on developing better docking algorithms or improving the scoring function. Recently, chaotic maps have emerged as a promising approach to improve the search behavior of optimization algorithms in terms of search diversity and convergence speed. However, their effectiveness on docking applications has not been explored. Herein, we integrated five popular chaotic maps-logistic, Singer, sinusoidal, tent, and Zaslavskii maps-into PSOVina[Formula: see text], a recent variant of the popular AutoDock Vina program with enhanced global and local search capabilities, and evaluated their performances in ligand pose prediction and virtual screening using four docking benchmark datasets and two virtual screening datasets.Entities:
Keywords: Autodock Vina; Chaotic maps; Docking; PSOVina; Singer map; Sinusoidal map; Virtual screening
Year: 2018 PMID: 30552524 PMCID: PMC6755579 DOI: 10.1186/s13321-018-0320-9
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
The parameters of chaotic maps used in this study
| Chaotic map | Parameter |
|---|---|
| Logistic map |
|
| Singer map |
|
| Sinusoidal map |
|
| Zaslavskii map |
Four datasets for the pose prediction test
| Name | Description | Number of complexes | References |
|---|---|---|---|
| PDBbind v2014 (core-set) | Representatives of protein clusters of high-quality structures selected from Protein Data Bank | 195 | [ |
| Astex diverse set | Proteins are pharmaceutical or agrochemical targets; ligands are approved drugs or in clinical trials | 85 | [ |
| GOLD benchmark set | Selected diverse complexes which were checked to be free from structural errors | 77 | [ |
| SB2012 docking validation set | Ligands with a wide range of flexibilities | 1043 | [ |
Number of actives and decoys in the DUD-E datasets for virtual screening test after preprocessing
| Target | Type | Active | Decoy | Failed active | Failed decoy |
|---|---|---|---|---|---|
| (a) DIV8: Diverse target subset | |||||
| akt1 | Kinase | 293 | 16,448 | 0 | 2 |
| ampc | Enzyme | 48 | 2850 | 0 | 0 |
| cp3a4 | Cytochrome | 170 | 11,798 | 0 | 2 |
| cxcr4 | G protein-coupled receptor | 40 | 3414 | 0 | 0 |
| gcr | Nuclear receptor | 258 | 14,996 | 0 | 4 |
| hivpr | Protease | 536 | 35,743 | 0 | 7 |
| hivrt | Enzyme | 337 | 18,887 | 1 | 4 |
| kif11 | Other | 116 | 6847 | 0 | 3 |
| Total | 1794 | 110,983 | 1 | 22 | |
Actives and decoys which failed to pass all the preprocessing steps were not included in the virtual screening experiments
Docking performance comparison of AutoDock Vina, PSOVina, PSOVina, and chaos-embedded PSOVina methods on four pose prediction datasets
| Best-scoring pose RMSD (Å) | Average RMSD (Å) | Best-scoring pose success rate (%) | Average success rate (%) | No. of iterations | Run time (s) | |
|---|---|---|---|---|---|---|
| (a) PDBBind v.2014 dataset | ||||||
| AutoDock Vina | 2.68393 | 2.70336 | 62.56 | 61.33 | 22777 | 21.46 |
| PSOVina | 2.27188 | 2.50727 | 68.21 | 64.67 | 892 | 8.97 |
| PSOVina | 2.14915 | 2.79023 | 70.77 | 61.03 | 957 | 3.43 |
| Chaos-embedded PSOVina | ||||||
| Logistic map | 1.95241 | 2.61573 | 72.82 | 63.49 | 1053 | 3.75 |
| Singer map | 1.98661 | 2.52277 | 72.82 | 64.26 | 1069 | 3.75 |
| Sinusoidal map |
| 2.73205 |
| 61.33 | 1105 | 3.82 |
| Tent map | 2.07797 | 2.77287 | 69.23 | 60.92 | 981 | 3.54 |
| Zaslavskii map | 1.98789 | 2.65951 | 72.31 | 62.00 | 1015 | 3.67 |
| (b) Astex diverse dataset | ||||||
| AutoDock Vina | 1.90681 | 1.92633 | 71.76 | 71.53 | 20086 | 18.53 |
| PSOVina | 1.82160 | 1.71506 | 74.12 | 76.35 | 1392 | 8.21 |
| PSOVina | 1.58374 | 1.87782 | 75.29 | 72.59 | 885 | 2.63 |
| Chaos-embedded PSOVina | ||||||
| Logistic map | 1.63183 | 1.90169 | 76.47 | 71.65 | 951 | 2.82 |
| Singer map | 1.61686 | 1.88862 | 77.65 | 72.35 | 1097 | 3.05 |
| Sinusoidal map |
| 1.99939 |
| 71.06 | 1234 | 3.30 |
| Tent map | 1.54835 | 1.91905 | 78.82 | 72.12 | 968 | 2.85 |
| Zaslavskii map | 1.54228 | 1.84950 | 78.82 | 72.12 | 928 | 2.72 |
| (c) GOLD benchmark set | ||||||
| AutoDock Vina | 2.78586 | 2.91744 | 64.94 | 63.25 | 20071 | 19.91 |
| PSOVina | 2.59811 | 2.58979 | 66.23 | 66.75 | 1289 | 7.64 |
| PSOVina | 2.41496 | 2.85823 | 71.43 | 60.91 | 897 | 2.75 |
| Chaos-embedded PSOVina | ||||||
| Logistic map | 2.32352 | 2.71251 |
| 64.42 | 1002 | 2.97 |
| Singer map | 2.50710 | 2.73068 | 71.43 | 62.73 | 990 | 2.97 |
| Sinusoidal map | 2.27549 | 2.61833 | 74.03 | 64.81 | 1065 | 3.15 |
| Tent map |
| 2.69675 | 70.13 | 62.60 | 916 | 2.72 |
| Zaslavskii map | 2.45169 | 2.80725 | 72.73 | 62.73 | 866 | 2.69 |
| (d) SB2012 docking validation dataset | ||||||
| AutoDock Vina | 2.64185 | 2.77003 | 63.47 | 61.79 | 22977 | 20.33 |
| PSOVina | 2.38248 | 2.64763 | 65.68 | 62.78 | 1372 | 12.77 |
| PSOVina | 2.29462 | 2.91399 | 66.06 | 58.12 | 1036 | 3.04 |
| Chaos-embedded PSOVina | ||||||
| Logistic map | 2.41665 | 2.91596 | 66.25 | 57.94 | 1112 | 3.31 |
| Singer map |
| 2.94298 |
| 57.48 | 1138 | 3.25 |
| Sinusoidal map | 2.16409 | 3.08916 | 70.09 | 54.67 | 1133 | 3.03 |
| Tent map | 2.17928 | 2.99936 | 69.22 | 56.74 | 1066 | 3.21 |
| Zaslavskii map | 2.35440 | 2.94977 | 66.06 | 57.17 | 1081 | 3.27 |
The best-scoring pose is the pose with the lowest binding affinity in docking repeats. Thus, best-scoring pose RMSD and success rate are the average RMSD and success rate of the best-scoring poses of all complexes in the dataset
No. of iterations and run time were averaged from all docking instances
Best results are shown in italics
Overall pose prediction performance of AutoDock Vina, PSOVina, PSOVina, chaos-embedded PSOVina methods
| Best-scoring pose RMSD (Å) | Best-scoring pose success rate (%) | Run time (s) | |
|---|---|---|---|
| AutoDock Vina | 2.50 (0.40) | 65.68 (4.17) | 17.56 (5.24) |
| PSOVina | 2.27 (0.33) | 68.56 (3.86) | 9.40 (2.31) |
| PSOVina | 2.11 (0.37) | 70.89 (3.79) | |
| Chaos-embedded PSOVina | |||
| Logistic map | 2.08 (0.36) | 72.72 (4.57) | 3.21 (0.41) |
| Singer map | 2.06 (0.37) | 73.21 (3.06) | 3.26 (0.35) |
| Sinusoidal map | 3.33 (0.35) | ||
| Tent map | 2.01 (0.31) | 71.85 (4.67) | 3.08 (0.37) |
| Zaslavskii map | 2.08 (0.41) | 72.48 (5.21) | 3.09 (0.47) |
Best results are shown in italics
Fig. 1ROC curves of virtual screening the DUD-E diverse targets using AutoDock Vina, PSOVina, and chaos-embedded PSOVina with Singer and sinusoidal maps
Fig. 2Run time (in seconds) of virtual screening the DUD-E diverse targets. Text annotations in the violin plot indicate the maximum (top), median (text in blue, location of the median shown as a black dot), and minimum (bottom) run times by each method
Results of area under ROC curves (AUC-ROC) and enrichment factor (EF) of virtual screening the DUD-E diverse targets (DIV8) using AutoDock Vina, PSOVina, and chaos-embedded PSOVina with Singer and sinusoidal maps
| Target | AutoDock Vina | PSOVina | Singer | Sinusoidal | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | EF | EF | AUC | EF | EF | AUC | EF | EF | AUC | EF | EF | |
| akt1 | 0.55 | 0.00 | 1.52 | 0.47 | 1.71 | 1.31 | 0.40 | 1.37 | 0.75 | 0.42 | 2.05 | 0.84 |
| ampc | 0.60 | 0.00 | 1.25 | 0.59 | 0.00 | 1.46 | 0.62 | 2.08 | 1.56 | 0.63 | 2.08 | 1.56 |
| cp3a4 | 0.58 | 0.60 | 1.65 | 0.59 | 1.19 | 1.62 | 0.57 | 1.19 | 1.53 | 0.58 | 1.79 | 1.53 |
| cxcr4 | 0.52 | 0.00 | 0.87 | 0.54 | 0.00 | 0.75 | 0.59 | 0.00 | 0.25 | 0.59 | 0.00 | 0.87 |
| gcr | 0.53 | 10.43 | 1.98 | 0.53 | 10.82 | 1.88 | 0.53 | 11.59 | 1.90 | 0.53 | 11.98 | 1.88 |
| hivpr | 0.71 | 4.10 | 2.31 | 0.71 | 3.17 | 2.38 | 0.71 | 2.98 | 2.34 | 0.69 | 3.17 | 2.17 |
| hivrt | 0.66 | 4.77 | 2.20 | 0.65 | 4.77 | 2.17 | 0.65 | 4.77 | 1.93 | 0.64 | 4.77 | 1.92 |
| kif11 | 0.84 | 23.15 | 3.66 | 0.85 | 25.73 | 3.71 | 0.87 | 24.87 | 3.92 | 0.86 | 18.87 | 3.71 |
| Average | 0.62 | 5.38 | 1.93 | 0.62 | 5.92 | 1.91 | 0.61 | 6.11 | 1.77 | 0.62 | 5.59 | 1.81 |
Results of area under ROC curves (AUC-ROC) and enrichment factor (EF) of virtual screening the DUD-E nuclear receptor targets (NR11) using AutoDock Vina, PSOVina, and chaos-embedded PSOVina with Singer and sinusoidal maps
| Target | AutoDock Vina | PSOVina | Singer | Sinusoidal | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | EF | EF | AUC | EF | EF | AUC | EF | EF | AUC | EF | EF | |
| andr | 0.57 | 11.54 | 1.93 | 0.57 | 11.17 | 1.90 | 0.57 | 11.17 | 1.84 | 0.57 | 10.79 | 1.88 |
| esr1 | 0.75 | 13.82 | 2.74 | 0.74 | 13.56 | 2.60 | 0.74 | 11.73 | 2.62 | 0.73 | 12.77 | 2.69 |
| esr2 | 0.77 | 11.70 | 3.09 | 0.76 | 10.88 | 3.04 | 0.76 | 12.51 | 3.00 | 0.76 | 13.06 | 3.05 |
| gcr | 0.53 | 10.43 | 1.98 | 0.53 | 10.82 | 1.88 | 0.53 | 11.59 | 1.90 | 0.53 | 11.98 | 1.88 |
| mcr | 0.53 | 3.22 | 1.54 | 0.53 | 3.22 | 1.54 | 0.53 | 3.22 | 1.60 | 0.53 | 3.22 | 1.54 |
| ppara | 0.85 | 4.55 | 3.65 | 0.80 | 2.68 | 2.98 | 0.75 | 2.14 | 2.60 | 0.73 | 1.07 | 2.24 |
| ppard | 0.81 | 2.50 | 3.33 | 0.79 | 2.08 | 2.75 | 0.72 | 0.83 | 2.06 | 0.74 | 1.67 | 2.17 |
| pparg | 0.79 | 5.57 | 3.11 | 0.76 | 3.30 | 2.52 | 0.72 | 2.89 | 2.13 | 0.70 | 2.68 | 2.02 |
| prgr | 0.61 | 9.56 | 2.25 | 0.61 | 9.56 | 2.24 | 0.61 | 9.56 | 2.22 | 0.60 | 9.90 | 2.25 |
| rxra | 0.83 | 33.50 | 3.55 | 0.83 | 32.74 | 3.47 | 0.81 | 33.50 | 3.51 | 0.81 | 32.73 | 3.44 |
| thb | 0.81 | 26.05 | 3.35 | 0.81 | 25.09 | 3.40 | 0.81 | 27.02 | 3.25 | 0.80 | 25.08 | 3.25 |
| Average | 0.71 | 12.04 | 2.78 | 0.70 | 11.37 | 2.57 | 0.69 | 11.47 | 2.43 | 0.68 | 11.36 | 2.40 |
| Average | 0.67 | 14.98 | 2.55 | 0.67 | 14.63 | 2.51 | 0.67 | 15.04 | 2.49 | 0.67 | 14.94 | 2.50 |
Averaged without ppara, ppard and pparg
Fig. 3Average run time (in seconds) versus number of ligand rotatable bonds