Literature DB >> 28106755

The Performance of Several Docking Programs at Reproducing Protein-Macrolide-Like Crystal Structures.

Alejandro Castro-Alvarez1, Anna M Costa2, Jaume Vilarrasa3.   

Abstract

The accuracy of five docking programs at reproducing crystallographic structures of complexes of 8 macrolides and 12 related macrocyclic structures, all with their corresponding receptors, was evaluated. Self-docking calculations indicated excellent performance in all cases (mean RMSD values ≤ 1.0) and confirmed the speed of AutoDock Vina. Afterwards, the lowest-energy conformer of each molecule and all the conformers lying 0-10 kcal/mol above it (as given by Macrocycle, from MacroModel 10.0) were subjected to standard docking calculations. While each docking method has its own merits, the observed speed of the programs was as follows: Glide 6.6 > AutoDock Vina 1.1.2 > DOCK 6.5 >> AutoDock 4.2.6 > AutoDock 3.0.5. For most of the complexes, the five methods predicted quite correct poses of ligands at the binding sites, but the lower RMSD values for the poses of highest affinity were in the order: Glide 6.6 ≈ AutoDock Vina ≈ DOCK 6.5 > AutoDock 4.2.6 >> AutoDock 3.0.5. By choosing the poses closest to the crystal structure the order was: AutoDock Vina > Glide 6.6 ≈ DOCK 6.5 ≥ AutoDock 4.2.6 >> AutoDock 3.0.5. Re-scoring (AutoDock 4.2.6//AutoDock Vina, Amber Score and MM-GBSA) improved the agreement between the calculated and experimental data. For all intents and purposes, these three methods are equally reliable.

Entities:  

Keywords:  AutoDock; DOCK; Glide; Vina; docking; macrolides; natural products

Mesh:

Substances:

Year:  2017        PMID: 28106755      PMCID: PMC6155922          DOI: 10.3390/molecules22010136

Source DB:  PubMed          Journal:  Molecules        ISSN: 1420-3049            Impact factor:   4.411


1. Introduction

Many important drugs and hundreds of natural products with promising bioactivities are macrolides or have macrolide-related structures. Such compounds often show favorable pharmacokinetic profiles with regard to open-chain molecules, despite not always following Lipinski’s rules [1]. Conformational search is important for identifying their bioactive conformers, which is a prerequisite for good results in a wide range of modeling methods. Indeed, adequate sampling is extremely important in docking calculations and small changes in the search parameters can have a significant effect on their performance [2]. In this context, over the past two decades our research group has worked on the total synthesis of bioactive macrolide-like natural products, such as fluvirucins [3,4]; amphidinolides X [5], Y [6], K [7], E [8] and B2 [9]; and palmerolide A [10]. We performed docking calculations and collaborated with biochemists, chiefly with the aims of elucidating the mechanisms of action of some cytotoxic macrolides [11,12,13,14,15]. Since then, updated and new docking software has been released [16,17,18,19,20,21,22,23,24,25,26,27]. This led us to wonder whether some explanations and predictions, included in PhD theses or articles in the past decade, would still stand in the present scenario. For example, an updated version of AutoDock 3.0.5 [28] (henceforward AD 3.0, a popular program that we used years ago) is AutoDock 4.2.6 [29] (henceforward AD 4.2). A faster program, AutoDock Vina 1.1.2 [30] (henceforward AD Vina) was more recently launched by the same institution. It is expected that these programs will provide improved results and allow better explanations than those facilitated by AD 3.0. Last year, once most of our AD Vina results were at hand, PSOVina, a program that includes a particle swarm optimization algorithm, was released. PSOVina reduces the calculation time of AD Vina even further; since the accuracy is the same [31], only a few tests have been performed using it. All these packages are free. Other docking methods that we have examined are DOCK 6.5 (henceforward, DOCK) [32] and Glide 6.6 (Glide) [33]. DOCK is also free. Glide, from Schrödinger, appears to be the most popular of the commercial docking programs, according to an exhaustive SciFinder search. Thus, in the present study, we restricted ourselves to these five programs. Comparative studies of different docking programs have been reported involving small molecules (not discussed here for the sake of space restrictions). However, we have found only two precedents of the interactions of macrolides and macrolactams with their receptors: the docking of dictyostatin into the taxol binding site [34] by Glide, AutoDock 4.0 and RosettaLigand [35]; and that of spinosyn precursors (22-membered macrolides) and their products by AD Vina and DOCK [36]. Our objective was to evaluate how reliably these programs reproduce a set of protein–macrolide complexes for which crystallographic structures were reported with a good resolution (there are only eight disparate representative examples to date). Since this number is low, in our set we included some macrolactams, cyclodepsipeptides, and other heteroatom-containing macrocycles of similar size to our desired macrolides for which high-resolution crystallographic structures were also published. This gave us a total of 20 examples, which we ordered according to their PDB code (Figure 1). Six of these complexes (1NTI, 1UU3, 1W96, 2IYA, 2XBK, and 3EKS) have just been examined using other methods [37]. We envisage expanding our test to other programs in the future and applying the most efficient to dock macrolide-like anti-tumor agents synthesized by our research group into cytoskeleton proteins.
Figure 1

Selected ligands, with the PDB code and resolution (in Å, within brackets) of the crystalline complexes. For each macrocyclic ligand, the ring size is indicated by a number inside a circle.

2. Results and Discussion

2.1. Self-Docking of the Twenty PDB Structures

First of all, a self-docking study (cognate docking) was undertaken in which each ligand of Figure 1 was separated from each receptor (protein) and each pair was then subjected to a rigid docking calculation. We first used the standard parameters given for each program. The results obtained with the five programs mentioned in the introduction are shown in Table 1. All the methods gave very good results. In fact, the root mean square deviation values were quite low (mean RMSD ≤ 1.0 Å); the standard errors of these mean RMSDs (SEM) are also indicated. The standard deviations (SD) are shown in the penultimate row and the median values in the last row. There are other appropriate statistical methods to evaluate the self-docking performance [38], but the mean and median RMSD values are sufficiently representative.
Table 1

RMSD values (in Å) and scoring energies a for the self-docking of each ligand in its binding site.

AD VinaAD 4.2AD 3.0DOCKGlide
RMSDScoreRMSDScoreRMSDScoreRMSDScoreRMSDScore
11ESV0.25−10.80.60−8.520.62−10.400.33−67.420.37−7.84
21FKD0.67−11.30.71−11.100.66−12.381.11−61.331.14−6.95
31NM60.26−12.80.42−12.791.22−12.240.23−88.461.07−11.32
41NT10.65−13.30.49−13.501.99−6.890.20−85.140.79−9.68
51PKF0.42−13.71.19−12.731.12−13.670.58−77.961.97−8.32
61R8Q0.31−12.60.83−10.660.62−11.690.27−65.970.28−11.73
71UU30.48−11.90.97−11.691.00−12.880.84−84.540.61−10.27
81W960.48−11.70.67−12.030.73−13.920.31−95.400.60−8.86
92C6H0.30−11.10.59−9.990.65−12.360.70−79.951.27−6.61
102E9U0.38−10.20.47−9.100.67−10.200.22−67.520.39−8.68
112IYA0.49−13.20.50−11.600.40−14.730.73−84.691.39−9.37
122VWC0.31−9.41.45−9.150.80−10.430.51−78.131.04−6.94
132XBK0.62−18.51.50−15.231.05−18.330.66−115.341.05−11.64
142XX50.52−11.30.75−10.580.83−8.921.04−73.640.70−8.84
153DV10.74−10.91.17−11.570.94−14.290.60−94.391.94−9.25
163DV51.02−11.60.50−15.360.37−17.220.49−113.992.37 b−8.20
173EKS0.91−11.50.58−11.770.73−12.060.35−75.360.35−10.00
183QTF0.51−13.30.95−10.881.13−12.570.44−78.450.82−11.53
193UYK0.97−10.71.26−10.031.39−10.760.79−55.110.63−7.92
204DRU0.68−13.11.16−12.991.16−12.920.93−71.691.03−8.78
mean0.55 ± 0.050.84 ± 0.080.90 ± 0.080.57 ± 0.060.99 ± 0.13
SD0.230.340.370.280.57
median0.500.730.820.550.93

a Energies in kcal/mol, except for DOCK (in kJ·mol−1), as given by each method; b RMSD values higher than 2.00 are written in red throughout the Tables.

The lowest mean RMSD values were those provided by AD Vina (0.55) and DOCK (0.57). The speed of AD Vina was worth noting (just a few seconds per calculation), whereas the other programs required ca. 5 min per calculation. Figure 2 shows graphically which complexes were better or more poorly reproduced in silico.
Figure 2

Graphic representation of the RMSD values for the self-docking of ligands 1–20 into their corresponding binding sites (crystal structures).

When only the macrolide subset (1, 5, 6, 8, 9, 11, 13, and 19) was taken into account, the average RMSD values turned out slightly better than the averages for all 20 compounds shown in Table 1: AD Vina, 0.48 < DOCK, 0.55 < AD 3.0, 0.82 < AD 4.2, 0.89 < Glide, 0.94

2.2. Ligand Conformational Search

The database of conformers for each ligand was obtained by means of the MD/LLMod method. The values of the RMSD between the crystal and the predicted structures are widely used to confirm whether a close-match docked pose was predicted or not by the docking simulation. An RMSD value ≤ 2 Å is fairly good. Conformers lying 0–42 kJ·mol−1 (ca. 0–10 kcal/mol) above the lowest-energy minimum were retrieved. For some structures (e.g., for 17/3EKS) a set of as few as 17 conformers was obtained, whereas for larger macrocycles >250 conformations had to be calculated (see Table 2). For the cases 15/3DV1 and 16/3DV5, with more rotatable bonds, the numbers of conformers were 1106 and 1402, respectively.
Table 2

RMSD values (in Å) a and scoring energies of the lowest-energy poses of 1–20.

No. Conf.AD VinaAD 4.2AD 3.0DOCKGlide
RMSDScoreRMSD ScoreRMSDScoreRMSDScoreRMSDScore
11ESV730.47−10.70.72−9.471.04−11.560.59−61.980.62−8.44
21FKD3151.55−9.11.09−10.292.62−9.890.86−71.401.45−7.00
31NM65317.85−10.90.89−12.175.28−10.678.44−71.020.44−10.20
41NT12461.28−13.21.33−12.533.63−11.501.66−67.530.61−11.43
51PKF4121.05−13.31.59−13.565.41−14.720.67−78.592.03−8.86
61R8Q1090.50−11.51.23−10.435.28−11.920.44−63.660.3811.66
71UU3381.00−10.91.27−11.165.77−11.401.64−83.790.78−10.58
81W962780.45−11.70.69−12.270.64−12.660.41−86.840.52−8.92
92C6H690.58−10.93.94−9.861.49−11.830.75−84.401.17−7.02
102E9U360.82−10.90.79−8.980.91−10.121.06−60.400.36−8.75
112IYA2650.62−11.80.98−16.081.15−14.820.94−99.181.39−9.99
122VWC600.70−9.80.79−9.360.79−10.870.59−76.215.36−6.15
132XBK3731.09−16.31.72−19.731.39−18.660.65−111.521.12−12.84
142XX54221.14−11.36.41−11.495.47−9.681.24−73.531.15−8.96
153DV111060.90−11.41.14−13.051.29−13.481.03−96.731.14−10.68
163DV514020.92−12.41.10−14.390.76−16.470.56−102.581.94−8.04
173EKS170.61−10.80.71−12.970.61−12.370.64−70.420.67−10.02
183QTF240.94−13.41.28−11.591.75−13.730.59−74.860.94−11.62
193UYK782.23−10.04.88−10.575.30−10.782.30−52.572.33−7.46
204DRU501.50−13.11.31−12.561.82−12.901.64−77.541.44−8.23
mean1.31 ± 0.361.69 ± 0.342.62 ± 0.451.34 ± 0.391.29 ± 0.25
SD1.601.542.011.751.11
median0.931.191.620.811.13
mean corr. b0.90 ± 0.081.09 ± 0.07 0.94 ± 0.100.95 ± 0.11

a Energies in kcal/mol, except for DOCK (in kJ·mol−1), as given by each method. The highest (poorest) values are indicated in bold red. Poor RMSD values (above 2.0 Å but below 2.5 Å) written in red; b Mean values and standard deviations ruling out the 2–4 RMSD values drawn in red (outliers).

2.3. Docking of all the Conformers of Each Ligand

The RMSD values obtained for the lowest-energy poses predicted for each ligand are shown in Table 2 and the corresponding graphic representation is shown in Figure 3. The mean RMSD values for Glide, AD Vina, and DOCK were as low as 1.29/1.31/1.34 Å, respectively. The mean RMSD value for AD 4.2 turned out to be slightly higher (1.69 Å), while for AD 3.0 was 2.62 Å.
Figure 3

Graphic representation of the RMSD values for the lowest energy pose of each ligand (1–20) in their binding sites.

From a statistics viewpoint, we have also included at the bottom of Table 2 the mean RMSDs after ruling out the 2–4 RMSD values drawn in red (outliers) in the corresponding columns. It is clear that all the methods, except AD 3.0, afford very similar results. The RMSD values for each conformer and a Table with statistical results are given as Supplementary Material. As mentioned above, the five docking programs were initially examined using their standard parameters. However, “flexible rings docking” with the program AD 4.2 (“AD 4.2 Flex”), which allows for the flexibility of the macrocycle, was also carried out. Since poorer values were obtained than those under the standard conditions, this option was not considered further. No induced fit approaches, taking into account the possible flexibility of the side chains of the receptors, were checked for any program, since we did not perform virtual screenings, but instead evaluated the capacity of the different docking programs to reproduce a set of complexes that contain poly-functional ligands. The ligands were relatively large, compared to most “small molecules” (drugs), and they contained a relatively rigid core (the macrocycle). The case of 3/1NM6 requires some comments. All the methods, with the exception of AD 4.2 and Glide, gave RMSD values that were too high. A look at the docking images indicated that there are similar hydrophobic moieties (aromatic rings) at both extremes of 3 and some docking programs indistinctly located the folded conformers in either arrangement; they could not distinguish one pose from the other. The fact is that, for each program, there were 2–4 complexes in which the ligands had too high RMSD values. If these two “worst points” were removed, the mean RMSD values decreased significantly. For instance, the mean RMSD values for AD Vina, DOCK, and Glide decreased to 0.90, 0.94, and 0.95, respectively. In other words, 80%–90% of the structures seemed to be well reproduced. As far as the calculation speed is concerned, some representative computing times with fourth-generation i7 processors are indicated. With 5/1PKF (412 conformers) the order was: Glide, 5 min < AD Vina, 1 h < DOCK, 2 h < AD 4.2, 19 h < AD 3.0, 26 h In the case of 16/3DV5 (1402 conformers), the order was: Thus, AD Vina is not only among the three most accurate methods but it is quite rapid as well. AD 4.2 appears in an intermediate position (good accuracy but slow), while AD 3.0 offered the worst results while requiring more computer time than any other (the least efficient). This is understandable since AD 3.0 is an old version of AD 4.2. We needed to evaluate the difference, since, as commented in the introduction, we had often used the program more than a decade ago. DOCK turned out to be as accurate as AD Vina, as happened in the self-docking procedure, but it was slower than AD Vina. For molecules with a very large number of conformers, the time increased exponentially. The performance of Glide, taking into account that the overall time invested was usually very low, was excellent. However, for molecules with a very large number of conformers, the time grew exponentially, as happened with DOCK. All the calculations with Glide were carried out with the standard precision (SP) method, but see the next section for the use of Glide XP. When only the macrolide subset (1, 5, 6, 8, 9, 11, 13, and 19) was analyzed, the average RMSD values were slightly better than the averages for all 20 compounds shown in Table 2: DOCK, 0.84 < AD Vina, 0.87 < Glide, 1.20 < AD 4.2, 1.98 < AD 3.0, 2.71

2.4. Comparison of AD Vina with PSOVina

With the macrolide subset, we estimated how much the new algorithm contained in PSOVina [31] reduced the calculation time. As shown in Table 3, in six out of the eight instances there were time saving, operating with the same computer, equipped with an i7-4790 processor.
Table 3

Comparison of AD Vina and PSOVina results for the lowest-energy conformers of the macrolide subset.

No. of Conform.AD VinaPSOVina
RMSDScoreTimeRMSDScoreTime
11ESV730.47−10.78 min0.47−10.715 min
51PKF4121.05−13.371 min1.04−13.2652 min
61R8Q1090.50−11.511 min0.48−11.508 min
81W962780.45−11.766 min0.45−11.7250 min
92C6H690.58−10.911 min0.58−10.8515 min
112IYA2650.62−11.8180 min0.96−11.8685 min
132XBK3731.09−16.3225 min1.10−16.24101 min
193UYK782.23−10.06 min2.25−10.0110 min
mean0.87 ± 0.210.92 ± 0.21
SD0.600.60
median0.600.77
The RMSD values were similar, except for the complex of ligand 11/2IYA, for which it was higher with PSOVina (0.96) than with AD Vina (0.62). Owing only to this difference, the average RMSD value for PSOVina turned out to be slightly higher. Notwithstanding, the predicted binding affinities were practically identical.

2.5. Poses with Lower RMSD Values

In several cases, the lowest-energy conformer predicted for each ligand was not the best pose (lowest RMSD). There was another conformer with a lower RMSD, the binding affinity of which was only around 0.0–0.5 kcal/mol above that of the lowest energy. In other words, there was often a conformer that overlapped with the crystalline structure better—we might call them the active conformer or that closest to the bioactive form, assuming that in many cases, but not always, the conformation of the ligand in the crystal structure may be the active conformation—while the score was only slightly different from that reported in Table 2. A summary of the results is shown in Table 4. At the bottom of Table 4, the average RMSDs together with the standard deviations are indicated for each method. Thus, for AD Vina the RMSD values are 0.83 ± 0.39 (they fall between 0.44 and 1.22), for AD 4.2 between 0.63 and 1.31, for DOCK between 0.45 and 1.41, and for Glide between 0.38 and 1.46. The medians are also included. It is observed that the mean and median RMSD values obtained with AD 3.0 are much higher than the rest. In short, all the methods (except AD 3.0) give nearly similar outcomes.
Table 4

RMSD values (in Å) and scoring energies for the best poses (the lowest RMSDs) of 1–20.

No. Conf.AD VinaAD 4.2AD 3.0DOCKGlide
RMSDScorePose aRMSD ScorePoseRMSDScorePoseRMSDScorePoseRMSDScorePose
11ESV730.47−10.710.61−9.2950.82−11.2440.59−61.9810.36−8.313
21FKD3151.04−9.021.09−10.2911.63−9.1250.86−71.4011.12−6.946
31NM65310.92−10.830.89−12.1712.34−10.1032.31−67.6370.44−10.201
41NT12461.28−13.210.85−11.6392.90−11.1841.66−67.5310.48−11.142
51PKF4120.58−12.891.20−13.0582.07−14.1880.67−78.5912.03−8.861
61R8Q1090.50−11.510.84−10.3130.75−11.5270.44−63.6610.38−11.661
71UU3381.00−10.911.27−11.1611.00−11.0751.14−79.1140.78−10.581
81W962780.45−11.710.69−12.2710.64−12.6610.41−86.8410.52−8.921
92C6H690.56−10.820.79−9.6950.88−11.4650.75−84.4010.82−6.8515
102E9U360.82−10.910.79−8.9810.91−10.1211.06−60.4010.36−8.751
112IYA2650.52−11.750.98−16.0810.53−14.6440.94−99.1811.39−9.991
122VWC600.70−9.810.79−9.3610.79−10.8710.59−76.2115.36 c−6.151
132XBK3730.66−16.231.72−19.7311.39−18.6610.65−111.5210.80−12.744
142XX54220.77−10.851.74−10.42 b434.81−9.5981.24−73.5310.91−8.914
153DV111060.90−11.410.92−12.8061.29−13.4810.86−95.7081.14−10.681
163DV514020.92−12.410.67−14.1480.76−16.4710.56−102.5811.94−8.041
173EKS170.61−10.810.71−12.9710.61−12.3710.45−69.8040.31−10.002
183QTF240.56−13.320.81−11.1491.75−13.7310.59−74.8610.52−10.973
193UYK782.23−10.011.56−9.77101.57−10.2451.57−49.93151.69−7.452
204DRU501.07−12.951.31−12.5611.82−12.9011.30−76.2831.44−8.231
mean0.83 ± 0.090.97 ± 0.081.46 ± 0.220.93 ± 0.110.92 ± 0.25
SD0.390.340.990.480.54
median0.740.871.150.810.81

a Pose order (1 = the lowest-energy pose). When the order number is 2, 3, or 4 it is indicated in red; when the order number is ≥5 in bold red; b This is an exception: it is 1.0 kcal/mol above that of the lowest-energy conformer (instead of 0.0–0.5 kcal/mol), but poses 1–42 showed higher RMSDs; c No conformation with lower RMSD was found with a gap lower than 1.5 kcal/mol (better RMSD = 3.15, −4.84 kcal/mol, pose order = 36).

Table 4 also shows the rank or pose order of the conformer with the optimal binding affinity; the number of red numbers in the pose-order columns is diagnostic (AD 3.0 is the worst in this sense). A comparison of Table 2 and Table 4 indicates that the performance of all the methods significantly improved, with the exception of two cases: (a) for 12/2VWC, Glide predicted an unacceptable RMSD value of 5.36; (b) for 14/2XX5, AD 3.0 gave an RMSD value of 4.81. Again, when only the macrolide subset (1, 5, 6, 8, 9, 11, 13, and 19) was considered, the average RMSD values changed slightly in relation to the general averages shown in Table 4:

2.6. Re-Scoring

Re-docking of the main poses with computational methods that provide binding affinities closer to experimental values has a long tradition. Among the successful re-scorings reported, we will mention a few that have been published very recently [39,40,41,42,43,44,45,46]. We realized that although AD Vina is a good compromise between speed and accuracy, we observed that the gaps between the predicted binding affinities for the best pose and for the next poses are minimal. In fact, many poses had very close scoring functions (often with differences of only 0.1 kal/mol). Re-scoring of the 20 molecules with X-Score software [47] did not increase the small gaps. Re-scoring the AD Vina poses with AD 4.2, which obviously has advantages regarding the data compatibility between programs, improved the gaps between the poses, as shown in Figure 4 for the case of 3/1NM6. It should be noted that many conformations lying between −10.9 and −10.5 kcal/mol, only a few of which are shown in Figure 4 for the sake of simplicity, gave rise to larger energy differences after re-scoring: from −10.84 to −8.61 kcal/mol, in the examples shown. The conformer depicted in green (that of the lowest energy, −10.9 kcal/mol, according to AD Vina, see Table 2) moved up to −9.18 kcal/mol. The conformer depicted in magenta, which was not the lowest-energy one according to AD Vina (but it was the best pose), became the most favorable after the re-scoring.
Figure 4

Re-scoring with AD 4.2 (right column) of AD Vina-predicted main poses of 3/1NM6.

We call this re-scoring AutoDock 4.2//AutoDock Vina, or just AD 4.2//Vina, as done in quantum chemistry from the very beginning for single-point calculations of the energy at a high level of theory on equilibrium geometries obtained at a lower level. Table 5 summarizes the results for the AD 4.2//Vina re-scoring of the full series.
Table 5

Re-scoring. AD 4.2//Vina results.

AD 4.2 (Table 4)AD 4.2//Vina
RMSDScorePoseRMSDScorePose
11ESV0.61−9.2950.47−8.681
21FKD1.09−10.2911.20−8.791
31NM60.89−12.1710.92−10.841
41NT10.85−11.6391.28−12.201
51PKF1.20−13.0581.23−12.661
61R8Q0.84−10.3130.98−9.901
71UU31.27−11.1611.00−11.511
81W960.69−12.2710.45−12.071
92C6H0.79−9.6950.58−10.151
102E9U0.79−8.9810.82−8.701
112IYA0.98−16.0811.23−14.991
122VWC0.79−9.3610.70−9.271
132XBK1.72−19.7311.19−17.101
142XX51.74−10.42431.59−10.524
153DV10.92−12.8060.97−12.451
163DV50.67−14.1481.05−13.921
173EKS0.71−12.9711.14−12.441
183QTF0.81−11.1490.98−11.991
193UYK1.56−9.77102.23−9.361
204DRU1.31−12.5611.07−13.201
mean0.97 ± 0.081.05 ± 0.09
SD0.340.39
median0.871.03
Comparison with preceding Tables indicates how reliable this approach is. The mean RMSD and standard deviation does not significantly change (from 0.97 ± 0.34 to 1.05 ± 0.39), whereas the pose order of the lowest-energy predicted conformer moves to one (with only one exception, see 14). In other words, with AD 4.2//Vina, all the best poses for each ligand except one also appeared to be the lowest-energy conformers, in sharp contrast with the results in Table 4 for AD 4.2 (repeated in Table 5 for the sake of a direct comparison) and AD Vina. This may seem a fortuitous fact, to be interpreted by the authors of AutoDock, but it was well known that the scoring functions of AD 4.2 are more reliable than those of AD Vina, while AD Vina runs more rapidly than AD 4.2. Although the average RMSD value increased slightly (see Table 5), it is lower than the average values of 1.31 and 1.69 in Table 2 for AD Vina and AD 4.2, respectively. Taking the 20 poses of lowest energy in each case, we also tested the re-scoring of the four poorest DOCK results in Table 4 with GBSA score, which uses the pairwise GB solvation model [48] with special parameters [49], and with Amber Score [50], which includes an implicit solvation and a short MD simulation that takes into account some flexibility of the receptor. Whereas no improvement was observed for 3/1NM6, poses were found for complexes 7/1UU3, 15/3DV1, and 19/3UYK which showed the optimal RMSD values and optimal relative energies, especially with the Amber Score (see Table 6).
Table 6

Re-scoring of DOCK and Glide.

DOCK 6.5
Re-ScoringRe-Scoring
RMSDGBSARMSDAmber
71UU31.69−77.371.10−58.20
153DV11.44−70.391.42−40.49
193UYK2.30−43.231.39−41.89
Glide
Re-ScoringRe-Scoring
RMSDXPRMSDMM-GBSA
21FKD1.35−9.151.47−108.69
92C6H1.11−7.001.49−113.55
122VWC0.95−6.800.95−91.98
132XBK1.04−14.540.83−153.96
142XX51.10−10.131.10−97.36
193UYK1.93−12.301.71−90.04
We also performed Glide XP calculations (eXtra Precision, as a matter of fact a docking procedure with more precise scoring functions) [51] and a re-scoring with the popular PRIME MM-GBSA module [52] for the six poorest results in Table 4, which were clearly improved with both programs in all cases (Table 6). Finally, we examined whether these poses, which are in principle more reliable, overlap with the experimental pose (crystalline structure) or not. The drawings and RMSD for the superimpositions (RMSD overlap) are given as Supplementary Material, but the key points are: With AD 4.2//Vina, 9 out of the 20 ligands overlap almost perfectly with their respective poses in the crystals (ligands 1, 3, 7, 8, 9, 10, 12, 17, and 18, RMSD overlap = 0.32–0.67 Å), 6 ligands can be superimposed moderately well with their crystalline structures, at least their cyclic moieties (RMSD overlap = 0.79–0.93), and there are four calculated structures that do not match sufficiently well (5, 11, 13, and 14, RMSD overlap = 1.04–1.10). The worst result is for ligand 19 (RMSD overlap = 2.14); we have no explanation for it, but the crystalline structure of the complex was poorly reproduced by most methods (see Table 2, Table 3, Table 4, Table 5 and Table 6). Three poses that improved after re-scoring of DOCK with Amber (Table 6) showed values of RMSD overlap for 7, 15, and 19 equal to 0.87, 1.12, and 0.75, respectively (two match partially and one not sufficiently well, also by visual inspection). For the six Glide poses re-scored with MM-GBSA (Table 6), the RMSD-overlap values were 1.13, 1.36, 0.69, 0.64, 0.73, and 0.67. In other words, three ligands (12, 13, and 19) were almost superimposable with their respective experimental structures, but there were two that did not match well. It is worth noting that 19/3UYK is better described by Glide (docking and re-scoring) than by AD 4.2 (docking and AD 4.2//Vina re-scoring, respectively).

3. Materials and Methods

3.1. Crystalline Complexes

The proteins selected for docking analysis satisfied the following criteria: they do not form covalent bonds with their respective ligands; the main rings have ≥11 members; and the structures were refined at resolutions <2.5 Å (Figure 1).

3.2. Preparation of the Ligand Files

Ligand coordinate files were extracted from the corresponding PDB files and used as reference structures for root mean square deviation (RMSD) calculations. The ligand input files were represented from their one-dimensional notations, SMILEs, with the aim of avoiding any bias from the bioactive conformation in conformer calculations. The 3D structures were generated with Corina [53] and were refined with LigPrep [54].

3.3. Conformer Calculations

Macrocycle from the MacroModel 10.0 package was used for the conformational searches [55]. OPLS2005 was the force field selected. The solvent effect was described by means of the GB/SA continuum solvent model. From the different methods of conformational search, we chose molecular dynamics/large-scale low-mode sampling (MD/LLMod) [56,57]. This MD/LLMod method [56] is based on a two-stage search strategy. The first stage is high temperature MD-based simulated annealing, while the second stage is a large-scale low-mode search (LLMod). The MD/LLMod parameters were obtained after 5000 simulated annealing cycles and then 5000 energy-minimization iterations, with a maximum of 42 kJ·mol−1 (ca. 10 kcal/mol) above the lowest-energy conformer.

3.4. Preparation of the Protein Files

The receptor input files were prepared as usual. First, we removed all water molecules, ligand atoms, and those ions that do not belong to the active site of the receptor from the corresponding PDB file. Next, hydrogen atoms were added, as well as protons, and amino acid side-chain partial charges (in accordance with the protonation state at the physiological pH) were assigned according to the requirements of each program. This was followed by a local minimization to relieve potential bad contacts. The minimization was performed in the presence of restraints to maintain the protein conformation very close to that observed in the experimental model. For those proteins with one cytochrome molecule close to the binding site (1PKF, 2C6H, and 2XBK), the Chimera USCF program [58] was used; the corresponding hydrogen atoms and the AM1-Bcc partial charges of each atom were added and the cytochrome molecule was then included.

3.5. Docking Algorithms

AutoDock programs use a stochastic search algorithm (Lamarckian Genetic Algorithm, LGA, for AutoDock 3.0 and 4.2.6, and Iterated Local Search, ILS, for AutoDock Vina and PSO Vina) [28,29,30,31]. DOCK and Glide use a systematic search algorithm [32,33].

3.6. Calculations of RMSD Values

Docking results were assessed with RMSD (root-mean-square deviation of the heavy atoms) of each predicted pose versus the crystal structure. The RMSD values were calculated with the rmsd.py script downloaded from the Script Center of the Schrödinger Resources website.

4. Conclusions

We evaluated the performance of some popular docking programs at reproducing X-ray crystallographic structures that contain macrolides and related poly-functionalized ligands. Thus, the conclusions that follow regarding the performance of each method refer only and exclusively to these macrocyclic substrates when bound to rigid receptors; any generalization to other types of complexes will or would require an independent study. To our initial surprise, the outcome was very satisfactory, as most of the programs arrived at optimal poses in spite of the complexity of the structures. In general, for our set of 20 complexes, Glide 6.6 (Glide), AutoDock Vina (AD Vina), and DOCK 6.5 (DOCK) were efficient docking procedures for macrolides and analogues of intermediate size (Table 2). For large structures with many conformational isomers, the computing times of the programs Glide and especially of DOCK grew exponentially. If only true macrolides are considered (eight examples available), the preceding sentences are still valid. AD 4.2 outperforms AD 3.0, as expected for an updated and improved version, but we needed to confirm that this was the case for macrolide-like ligands. Consequently, our results obtained with AD 3.0 more than 10 years ago should be revised and corrected. When the best poses were compared (Table 4), AD Vina, Glide, DOCK, and AD 4.2 gave mean RMSD values below 1.0 Å. Since the energies given by AD Vina for the different conformers are very close, re-scoring with AD 4.2 (here so-called AD 4.2//Vina) was performed. This increased the energy gaps between the different poses and the agreement between the computational results and crystalline structures significantly improved. In general, re-scorings of DOCK and Glide also improved the agreement, which could have been expected but it had to be demonstrated for macrolide-like compounds. We hope to extend this study to other docking programs and to take advantage of the present findings in connection with the design and synthesis of analogs of anti-tumor macrolides.
  47 in total

1.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes.

Authors:  Richard A Friesner; Robert B Murphy; Matthew P Repasky; Leah L Frye; Jeremy R Greenwood; Thomas A Halgren; Paul C Sanschagrin; Daniel T Mainz
Journal:  J Med Chem       Date:  2006-10-19       Impact factor: 7.446

2.  Rescoring docking hit lists for model cavity sites: predictions and experimental testing.

Authors:  Alan P Graves; Devleena M Shivakumar; Sarah E Boyce; Matthew P Jacobson; David A Case; Brian K Shoichet
Journal:  J Mol Biol       Date:  2008-01-30       Impact factor: 5.469

Review 3.  Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection--what can we learn from earlier mistakes?

Authors:  Johannes Kirchmair; Patrick Markt; Simona Distinto; Gerhard Wolber; Thierry Langer
Journal:  J Comput Aided Mol Des       Date:  2008-01-15       Impact factor: 3.686

4.  Consensus virtual screening approaches to predict protein ligands.

Authors:  Andreas Kukol
Journal:  Eur J Med Chem       Date:  2011-05-19       Impact factor: 6.514

Review 5.  Challenges and advances in computational docking: 2009 in review.

Authors:  Elizabeth Yuriev; Mark Agostino; Paul A Ramsland
Journal:  J Mol Recognit       Date:  2010-10-23       Impact factor: 2.137

6.  PSOVina: The hybrid particle swarm optimization algorithm for protein-ligand docking.

Authors:  Marcus C K Ng; Simon Fong; Shirley W I Siu
Journal:  J Bioinform Comput Biol       Date:  2015-02-10       Impact factor: 1.122

7.  The application of statistical methods to cognate docking: a path forward?

Authors:  Paul C D Hawkins; Brian P Kelley; Gregory L Warren
Journal:  J Chem Inf Model       Date:  2014-05-13       Impact factor: 4.956

8.  Synthesis of amphidinolide E C10-C26 fragment.

Authors:  Jorge Esteban; Anna M Costa; Jaume Vilarrasa
Journal:  Org Lett       Date:  2008-10-07       Impact factor: 6.005

9.  Comparing neural-network scoring functions and the state of the art: applications to common library screening.

Authors:  Jacob D Durrant; Aaron J Friedman; Kathleen E Rogers; J Andrew McCammon
Journal:  J Chem Inf Model       Date:  2013-07-11       Impact factor: 4.956

View more
  13 in total

1.  Macrocycle modeling in ICM: benchmarking and evaluation in D3R Grand Challenge 4.

Authors:  Polo C-H Lam; Ruben Abagyan; Maxim Totrov
Journal:  J Comput Aided Mol Des       Date:  2019-10-09       Impact factor: 3.686

2.  Hinokiflavone Attenuates the Virulence of Methicillin-Resistant Staphylococcus aureus by Targeting Caseinolytic Protease P.

Authors:  Xiangri Kong; Bingmei Wang; Xiaoyu Chen; Li Wang; Xingye Wang; Juan Hou; Lin Wei; Liyan Sui; Chi Zhang; Jiyu Guan; Yanhe Luan; Wei Wang; Wu Song; Yicheng Zhao
Journal:  Antimicrob Agents Chemother       Date:  2022-07-11       Impact factor: 5.938

3.  Platyphylloside, a potential inhibitor from epicarp of B. aegyptiaca against CYP450 protein in T. rubrum - In vitro and in silico approaches.

Authors:  Mohamed Hussain Syed Abuthakir; Munirah Abdullah Al-Dosary; Ashraf Atef Hatamleh; Hissah Abdulrahman Alodaini; P Perumal; Muthusamy Jeyam
Journal:  Saudi J Biol Sci       Date:  2022-03-12       Impact factor: 4.052

4.  Green Synthesis of Silver Nanoparticles Coated by Water Soluble Chitosan and Its Potency as Non-Alcoholic Hand Sanitizer Formulation.

Authors:  Ika O Wulandari; Baiq E Pebriatin; Vita Valiana; Saprizal Hadisaputra; Agus D Ananto; Akhmad Sabarudin
Journal:  Materials (Basel)       Date:  2022-07-01       Impact factor: 3.748

Review 5.  G-quadruplex virtual drug screening: A review.

Authors:  Robert C Monsen; John O Trent
Journal:  Biochimie       Date:  2018-06-30       Impact factor: 4.079

6.  Exploring safe and potent bioactives for the treatment of non-small cell lung cancer.

Authors:  Muthu Kumar Thirunavukkarasu; Woong-Hee Shin; Ramanathan Karuppasamy
Journal:  3 Biotech       Date:  2021-04-26       Impact factor: 2.406

7.  An Efficient ABC_DE_Based Hybrid Algorithm for Protein-Ligand Docking.

Authors:  Boxin Guan; Changsheng Zhang; Yuhai Zhao
Journal:  Int J Mol Sci       Date:  2018-04-13       Impact factor: 5.923

8.  Genetic algorithm with a crossover elitist preservation mechanism for protein-ligand docking.

Authors:  Boxin Guan; Changsheng Zhang; Jiaxu Ning
Journal:  AMB Express       Date:  2017-09-13       Impact factor: 3.298

9.  HIGA: A Running History Information Guided Genetic Algorithm for Protein-Ligand Docking.

Authors:  Boxin Guan; Changsheng Zhang; Yuhai Zhao
Journal:  Molecules       Date:  2017-12-15       Impact factor: 4.411

10.  Plant isoquinoline alkaloids as potential neurodrugs: A comparative study of the effects of benzo[c]phenanthridine and berberine-based compounds on β-amyloid aggregation.

Authors:  Daniela Marasco; Caterina Vicidomini; Pawel Krupa; Federica Cioffi; Pham Dinh Quoc Huy; Mai Suan Li; Daniele Florio; Kerensa Broersen; Maria Francesca De Pandis; Giovanni N Roviello
Journal:  Chem Biol Interact       Date:  2020-10-22       Impact factor: 5.192

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.