| Literature DB >> 24276169 |
Yoshifumi Fukunishi1, Haruki Nakamura.
Abstract
We previously developed the direct interaction approximation (DIA) method to estimate the protein-ligand binding free energy (DG). The DIA method estimates the DG value based on the direct van der Waals and electrostatic interaction energies between the protein and the ligand. In the current study, the effect of the entropy of the ligand was introduced with protein dynamic properties by molecular dynamics simulations, and the interaction between each residue of the protein and the ligand was also weighted considering the hydration of each residue. The molecular dynamics simulation of the apo target protein gave the hydration effect of each residue, under the assumption that the residues, which strongly bind the water molecules, are important in the protein-ligand binding. These two effects improved the reliability of the DIA method. In fact, the parameters used in the DIA became independent of the target protein. The averaged error of DG estimation was 1.3 kcal/mol and the correlation coefficient between the experimental DG value and the calculated DG value was 0.75.Entities:
Year: 2013 PMID: 24276169 PMCID: PMC3817721 DOI: 10.3390/ph6050604
Source DB: PubMed Journal: Pharmaceuticals (Basel) ISSN: 1424-8247
Estimated binding free energies by several models and the experimental value (ΔG).
| PDB ID | |||||||
|---|---|---|---|---|---|---|---|
| Equation (1) | Equation (3) | Equation (10) | Equation (11) | Equation (12) | Equation (9,12) | ||
| 1abf | −7.39 | −6.44 | −7.46 | −7.33 | −6.35 | −7.16 | −7.68 |
| 1apu | −10.50 | −9.70 | −10.70 | −9.00 | −9.70 | −9.30 | −10.45 |
| 1dbb | −12.27 | −11.89 | −11.25 | −12.08 | −11.67 | −13.09 | −12.80 |
| 1dbj | −10.47 | −11.28 | −10.39 | −11.35 | −11.07 | −9.24 | −9.19 |
| 1dog | −5.48 | −6.45 | −8.58 | −8.00 | −6.38 | −8.28 | −8.64 |
| 1dwb | −3.98 | −5.04 | −5.16 | −5.24 | −4.92 | −5.57 | −5.65 |
| 1epo | −10.85 | −12.49 | −12.35 | −13.13 | −12.53 | −10.79 | −10.85 |
| 1etr | −10.09 | −10.95 | −9.86 | −9.90 | −10.87 | −10.38 | −10.23 |
| 1ets | −11.62 | −10.75 | −10.21 | −10.48 | −10.62 | −10.43 | −9.51 |
| 1ett | −8.44 | −12.04 | −10.87 | −10.42 | −11.76 | −8.44 | −10.53 |
| 1hpv | −12.57 | −13.29 | −13.32 | −12.78 | −13.33 | −12.22 | −13.15 |
| 1hsl | −9.96 | −6.79 | −7.86 | −7.26 | −6.74 | −5.43 | −7.74 |
| 1htf | −11.04 | −12.10 | −10.45 | −11.48 | −12.13 | −11.19 | −11.97 |
| 1hvr | −12.97 | −15.58 | −14.97 | −15.33 | −15.63 | −14.42 | −15.18 |
| 1nsd | −7.23 | −8.76 | −9.21 | −9.19 | −8.65 | −10.07 | −9.92 |
| 1pgp | −7.77 | −9.81 | −9.10 | −8.99 | −9.56 | −6.98 | −8.00 |
| 1phg | −11.81 | −9.63 | −9.57 | −10.59 | −9.53 | −9.58 | −11.04 |
| 1ppc | −8.80 | −9.09 | −8.55 | −9.44 | −9.10 | −8.40 | −9.56 |
| 1pph | −8.49 | −7.83 | −7.46 | −8.13 | −7.81 | −7.63 | −8.51 |
| 1rbp | −9.17 | −9.10 | −9.62 | −9.74 | −9.11 | −9.04 | −9.76 |
| 1tng | −4.00 | −5.03 | −5.39 | −5.48 | −4.98 | −4.82 | −2.64 |
| 1tnh | −4.59 | −4.89 | −5.53 | −5.26 | −4.83 | −4.78 | −5.52 |
| 1ulb | −7.23 | −6.18 | −5.90 | −6.06 | −5.99 | −6.10 | −6.25 |
| 2cgr | −9.92 | −12.21 | −11.20 | −11.16 | −11.99 | −11.19 | −8.41 |
| 2gbp | −10.36 | −7.55 | −9.23 | −8.63 | −7.45 | −10.09 | −9.37 |
| 2ifb | −7.41 | −8.13 | −7.89 | −7.08 | −8.15 | −8.63 | −7.48 |
| 2phh | −6.38 | −7.04 | −7.57 | −7.31 | −6.83 | −8.47 | −7.95 |
| 2r04 | −8.48 | −10.72 | −10.58 | −10.29 | −10.71 | −12.11 | −10.48 |
| 2tsc | −11.62 | −8.63 | −9.97 | −8.90 | −8.75 | −9.76 | −8.09 |
| 2ypi | −6.58 | −5.87 | −6.53 | −6.20 | −5.76 | −7.16 | −6.64 |
| 3ptb | −6.46 | −4.17 | −4.75 | −4.75 | −4.12 | −5.59 | −5.11 |
| 4dfr | −13.23 | −8.35 | −7.96 | −8.16 | −8.36 | −9.25 | −8.14 |
| 5abp | −9.05 | −6.86 | −8.12 | −7.46 | −6.77 | −8.87 | −8.21 |
| Average Error | 1.58 | 1.36 | 1.39 | 1.48 | 1.26 | 1.31 | |
| SD a | 1.88 | 1.66 | 1.68 | 1.86 | 1.70 | 1.72 | |
| Correlation coefficient | 0.59 | 0.75 | 0.76 | 0.76 | 0.75 | 0.75 | |
| Average Error (MLR) b | 1.42 | 1.23 | 1.23 | 1.32 | 1.13 | 1.17 | |
a: standard deviation of the difference between calculated and measured binding free energy. b: We also applied the multiple linear regression (MLR) to the 34 protein-ligand complex data. “Average Error (MLR)” is the averaged error obtained by the MLR. The error of the MLR is always smaller than the error obtained by the cross−validation test.
Figure 1Cross-validation results obtained by the DIAV, DIAV_LW and DIAV_LC methods: The experimental data (ΔGexptl) and the calculated values (ΔGcalc). Open circles, filled circles and filled triangles represent the results obtained by the DIAV, DIAV_LW and DIAV_LC methods, respectively.
Summary of parameters determined by the cross-validation tests.
| Average | 0.0341719 | 0.0017533 | −0.0002198 | 0.0000000 |
| Deviation (σ) | 0.0005495 | 0.0011874 | 0.0000087 | 0.0000000 |
| Min | 0.0323511 | −0.0038807 | −0.0002438 | 0.0000000 |
| Max | 0.0357564 | 0.0049798 | −0.0002027 | 0.0000000 |
| Negative value | 0.0000000 | 0.0285714 | 1.0000000 | 0.0000000 |
|
|
|
|
|
|
| Average | 0.0370196 | 0.0029651 | −0.0000050 | 0.1749169 |
| Deviation (σ) | 0.0007599 | 0.0008450 | 0.0000002 | 0.0190521 |
| Min | 0.0350933 | −0.0000641 | −0.0000054 | 0.1132383 |
| Max | 0.0396249 | 0.0047204 | −0.0000045 | 0.2325974 |
| Negative value | 0.0000000 | 0.0285714 | 1.0000000 | 0.0000000 |
|
|
|
|
|
|
| Average | 0.0346823 | 0.0021929 | −0.0002054 | 0.0000000 |
| Deviation (σ) | 0.0005388 | 0.0011036 | 0.0000083 | 0.0000000 |
| Min | 0.0329273 | −0.0030242 | −0.0002290 | 0.0000000 |
| Max | 0.0362899 | 0.0050095 | −0.0001878 | 0.0000000 |
| Negative value | 0.0000000 | 0.0285714 | 1.0000000 | 0.0000000 |
|
|
|
|
|
|
| Average | 0.0413163 | 0.0062033 | −0. 0000067 | 0.1536118 |
| Deviation (σ) | 0.0007382 | 0.0007907 | 0. 0000002 | 0.0140040 |
| Min | 0.0392677 | 0.0034216 | −0. 0000071 | 0.1254044 |
| Max | 0.0434480 | 0.0087162 | −0. 0000063 | 0.1944447 |
| Negative value | 0.0000000 | 0.0000000 | 1.0000000 | 0.0000000 |
|
|
|
|
|
|
| Average | 0.0343046 | 0.0042958 | −0.0000070 | 0.1143295 |
| Deviation (σ) | 0.0006129 | 0.0011950 | 0.0000002 | 0.0097216 |
| Min | 0.0321378 | 0.0002835 | −0.0000076 | 0.0942835 |
| Max | 0.0363001 | 0.0090566 | −0.0000067 | 0.1414504 |
| Negative value | 0.0000000 | 0. 0000000 | 1.0000000 | 0.0000000 |
|
|
|
|
|
|
| Average | 0.0392333 | 0.0030804 | −0.0000053 | 0.0000000 |
| Deviation (σ) | 0.0005573 | 0.0010426 | 0.0000002 | 0.0000000 |
| Min | 0.0375654 | −0.0017236 | −0.0000056 | 0.0000000 |
| Max | 0.0409116 | 0.0055269 | −0.0000049 | 0.0000000 |
| Negative value | 0.0000000 | 0.0285714 | 1.0000000 | 0.0000000 |
Figure 2Time dependence of the DIAV, DIAV_LW and DIAV_LC results. Filled circles, open squares and open circles represent the results obtained by the DIAV, DIAV_LW and DIAV_LC methods, respectively. The averaged error is plotted vs. the sampling-time length.
The binding free energies estimated for the non-active compounds (kcal/mol).
| PDB ID |
|
| ||||||
|---|---|---|---|---|---|---|---|---|
| Original ligand | Alprenolol | Difference a | Fenoterol | Difference a | Cetirizine | Difference a | ||
| 1abe2 | −9.57 | −8.06 | −6.85 | −1.22 | −8.21 | 0.14 | −9.28 | 1.22 |
| 1abf1 | −7.39 | −8.40 | −6.13 | −2.27 | −6.72 | −1.68 | −7.93 | −0.47 |
| 1apu | −10.50 | −11.63 | −2.77 | −8.86 | −4.50 | −7.12 | −5.69 | −5.93 |
| 1cbx | −8.65 | −8.89 | −5.84 | −3.04 | −7.51 | −1.38 | −8.30 | −0.58 |
| 1dog | −5.48 | −9.05 | −5.18 | −3.87 | −7.75 | −1.30 | −5.08 | −3.97 |
| 1dwb | −3.98 | −5.45 | −5.44 | −0.01 | −6.56 | 1.11 | −8.24 | 2.80 |
| 1ebg | −14.76 | −6.74 | 0.00 | −6.74 | 0.00 | −6.74 | 0.00 | −6.74 |
| 1epo | −10.85 | −14.42 | −5.64 | −8.78 | −7.30 | −7.12 | −8.49 | −5.93 |
| 1rbp | −9.17 | −8.76 | N.D.b | N.D. b | N.D. b | N.D. b | −8.69 | −0.08 |
| 1stp | −18.27 | −6.59 | N.D. b | N.D. b | N.D. b | N.D. b | −5.96 | −0.63 |
| 1tnh | −4.59 | −5.59 | −4.39 | −1.20 | −5.62 | 0.03 | −6.13 | 0.54 |
| 1ulb | −7.23 | −6.19 | −5.45 | −0.74 | −6.23 | 0.04 | −8.98 | 2.79 |
| 2gbp | −10.36 | −10.14 | −7.16 | −2.98 | −8.74 | −1.40 | −10.24 | 0.11 |
| 2ifb | −7.41 | −8.60 | −5.81 | −2.79 | −7.09 | −1.51 | −9.01 | 0.41 |
| 2tsc | −11.62 | −8.23 | −5.68 | −2.55 | −6.48 | −1.75 | −8.69 | 0.47 |
| 2ypi | −6.58 | −6.92 | −4.68 | −2.24 | N.D. b | N.D. b | N.D. b | N.D. b |
| 3ptb | −6.46 | −4.96 | −4.49 | −0.48 | −5.89 | 0.93 | −5.64 | 0.68 |
| 4dfr | −13.22 | −8.42 | −5.16 | −3.26 | −5.64 | −2.79 | −6.66 | −1.76 |
| 6cpa | −15.71 | −11.68 | −6.82 | −4.86 | −7.77 | −3.91 | −9.75 | −1.93 |
| Average | −9.57 | −8.35 | −5.15 | −3.29 | −6.38 | −2.15 | −7.38 | −1.06 |
a: the energy difference between the calculated ΔG of the original ligand and the non-active ligand.b: Not Determined, because the pocket sizes of the ligand−binding sites of the proteins were too small for these ligands to be docked, and so MD simulations were not possible for them.
ΔG values (kcal/mol) of the same target proteins and Spearman’s rank correlations.
| Thrombin | Error | Error | Error | ||||
|---|---|---|---|---|---|---|---|
| 1dwb | −3.98 | −5.15 | 1.17 | −5.02 | 1.04 | −5.57 | 1.59 |
| 1ett | −8.44 | −9.9 | 1.46 | −9.74 | 1.31 | −9.81 | 1.37 |
| 1etr | −10.09 | −9.9 | 0.19 | −9.89 | 0.2 | −10.22 | 0.13 |
| 1ets | −11.62 | −10.9 | 0.72 | −10.76 | 0.86 | −10.46 | 1.16 |
| Averaged error (kcal/mol) | - | - | 0.89 | - | 0.85 | - | 1.06 |
| SDa |
|
| 1.01 |
| 0.95 |
| 1.20 |
| Correlation coefficient |
|
| 0.97 |
| 0.97 |
| 0.96 |
| Spearman’s rank correlation |
|
| 1 |
| 1 |
| 1 |
|
|
|
|
|
|
|
|
|
| 1k6p | −8.84 | −11.71 | 2.87 | −11.74 | 2.90 | −11.78 | 2.94 |
| 1ajv | −10.59 | −10.36 | 0.23 | −10.39 | 0.20 | −10.13 | 0.46 |
| 1ajx | −10.86 | −9.89 | 0.97 | −9.91 | 0.95 | −9.68 | 1.18 |
| 1hih | −10.97 | −11.67 | 0.70 | −11.67 | 0.70 | −11.73 | 0.76 |
| 1htf | −11.04 | −11.57 | 0.53 | −11.59 | 0.55 | −11.86 | 0.82 |
| 1aaq | −11.45 | −13.15 | 1.70 | −13.13 | 1.68 | −12.96 | 1.51 |
| 1hpv | −12.57 | −12.79 | 0.22 | −12.87 | 0.30 | −13.06 | 0.49 |
| 1hvr | −12.97 | −14.79 | 1.82 | −14.93 | 1.96 | −14.65 | 1.68 |
| 1hvk | −13.79 | −13.63 | 0.16 | −13.65 | 0.14 | −13.70 | 0.09 |
| 1vj | −14.62 | −12.82 | 1.80 | −12.85 | 1.77 | −12.89 | 1.73 |
| 1dif | −14.63 | −13.76 | 0.87 | −13.77 | 0.86 | −13.82 | 0.81 |
| Averaged error (kcal/mol) | - | - | 1.08 | - | 1.09 | - | 1.13 |
| SDa | - | - | 1.36 | - | 1.37 | - | 1.37 |
| Correlation coefficient | - | - | 0.68 | - | 0.67 | - | 0.68 |
| Spearman’s rank correlation | - | - | 0.78 | - | 0.75 | - | 0.81 |
|
|
|
|
|
|
|
|
|
| 1tng | −4.00 | −5.45 | 1.45 | −5.36 | 1.37 | −2.69 | 1.31 |
| 1tnh | −4.59 | −5.29 | 0.70 | −5.20 | 0.61 | −5.50 | 0.91 |
| 3ptb | −6.46 | −4.92 | 1.54 | −4.83 | 1.63 | −5.15 | 1.31 |
| 1pph | −8.48 | −8.32 | 0.16 | −8.30 | 0.18 | −8.51 | 0.02 |
| 1ppc | −8.80 | −9.32 | 0.52 | −9.31 | 0.51 | −9.53 | 0.72 |
| Averaged error (kcal/mol) | - | - | 0.88 | - | 0.86 | - | 0.86 |
| SDa | - | - | 1.03 | - | 1.02 | - | 0.98 |
| Correlation coefficient | - | - | 0.86 | - | 0.86 | - | 0.93 |
| Spearman’s rank correlation | - | - | 0.60 | - | 0.60 | - | 0.90 |
a: standard deviation of the difference between calculated and measured binding free energy.
List of the proteins and ligands used.
| PDB ID | Protein | Ligand | MW | HA | HD |
|---|---|---|---|---|---|
| 1abe | 150.1 | 5 | 4 | ||
| 1abf | 161.2 | 5 | 4 | ||
| 1apu | acid proteinase (penicillopepsin) | pepstatin | 485.7 | 6 | 4 |
| 1dbb | Fab' fragment | progesterone | 314.5 | 2 | 0 |
| 1dbj | Fab' fragment | aetiocholanolone | 290.4 | 2 | 1 |
| 1dog | glucoamylase | deoxynojirimycin | 163.2 | 4 | 5 |
| 1dwb | thrombin | benzamidine | 120.2 | 0 | 2 |
| 1epo | endothia aspartic proteinase | 131.1 | 5 | 6 | |
| 1etr | thrombin | MQPA | 509.2 | 5 | 5 |
| 1ets | thrombin | NAPAP | 522.6 | 4 | 4 |
| 1ett | thrombin | 4-tapap | 429.6 | 3 | 3 |
| 1hpv | HIV-1 protease | amprenavir | 505.6 | 6 | 3 |
| 1hsl | Histidine-binding protein | Histidine | 156.2 | 3 | 2 |
| 1htf | HIV-1 protease | GR126045 | 574.7 | 4 | 5 |
| 1hvr | HIV-1 protease | XK263 | 606.8 | 3 | 2 |
| 1nsd | neuraminidase | neuraminic acid | 290.2 | 8 | 5 |
| 1pgp | 6-phosphogluconate dehydrogenase | 6-phosphogluconic acid | 276.1 | 10 | 4 |
| 1phg | cytochrome P450 | metyrapone | 226.3 | 3 | 0 |
| 1ppc | trypsin | Napap | 533.6 | 4 | 4 |
| 1pph | trypsin | 3-Tapap | 429.6 | 3 | 3 |
| 1rbp | retinol-binding protein | retinol | 286.5 | 1 | 1 |
| 1tng | trypsin | aminomethylcyclohexane | 114.2 | 0 | 1 |
| 1tnh | trypsin | 4-fluorobenzylamine | 126.2 | 0 | 1 |
| 1ulb | purine nucleoside phosphorylase | guanine | 151.1 | 3 | 3 |
| 2cgr | Igg2b (KAPPA) Fab fragment | guanidineacetic acid | 384.4 | 3 | 3 |
| 2gbp | 180.2 | 6 | 5 | ||
| 2ifb | intestinal fatty acid binding protein | palmitic acid | 256.4 | 2 | 0 |
| 2phh | 138.1 | 3 | 1 | ||
| 2r04 | rhinovirus 14 (HRV14) | W71 | 342.4 | 5 | 0 |
| 2tsc | thymidylate synthase | 10-propargyl-5,8-dideazafolic acid | 477.5 | 7 | 3 |
| 2ypi | triose phosphate isomerase | 2-phosphoglycolate | 156.0 | 6 | 0 |
| 3ptb | trypsin | benzamidine | 120.2 | 0 | 2 |
| 4dfr | dihydrofolate reductase | methotrexate | 454.4 | 9 | 3 |
| 5abp | 180.2 | 6 | 5 |
MW: Molecular weight (Da); HA: Number of hydrogen bond acceptors; HD: Number of hydrogen bond donors.