| Literature DB >> 20150996 |
Abstract
BACKGROUND: Molecular docking methods are commonly used for predicting binding modes and energies of ligands to proteins. For accurate complex geometry and binding energy estimation, an appropriate method for calculating partial charges is essential. AutoDockTools software, the interface for preparing input files for one of the most widely used docking programs AutoDock 4, utilizes the Gasteiger partial charge calculation method for both protein and ligand charge calculation. However, it has already been shown that more accurate partial charge calculation - and as a consequence, more accurate docking- can be achieved by using quantum chemical methods. For docking calculations quantum chemical partial charge calculation as a routine was only used for ligands so far. The newly developed Mozyme function of MOPAC2009 allows fast partial charge calculation of proteins by quantum mechanical semi-empirical methods. Thus, in the current study, the effect of semi-empirical quantum-mechanical partial charge calculation on docking accuracy could be investigated.Entities:
Year: 2009 PMID: 20150996 PMCID: PMC2820493 DOI: 10.1186/1758-2946-1-15
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Experimental data for the 54 protein-ligand complexes.
| PDB code | Protein | Res (Å) | Ligand | Structure Ref. |
|---|---|---|---|---|
| Penicillin acylase | 2.4 | m-nitrophenylacetic acid | [ | |
| Penicillin acylase | 2.3 | 2,5-dihydroxyphenylacetic acid | [ | |
| Heat shock protein 90 | 1.9 | ADP | [ | |
| Heat shock protein 90 | 2.5 | radicicol | [ | |
| Retionic acid binding protein | 2.9 | retinoic acid | [ | |
| Human thrombin | 2.9 | benzo [ | [ | |
| Human thrombin | 3.0 | benzo [ | [ | |
| a-Thrombin | 3.2 | benzamidine | [ | |
| Immunoglobulin | 1.9 | fluorescein | [ | |
| Human serum albumin | 2.4 | oleic acid | [ | |
| HIV-1 Protease | 2.0 | A78791 | [ | |
| HIV-1 Protease | 1.8 | XK263 | [ | |
| tRNA-guanine transglycosylase | 1.7 | quinazoline derivative | [ | |
| p38 MAP kinase | 2.5 | pyrazol derivative | [ | |
| Adipocyte lipid-binding protein | 1.6 | stearic acid | [ | |
| Dialkylglycine decarboxylase | 2.2 | 1-aminocyclo-pentanephosphonate | [ | |
| Dialkylglycine decarboxylase | 2.0 | S-1-amino-ethanephosphonate | [ | |
| Branched-chain alpha-ketoacid dehydrogenase kinase | 1.9 | thiamin diphosphate | [ | |
| cAMP-dependent protein kinase | 2.0 | (R)-trans-4-(1-aminoethyl)-n-(4-pyridyl) cyclohexanecarboxamide | [ | |
| Retinol-binding protein | 2.0 | retinol | [ | |
| tRNA-guanine transglycosylase | 2.0 | 2-aminoquinazolin-4(3H)-one | [ | |
| Alpha-amylase | 2.0 | 5-trihydroxy-6-hydroxymethyl-piperidin-2-one | [ | |
| Purine nucleoside phosphorylase | 2.8 | guanine | [ | |
| Beta-glycosidase | 2.0 | D-galactohydroximo-1,5-lactam | [ | |
| 3-Phosphoshikimate 1-carboxyvinyltransferase | 1.5 | phosphonate analogue | [ | |
| Alpha-amylase | 2.2 | acarbose derived hexasaccharide | [ | |
| cAMP-dependent protein kinase | 2.3 | n-[2-(4-bromocinnamylamino)ethyl]-5-isoquinoline | [ | |
| Dialkylglycine decarboxylase | 2.0 | pyridoxamine 5-phosphate | [ | |
| Acetylcholinesterase | 2.4 | edrophonium ion | [ | |
| p38alpha Map kinase | 2.3 | pyrazol derivative | [ | |
| p38alpha Map kinase | 2.2 | nicotinamid derivative | [ | |
| Beta-glycosidase | 2.1 | glucoimidazole | [ | |
| Beta-glycosidase | 2.0 | phenethyl-substituted glucoimidazole | [ | |
| Immunoglobulin | 2.2 | N-trisubstituted guanidine | [ | |
| Cytochrome P-450cam | 1.6 | camphor | [ | |
| RNA-dependent RNA polymerase | 2.0 | non-nucleoside analogue inhibitor I | [ | |
| RNA-dependent RNA polymerase | 1.8 | non-nucleoside analogue inhibitor II | [ | |
| Beta-secretase | 2.5 | amino-ethylene inhibitor | [ | |
| Beta-secretase | 1.9 | valinamide derivative | [ | |
| D-galactose/D-glucose binding protein | 1.9 | glucose | [ | |
| Fatty-acid-binding protein | 2.0 | palmitic acid | [ | |
| Heat shock protein 82 | 1.5 | synthetic macrolactone | [ | |
| Beta-glycosidase | 2.1 | deoxynojirimycin | [Gloster, to be published] | |
| Beta-glycosidase | 1.7 | gluco-hydroximolactam | [Gloster, to be published] | |
| 3-Phosphoshikimate 1-carboxyvinyltransferase | 1.6 | shikimate-3-phosphate | [ | |
| Neuraminidase | 2.0 | sialic acid | [ | |
| Neuraminidase | 2.0 | 4-amino-Neu5Ac2en | [ | |
| Rhinovirus 14 coat protein | 3.0 | W71, antiviral agent | [ | |
| Xylose isomerase | 1.6 | D-xylitol | [ | |
| Triose phosphate isomerase | 2.5 | 2-phosphoglycolic acid | [ | |
| b-Trypsin | 1.8 | benzamidine | [ | |
| Hemagglutinin | 3.0 | sialic acid | [ | |
| Arabinose-binding protein | 1.7 | fucose | [ | |
Figure 1Correlation between experimental and predicted binding energies using Gasteiger and PM6 charge calculations.
Results of docking calculations using Gasteiger charges on both the ligand and the protein.
| PDB code of the complex | RMSD of the first rank result as compared to the X-ray (Å) | Energy rank of the lowest RMSD hit | Frequency rank/frequency of the best geometry | Lowest RMSD (Å) | Estimated free energy of binding (kcal/mol) | Experimental binding energy (kcal/mol) |
|---|---|---|---|---|---|---|
| 1.21 | 0.65 | -5.11 | -5.08 | |||
| 2.56 | 2 | 1.09 | -4.40 | -3.05 | ||
| 1.68 | 1.59 | -5.62 | -6.19 | |||
| 1.03 | 0.71 | -6.27 | -11.69 | |||
| 1.00 | 0.92 | -7.94 | -10.58 | |||
| 2.68 | 2 | 2/19 | 1.23 | -10.33 | -8.88 | |
| 2.29 | 2 | 2/20 | 1.08 | -10.08 | -8.73 | |
| 9.59 | 3 | 0.53 | -6.24 | -3.98 | ||
| 0.60 | 0.53 | -10.76 | -10.98 | |||
| 1.56 | 1.10 | -7.73 | -11.01 | |||
| 3.12 | - | -10.03 | -14.27 | |||
| 0.65 | 0.54 | -14.74 | -12.97 | |||
| 2.82 | 4 | 1.66 | -6.98 | -7.99 | ||
| 0.64 | 0.56 | -8.91 | -8.10 | |||
| 2.33 | 4 | 1.08 | -5.47 | -9.65 | ||
| 3.91 | -8.22 | -3.03 | ||||
| 2.24 | 5 | 33/1 | 1.46 | -5.99 | -5.30 | |
| 1.34 | 4/10 | 0.74 | -6.96 | -6.01 | ||
| 1.88 | 1.41 | -8.29 | -6.49 | |||
| 0.84 | 0.56 | -9.15 | -9.17 | |||
| 4.29 | 2 | 2/11 | 0.32 | -5.29 | -10.50 | |
| 2.49 | 2/14 | 1.73 | -8.14 | -6.28 | ||
| 1.88 | 1.83 | -4.66 | -7.20 | |||
| 2.36 | 0.87 | -6.77 | -8.14 | |||
| 1.24 | 0.87 | -4.17 | -8.36 | |||
| 2.92 | -11.12 | -10.80 | ||||
| 2.2 | 2 | 0.74 | -10.14 | -9.98 | ||
| 2.71 | - | -4.00 | -4.39 | |||
| 3.96 | 8 | 8/1 | 1.07 | -4.72 | -8.97 | |
| 0.68 | 0.42 | -12.40 | -11.46 | |||
| 1.28 | 16/2 | 1.28 | -13.78 | -10.14 | ||
| 7.14 | -4.66 | -9.92 | ||||
| 4.86 | -7.06 | -10.94 | ||||
| 0.87 | 0.50 | -10.68 | -9.89 | |||
| 1.56 | 1.21 | -6.35 | -8.28 | |||
| 0.87 | 0.83 | -9.43 | -9.44 | |||
| 4.68 | 3 | 2/20 | 0.69 | -8.89 | -9.06 | |
| 2.35 | 5 | 73/1 | 1.68 | -13.67 | -10.35 | |
| 3.13 | -7.57 | -12.99 | ||||
| 0.99 | 0.92 | -4.18 | -10.37 | |||
| 2.03 | 2 | 1.10 | -6.03 | -7.41 | ||
| 0.88 | 0.55 | -6.66 | -9.11 | |||
| 1.9 | 1.85 | -7.52 | -6.67 | |||
| 0.83 | 2/11 | 0.83 | -6.41 | -8.75 | ||
| 2.75 | 3 | 3/1 | 1.02 | -6.29 | -5.70 | |
| 4.38 | 13 | 6/5 | 1.17 | -5.40 | -3.74 | |
| 1.36 | 0.97 | -6.80 | -6.62 | |||
| 2.06 | 2 | 3/21 | 1.02 | -8.97 | -8.87 | |
| 1.20 | 1.15 | -5.43 | -7.94 | |||
| 4.73 | - | -3.09 | -6.57 | |||
| 1.35 | 0.51 | -5.66 | -6.47 | |||
| 1.82 | 1.02 | -4.78 | -3.48 | |||
| 2.15 | 2 | 2/13 | 1.64 | -4.58 | -8.78 | |
The average value of the RMSD between the lowest energy result and experimental structure and the total number of successful first rank predictions (based on lowest energy and highest cluster population, respectively) are indicated at the bottom of the table.
Results of docking calculations using semi-empirical charges on both the ligand and the protein.
| PDB code of the complex | RMSD of the first rank result as compared to the X-ray (Å) | Energy rank of the lowest RMSD hit | Frequency rank/frequency of the best geometry | Lowest RMSD (Å) | Estimated free energy of binding (kcal/mol) | Experimental binding energy (kcal/mol) |
|---|---|---|---|---|---|---|
| 1.22 | 0.65 | -5.06 | -5.08 | |||
| 2.73 | 2 | 1.07 | -4.82 | -3.05 | ||
| 2.13 | 0.87 | -5.67 | -6.19 | |||
| 1.03 | 0.70 | -6.50 | -11.69 | |||
| 1.01 | 0.77 | -7.42 | -10.58 | |||
| 2.77 | 2 | 2/40 | 0.85 | -11.47 | -8.88 | |
| 2.09 | 3 | 0.99 | -10.37 | -8.73 | ||
| 0.72 | 0.54 | -4.87 | -3.98 | |||
| 0.39 | 0.39 | -8.54 | -10.98 | |||
| 3.01 | 2 | 1.01 | -6.80 | -11.01 | ||
| 1.61 | 1.61 | -10.45 | -14.27 | |||
| 1.21 | 1.00 | -11.67 | -12.97 | |||
| 2.83 | 4 | 1.42 | -7.78 | -7.99 | ||
| 0.6 | 0.58 | -9.33 | -8.10 | |||
| 3.1 | 4 | 3/17 | 0.93 | -4.51 | -9.65 | |
| 1.18 | 1.02 | -7.65 | -3.03 | |||
| 1.03 | 0.95 | -7.15 | -5.30 | |||
| 0.89 | 0.61 | -9.14 | -6.01 | |||
| 1.9 | 1.41 | -7.58 | -6.49 | |||
| 0.85 | 0.56 | -8.46 | -9.17 | |||
| 4.29 | 2 | 0.33 | -4.65 | -10.50 | ||
| 0.86 | 0.80 | -9.82 | -6.28 | |||
| 0.7 | 0.64 | -5.09 | -7.20 | |||
| 2.35 | 0.71 | -6.47 | -8.14 | |||
| 1.26 | 0.96 | -3.10 | -8.36 | |||
| 1.81 | 2/10 | 1.30 | -15.97 | -10.80 | ||
| 0.79 | 0.45 | -8.05 | -9.98 | |||
| 2.7 | 14 | 2/26 | 0.96 | -6.76 | -4.39 | |
| 3.98 | 4 | 2/22 | 0.94 | -5.25 | -8.97 | |
| 0.66 | 0.55 | -12.50 | -11.46 | |||
| 1.21 | 0.64 | -14.20 | -10.14 | |||
| 0.85 | 0.83 | -6.23 | -9.92 | |||
| 2.61 | 2/15 | 0.96 | -7.76 | -10.94 | ||
| 0.9 | 0.68 | -10.05 | -9.89 | |||
| 1.21 | 1.21 | -6.26 | -8.28 | |||
| 0.68 | 0.54 | -7.11 | -9.44 | |||
| 3.86 | 3 | 0.63 | -7.60 | -9.06 | ||
| 0.75 | 0.68 | -14.80 | -10.35 | |||
| 1.75 | 3/5 | 1.45 | -8.64 | -12.99 | ||
| 0.89 | 0.87 | -5.60 | -10.37 | |||
| 2.05 | 2 | 1.03 | -4.99 | -7.41 | ||
| 0.87 | 0.55 | -6.76 | -9.11 | |||
| 2.01 | 1.60 | -7.62 | -6.67 | |||
| 0.83 | 2/27 | 0.73 | -6.13 | -8.75 | ||
| 2.74 | 4 | 2/37 | 0.94 | -6.29 | -5.70 | |
| 4.35 | 12 | 2/15 | 0.90 | -6.64 | -3.74 | |
| 2.57 | 2 | 0.91 | -7.37 | -6.62 | ||
| 1.99 | 2 | 2/8 | 0.92 | -8.84 | -8.87 | |
| 1.17 | 1.14 | -6.93 | -7.94 | |||
| 1.19 | 0.78 | -2.62 | -6.57 | |||
| 1.31 | 0.74 | -5.06 | -6.47 | |||
| 1.04 | 0.85 | -6.92 | -3.48 | |||
| 2.09 | 1.16 | -5.58 | -8.78 | |||
QASP parameter was modified from 0.01097 to 0.00679 in AutoGrid and AutoDock. The average value of the RMSD between the lowest energy result and experimental structure and the total number of successful first rank predictions (based on lowest energy and highest cluster population, respectively) are indicated at the bottom of the table.
Figure 2Best cluster rank docking results of redocking of the PDB entry 2FDP. Protein surfaces are colored by partial charges (a, PM6 charges, RMSD from coordinates in PDB: 0.75 b, Gasteiger charges, RMSD from coordinates in PDB: 2.35). The darker color of the protein surface colored by PM6 partial charges as compared to the colors of Figure 2b reflects the higher calculated absolute value of semi-empirical partial charges. This "sharper" surface defines the possible binding geometry of the ligand more, than in the case of Gasteiger charges.
Figure 3Performance of the PM6 charge calculation in docking experiments, compared to Gasteiger method. The graph shows the number of complexes within a given RMSD of the crystallographic structure. In each case, the conformation of the most favorable estimated energy is used as the predicted conformation.