| Literature DB >> 20398404 |
Tiejun Cheng1, Zhihai Liu, Renxiao Wang.
Abstract
BACKGROUND: Current scoring functions are not very successful in protein-ligand binding affinity prediction albeit their popularity in structure-based drug designs. Here, we propose a general knowledge-guided scoring (KGS) strategy to tackle this problem. Our KGS strategy computes the binding constant of a given protein-ligand complex based on the known binding constant of an appropriate reference complex. A good training set that includes a sufficient number of protein-ligand complexes with known binding data needs to be supplied for finding the reference complex. The reference complex is required to share a similar pattern of key protein-ligand interactions to that of the complex of interest. Thus, some uncertain factors in protein-ligand binding may cancel out, resulting in a more accurate prediction of absolute binding constants.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20398404 PMCID: PMC2868011 DOI: 10.1186/1471-2105-11-193
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Illustration of our algorithm for pharmacophore elucidation.
Figure 2The pharmacophore mode of the CDK2-staurosporine complex (PDB entry 1AQ1) elucidated by our algorithm. (A) Binding mode of staurosporine to CDK2. (B) The grids with significant contributions to binding. (C) Grids after refinement. (D) The final pharmacophore model. In each figure, features of hydrogen bond donor, hydrogen bond acceptor, and hydrophobic center are represented by dots/balls in green, red, and cyan, respectively.
Figure 3Illustration of the algorithm for finding the common features between two pharmacophore models .
PDB codes of the protein-ligand complexes in the three test sets
Statistical results produced by two scoring functions alone on the three test sets
| X-Score | PLP | |||||
|---|---|---|---|---|---|---|
| 0.329 | 1.55 | 0.664 | 0.190 | 1.61 | -0.0099 | |
| 0.648 | 1.06 | 2.045 | 0.690 | 1.01 | -0.0833 | |
| 0.815 | 0.98 | 1.988 | 0.762 | 1.09 | -0.0518 | |
Pearson correlation coefficient between the experimental binding constants of these complexes and the binding scores produced by the given scoring function.
Standard deviation in regression (in logKunits)
Slope of the regression line, which is needed in Equation 4.
Figure 4Standard deviations (in log . (A) Results obtained based on crystal structures (Set I). (B) Results obtained based on docking poses (Set II). The X axis indicates the similarity cutoffs used in defining reference complexes. The numbers indicated on this figure are the total numbers of the complexes considered at each similarity cutoff.
Information on the five pairs of HIV protease complexes with the highest similarities
| The query complex | The reference complex | |||||
|---|---|---|---|---|---|---|
| 8.40 | 8.39 | 8.83 | 8.40 | 0.67 | ||
| 8.40 | 8.41 | 8.84 | 8.40 | 0.67 | ||
| 11.40 | 10.68 | 8.86 | 10.96 | 0.67 | ||
| 10.96 | 11.68 | 9.28 | 11.40 | 0.67 | ||
| 9.95 | 11.43 | 8.90 | 11.40 | 0.67 | ||
Figure 5Chemical structures of the five ligand molecules in the HIV protease complexes listed in Table 3.
Figure 6Standard deviations (in log. (A) Results obtained based on crystal structures (Set I). (B) Results obtained based on docking poses (Set II). The X axis indicates the similarity cutoffs used in defining reference complexes. The numbers indicated on this figure are the total numbers of the complexes considered at each similarity cutoff.
Figure 7Standard deviations (in log. (A) Results obtained based on crystal structures (Set I). (B) Results obtained based on docking poses (Set II). The X axis indicates the similarity cutoffs used in defining reference complexes. The numbers indicated on this figure are the total numbers of the complexes considered at each similarity cutoff.
Information on the six pairs of trypsin complexes with the highest similarities
| The query complex | The reference complex | |||||
|---|---|---|---|---|---|---|
| 5.80 | 5.54 | 5.60 | 5.38 | 0.67 | ||
| 5.38 | 5.64 | 5.52 | 5.80 | 0.67 | ||
| 6.92 | 6.92 | 6.08 | 6.92 | 0.58 | ||
| 6.92 | 6.92 | 6.08 | 6.92 | 0.58 | ||
| 6.36 | 7.30 | 6.27 | 6.92 | 0.58 | ||
| 6.82 | 6.48 | 6.33 | 6.36 | 0.58 | ||
Figure 8Chemical structures of the six ligand molecules in the trypsin complexes listed in Table 4.