Literature DB >> 28149052

Prediction of kinase-inhibitor binding affinity using energetic parameters.

Singaravelu Usha1, Samuel Selvaraj1.   

Abstract

The combination of physicochemical properties and energetic parameters derived from protein-ligand complexes play a vital role in determining the biological activity of a molecule. In the present work, protein-ligand interaction energy along with logP values was used to predict the experimental log (IC50) values of 25 different kinase-inhibitors using multiple regressions which gave a correlation coefficient of 0.93. The regression equation obtained was tested on 93 kinase-inhibitor complexes and an average deviation of 0.92 from the experimental log IC50 values was shown. The same set of descriptors was used to predict binding affinities for a test set of five individual kinase families, with correlation values > 0.9. We show that the protein-ligand interaction energies and partition coefficient values form the major deterministic factors for binding affinity of the ligand for its receptor.

Entities:  

Keywords:  Inhibition constant prediction; Kinase inhibitors; Protein-ligand interaction; energetic and solvent descriptors

Year:  2016        PMID: 28149052      PMCID: PMC5267961          DOI: 10.6026/97320630012172

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Protein kinases are a large family of homologous proteins with more than 500 members in the human proteome [1]. Kinasemediated protein phosphorylation is a crucial component of the signal transduction pathways which plays a central role in diverse biological processes such as cell growth, metabolism, differentiation, and apoptosis [2]. A number of diseases, including cancer, diabetes, inflammation, immune and neurodegenerative disorders are linked to perturbation of protein kinase–mediated cell signaling pathways [3]. Since all members of the kinase families utilize ATP, kinase inhibitors are designed to bind with the ATP to prevent substrate phosphorylation [1]. Over 20 small-molecule protein kinase inhibitors have been currently approved and more than 150 kinase inhibitors are undergoing clinical trials [4]. Yet, issues such as target specificity, [5,6, 7,8] resistance development [9,10] hinge region binding and activation state dependence of kinase inhibitors need to be addressed [2,3,4]. The three-dimensional structures of proteins with bound ligand are available in the Protein Data Bank [11] along with their experimental binding affinity information. Binding affinity data such as Ki, Kd, IC50 etc. obtained from experimental studies are also available in databases such as BindingDB [12], Binding MOAD [13], PDBbind [14] etc. Availability of valuable resources regarding kinase inhibitors made computational biologists to develop statistical models to accurately predict the binding affinity of complexes. Structure-based virtual screening methods use docking programs to explore the possible binding modes of a ligand within the target binding site, and scoring functions to estimate the affinity of the ligand for the binding site [15,16]. While docking methods at present are in general successful in predicting the correct binding conformations of ligand molecules, they do not perform well in correctly predicting the binding affinity for the predicted ligand conformations [2]. Hence, it is essential to predict the binding affinity of a given ligand to its target known as the ‘scoring problem’ [17]. As a pioneering work, Bohm [18] (1994) developed a simple empirical function (LUDI) to estimate the binding constant for a protein-ligand complex of known structure. This empirical scoring function takes into account hydrogen bonds, ionic interactions, the lipophilic protein-ligand contact surface and the number of rotatable bonds in the ligand. Head et al. (1996) in their VALIDATE approach used electrostatic and steric interaction energies, octanol-water partition coefficient, polar and nonpolar contact surfaces, and a term to describe intramolecular flexibility [19]. Following the approach of Bohm, [18] Eldridge et al. [20] (1997) included intramolecular flexibility in ChemScore and Wang et al. [21] (1998) classified hydrogen bonds and included the occurrence of interstitial water molecules in SCORE. Based on the statistical analysis of experimentally observed distributions and frequencies of distance-dependent protein-ligand atom pair interactions, the potential of mean force (PMF) was obtained which has been used for affinity predictions of large datasets [22]. Based on a larger set of 200 protein–ligand complexes, Wang et al. (2002) developed X-Score [17], consisting of four energy terms including van der Waals interactions, hydrogen bonds, hydrophobic effects and effective rotatable bonds. Docking programs such as FlexX [23] and Surflex [24] use empirical scoring functions by including different empirical energy terms. A large-scale validation of a quantum mechanics based scoring function to predict the binding affinity and binding mode of a diverse set of protein-ligand complexes containing different protein families including aspartic proteases, serine proteases, sugar binding proteins, amino acid binding proteins, and protein kinases was done by Raha and Merz (2004) [25]. LigScore functions [26] have made use of three distinct terms, the van der Waals interaction, the polar attraction between the ligand and protein, and the desolvation penalty attributed to the binding of the polar ligand atoms to the protein to predict the experimental pKi values of a diverse set of 118 protein–ligand complexes that span more than seven protein families. CLiBE, a database of computed ligand binding energy (based on molecular mechanics force field) for ligand-receptor three dimensional structures have been developed and a linear correlation between the computed ligand-receptor interaction energy and experimental binding affinity (Kcal/mol) has been observed [27]. A Program for Energetic Analysis of Receptor-Ligand System (PEARLS) has been developed to compute free energy of protein-ligand complexes [28]. While the above methods use the known three dimensional structures to predict binding affinities, the Quantitative Structure-Activity Relationship (QSAR) methods serve as an alternative way of binding affinity predictions in the absence of 3D structure of target proteins or their complexes with ligands. These methods make use of physicochemical and structural properties (descriptors) of ligands to relate their biological activity using regression methods. Combined QSAR approaches in binding affinity predictions have been recently reported [29,30]. In the present work, we have correlated the experimental IC50 values (in their logarithmic form) of 25 different kinase-inhibitor complexes with their protein- interaction energy and partition coefficient (logP) values for multiple regression analysis, which shows a good correlation with the experimental IC50 values. This shows that the protein-ligand interaction energies and logP values form the major factors that determine the ligand binding affinity of proteins. By incorporating these energetic as well as solvent terms, docking methodologies can be highly successful in predicting the binding affinity for the generated poses of their correct ligand binding modes.

Methodology

Information resources

Twenty five different protein kinase-inhibitor complexes solved by x-ray crystallography method were obtained from Protein Data Bank [11]. The complexes in the dataset have resolution less than 2.5 Å with known IC50 values were treated as training set. The number of non-hydrogen atoms of the ligands and energetic profile comprising of i) total ligand-receptor interaction energy, ii) van der Waals energy, iii) electrostatic energy, iv) hydrogen bond energy, v) solvation free energy, vi) conformational entropy and vii) ligand-water-receptor binding energy were obtained from the PEARLS server for each of the kinase-inhibitor complexes. The PEARLS server uses the AMBER force field [31] for computing the above energetic contributions [28]. LogP (octanol/water partition coefficient) values of the ligands were calculated from the Molinspiration server [32] by providing SMILES code of the ligand as input.

Training set construction and validation

Multiple regression analysis was carried out to establish a relationship between the above-mentioned descriptors and experimental log (IC50). A back-check test was carried out for predicting the binding affinity by re-substituting the values in the regression equation obtained. For the jack-knife test, coefficients of multiple regressions were determined using (n-1) data (omitting one protein-ligand complex at a time) and then predicting binding affinity of the omitted protein-ligand complex.

Test set information

The regression equation obtained from the training set was tested on i) a set of 93 kinase-inhibitor complexes with IC50 values, and ii) a set of 9 approved kinase inhibitors [2]. To further assess the predominant role of the chosen descriptors in binding affinity predictions, the experimental log (IC50) values were regressed with the same set of nine descriptors in five independent protein-kinase families comprising 17 cyclic AMPdependent kinase-inhibitors, 12 casein kinase-inhibitors, 15 hepatocyte growth factor receptor kinase-inhibitors, 12 cyclindependent kinase-inhibitors and 16 mitogen-activated kinaseinhibitors. For each of the five kinase families, five different regression equations were obtained which were then validated by back-check analysis. The dataset information of all the kinaseinhibitor complexes used in the present study, including PDB ID, protein name, ligand ID, x-ray resolution (Å), experimental IC50 values (nM) with their logarithmic form, and descriptor values are provided in the Appendix.

Discussion

The following multiple regression equation (1) between log (IC50) values and nine energetic descriptors and log P with a correlation coefficient, r = 0.93 was obtained for the training set of 25 kinaseinhibitor complexes. log (IC50) = – 0.07 NHA + 44.71 IE – 44.61 vdW – 44.51 Elect – 44.47 H-bonds – 45.45 Solv – 43.19 entropy – 42.82 H-bonds (water-mediated) + 0.18 LogP + 3.58 (1) where NHA denotes the number of non-hydrogen atoms of the ligand, IE, the total Ligand-receptor interaction energy (Kcal/mol), vdW, the van der Waals energy (Kcal/mol), Elect, the electrostatic Energy (Kcal/mol), H-bonds, the hydrogen bond energy (Kcal/mol), Solv, the solvation free energy (Kcal/mol), entropy, the conformational entropy (Kcal/mol), H-bonds (water-mediated), the ligand-water-receptor binding energy (Kcal/mol) and logP, (the octanol-water) Partition coefficient. The experimentally observed and predicted IC50 values for 25 kinase-inhibitor complexes in back-check and jack-knife predictions are provided in Table 1. The average deviation of the predicted log (IC50) values from the experimental log (IC50) values was 0.41 for back-check and 0.63 for jack-knife predictions respectively. The relationship between the experimental log (IC50) values with back-check and jack-knife predictions are provided as scatter plots in Figure 1a and 1b
Table 1

Experimentally observed and predicted IC50 values for 25 kinase-inhibitor complexes.

PDB_IDLigand_IDExperimental IC50(nM)Experimental Log (IC50(nM))Predicted log(IC50 (nM))
Back-checkJack-knife test
2I6A5I522.81.361.33 (0.03)1.30(0.06)
2OO8RAJ100.26 (-0.26)0.48 (-0.48)
4AT3LTI40.60.28 (0.32)0.13 (0.47)
3SXFBK550.70.69 (0.01)0.69 (0.01)
2C1AI5S1702.231.43 (0.80)1.96 (0.27)
3MB601I10021.71 (0.29)1.54 (0.46)
1Y6BAAX381.580.85 (0.73)1.90 (-0.32)
2A4LRRC4002.62.82 (-0.22)2.90 (-0.30)
2YAKOSV20.30.24 (0.07)0.17 (0.13)
4GK2L66401.61.55 (0.05)1.53 (0.07)
3POZ03P231.361.65 (-0.28)1.91 (-0.55)
4F640S8631.82.00 (-0.20)2.15 (-0.35)
3BZ3YAM1.50.180.27 (-0.09)0.33 (-0.15)
1Q3DSTU151.181.28 (-0.11)1.34 (-0.16)
3C1XCKK451.651.01 (0.64)0.68 (0.97)
3D94D94191.280.96 (0.32)0.77 (0.51)
4BKZ1WS271.431.87 (-0.43)2.20 (-0.77)
3HRBI39211.321.80 (-0.48)1.93 (-0.61)
4BFVZVV1402.151.45 (0.70)0.54 (1.61)
3LJ3WYE431.632.12 (-0.49)2.74 (-1.10)
2VGOAD55002.71.93 (0.77)1.39 (1.31)
4HDC13Y1.20.080.37 (-0.29)0.77 (-0.69)
1RW858013203.122.96 (0.16)2.84 (0.28)
3KRRDQX0.48-0.32-0.26 (-0.06)-0.16 (-0.16)
3MVHWFE0.5-0.30.05 (-0.35)0.19 (-0.49)
Average deviation0.410.63
Note: The deviation between the predicted and experimental IC50 values is given in parenthesis.
Figure 1

(a) Scatter plot of experimental and back-check predicted IC50 values in 25 different kinase-inhibitor complexes, (b) Scatter plot of experimental and jack-knife predicted IC50 values in 25 different kinase-inhibitor complexes.

a) Test set of diverse protein kinases (Test set I)

The regression equation (1) obtained was tested on 93 kinaseinhibitor complexes (results are provided as Table 7 in the Supporting Information file) and the relationship between the experimental and predicted log (IC50) values is presented as a scatter plot (Figure 2). An average deviation of 0.92 from the original log (IC50) values was observed for the 93 kinase-inhibitor complexes. The difference between the experimental and calculated log (IC50) values was found to be less than ±1 log unit for 64 out of 93 kinase-inhibitor complexes.
Table 7

Experimentally observed and predicted IC50 values for 16 mitogen-activated protein kinase-inhibitor complexes

PDB_ID LIG_ID Experimental IC50 (nM) Experimental log ( IC50 (nM)) Back-check Prediction
1W82 L10 1962.292.40 (-0.11)
1WBN L09 3502.542.20 ( 0.34)
1ZYJ BI5 15003.182.84 ( 0.33)
3FLZ FLZ 1062.032.25 (-0.22)
3FMH 533111.041.25 (-0.21)
3HL7 I47 1102.041.56 ( 0.48)
3HP2 P36 6802.833.07 (-0.24)
3HRB I39 211.321.99 (-0.67)
3IPH G11 316.232.52.40 ( 0.10)
3L8X N4D 1010.96 ( 0.04)
3MVM 39P 3.90.590.63 (-0.03)
3NWW 3NW 70.850.96 (-0.12)
3S4Q NK0 40.60.44 ( 0.16)
3UVP 48351.541.26 ( 0.28)
3ZSG T75 7.10.851.15 (-0.30)
4EWQ GG5 6002.782.62 ( 0.16)
Average deviation 0.29
Note: The deviation between the predicted and experimental IC50 values is given in parenthesis.
Figure 2

Scatter plot of experimental and predicted IC50 values in the test set of 93 kinase-inhibitor complexes.

b) Approved kinase inhibitors as test set (Test set II)

To further test the predictability of our regression equation (1), we have tested it for nine approved kinase inhibitors2. The experimental values of those inhibitors were found to have minimum and maximum range of values. Hence, the mean value between the logarithm of minimum and maximum values were calculated and compared with the predicted values. The predicted values were almost closer (the deviation was less than 1) to the experimental log (IC50) values in 7 out of 9 kinaseinhibitor complexes (Table 2).
Table 2

Prediction of experimental log (IC50) values in approved kinase inhibitors.

S.NO.PDB ID Protein nameLigand IDLigand nameExperimental log (IC50 (nM))Predicted log (IC50(nM))
11M17 Epidermal growth factor receptor AQ4 Erlotinib - 0.70 – 3.16 (1.23) 0.87
21IEP Proto-oncogene tyrosine-protein kinase abl STI Imatinib0.04 – 3.93 (1.99) 1.79
32ITY Epidermal growth factor receptor IRE Gefitinib 0.00 – 3.44 (1.72)1.46
42J2I Proto-oncogene serine/threonine-protein kinase pim-1 LY4 Ruboxistaurin 2.31.61
52GQG Proto-oncogene tyrosine-protein kinase abl1 1N1 Dasatinib - 0.70 – 2.85 (1.07) -0.97
62JAV Serine/threonine-protein kinase nek2 5Z5 Sunitinib 3.91.02
71UWH B-raf proto-oncogene serine/threonine-protein kinase BAX Sorafenib 1.04 – 3.86 (2.45) 1.12
81XKK Epidermal growth factor receptor FMM Lapatinib 0.46 – 2.64 (1.55) 1.88
92F2U Rho-associated protein kinase 2 M77 Fasudil 2.26 – 4.07 (3.16) 2.45
Note: The mean value between the logarithm of minimum and maximum experimental IC50 values are given in parenthesis.

c) Kinase classes

In order to further validate the use of the same set of descriptors in predicting binding affinity, the experimental IC50 values of five kinase families were regressed. The regression equation obtained for individual kinase families and the result of back-check predictions are discussed as follows:

i) Cyclic AMP-dependent protein kinase-inhibitor complexes

For a data-set of 17 cyclic AMP-dependent protein kinaseinhibitor complexes, a correlation value of 0.95 was obtained using the regression equation (2) log (IC50) = -0.06 NHA – 76.09 IE + 76.08 vdW + 75.98 Elect + 76.26 H-bonds + 76.62 Solv + 75.21 entropy + 77.07 H-bonds (water-mediated) – 0.13 LogP + 5.21 (2) Using the regression equation (Eq. 2), log (IC50) values for 17 cyclic AMP-dependent protein kinase-inhibitor complexes were predicted. The experimental as well as predicted log (IC50) values are presented (Table 3) and plotted (Figure 3a). The average deviation for the back-check test was 0.28 from the experimental values.
Table 3

Experimentally observed and predicted IC50 values for 17 cyclic AMP-dependent protein kinase-inhibitor complexes.

PDB_ID LIG_ID Experimental IC50 (nM) Experimental log ( IC50 (nM)) Back-check Prediction
1STC STU 511.711.45 (0.26)
1SVE I01 50.70.99 (-0.29)
1XH4 R69 301.481.58 (-0.10)
1YDS IQS 53003.723.13 (0.59)
2C1A I5S 1702.232.23 (0.00)
2F7X 4EA 381.581.64 (-0.06)
2GNI M77 76053.883.59 (0.29)
2JDS L20 271.431.45 (-0.01)
2OH0 2PY 181.261.21 (0.04)
2OJF 4PY 1102.042.15 (-0.11)
2UW6 GVO 2802.453.07 (-0.62)
2UZT SS3 141.151.11 (0.04)
3L9L L9L 1672.221.78 (0.45)
3MVJ XFE 32003.513.77 (-0.27)
3OW3 SMY 7422.872.88 (-0.01)
3ZO2 15I 801.92.10 (-0.19)
4C35 NU3 5602.752.76 (-0.01)
Average deviation0.28
Note: The deviation between the predicted and experimental IC50 values is given in parenthesis.
Figure 3

Scatter plot of experimental and back-check predicted IC50 values in (a) 17 cyclic AMP-dependent kinase-inhibitor complexes; (b) 12 caesin kinase-inhibitor complexes; (c) 15 Hepatocyte growth factor receptor kinase-inhibitor complexes; (d)12 cyclin-dependent kinase-inhibitor complexes; (e) 16 mitogen-activated protein kinase-inhibitor complexes.

ii) Casein kinase-inhibitor complexes

12 casein kinase-inhibitor complexes were taken for the multiple regression analysis which has shown a good correlation of r = 0.97 for the regression equation (3) log (IC50) = 0.01 NHA – 1.40 IE + 1.45 vdW + 1.39 Elect + 1.36 Hbonds + 1.43 Solv + 1.35 entropy + 1.69 H-bonds (watermediated) + 0.01 LogP + 0.47 (3) The set of 12 casein kinase-inhibitor complexes with their experimental and predicted values has been provided (Table 4). The scatter plot shows the relationship between the experimental and predicted IC50 values (Figure 3b), the average deviation being 0.02 for back-check predictions.
Table 4

Experimentally observed and predicted IC50 values for 12 casein kinase-inhibitor complexes.

PDB_ID LIG_ID Experimental IC50 (nM) Experimental log ( IC50 (nM)) Back-check Prediction
2QC6 G12 1000.270.24 (0.03)
2ZJW REF 400.380.37 (0.01)
3AMY AGI 8000.360.38 (-0.02)
3BQC EMO 20000.180.20 (-0.03)
3MB6 01I 1000.240.25 (0.00)
3PE1 3NG 10.20.21 (-0.01)
3PWD CZ0 2200.340.31 (0.03)
3R0T FU9 0.910.240.24 (0.01)
3RPS 4B0 3200.360.37 (-0.01)
3U4U LNH 31000.340.34 (0.01)
4ANM WUL 280.230.22 (0.01)
4DGM AGI 12000.220.23 (-0.02)
Average deviation 0.02
Note: The deviation between the predicted and experimental IC50 values is given in parenthesis.

iii) Hepatocyte growth factor receptor kinase-inhibitor complexes

A set of 15 hepatocyte growth factor receptor kinase-inhibitors has shown a correlation coefficient value of 0.90 when subjected to regression with multiple descriptors, the equation (4) being log (IC50) = 0.05 NHA – 232.16 IE + 231.87 vdW + 231.02 Elect + 232.03 H-bonds + 232.68 Solv + 224.75 entropy +236.13 H-bonds (water-mediated) – 0.27 LogP + 0.24 (4) The experimental and predicted log (IC50) values are provided in Table 5. The correlation between experimental and calculated values for the 15 hepatocyte growth factor receptor kinaseinhibitors is shown in Figure 3c. An average deviation of 0.31 was observed.
Table 5

Experimentally observed and predicted IC50 values for 15 hepatocyte growth factor receptor kinase-inhibitor complexes.

PDB_ID LIG_ID Experimental IC50 (nM) Experimental log ( IC50 (nM)) Back-check Prediction
2WD1 ZZY 821.911.56 (0.36)
2WKM PFY 90.951.26 (-0.30)
3C1X CKK 451.651.56 (0.09)
3CCN LKG 1202.081.69 (0.39)
3CD8 L5G 90.951.11 (-0.16)
3CTH 319351.541.84 (-0.29)
3I5N B2D 171.231.69 (-0.46)
3F66 IHX 9002.952.88 (0.07)
3L8V L8V 80.90.72 (0.18)
3QTI 3QT 141.151.18 (-0.04)
3RHK M97 5202.722.60 (0.11)
4DEG 0JJ 60.781.43 (-0.65)
4DEH 0JK 6122.792.71 (0.08)
4EEV L1X 421.621.09 (0.53)
4GG7 0J8 6.50.810.72 (0.09)
Average deviation 0.31

iv) Cyclin-dependent kinase-inhibitor complexes

A very good correlation of r = 0.94 was obtained for 12 cyclindependent kinase-inhibitor dataset using the regression equation (5) log (IC50) = – 0.35 NHA + 94.15 IE – 94.18 vdW – 96.24 Elect – 92.97 H-bonds – 92.06 Solv –96.77 entropy – 95.47 H-bonds (water-mediated) + 1.94 LogP + 6.27 (5) The predicted results of 12 cyclin-dependent kinase-inhibitor complexes are tabulated (Table 6). The average deviation value from the experimental value was found to be 0.49. The results are plotted (Figure 3d).
Table 6

Experimentally observed and predicted IC50 values for 12 cyclin-dependent kinase-inhibitor complexes.

PDB_ID LIG_ID Experimental IC50 (nM) Experimental log ( IC50 (nM)) Back-check Prediction
1AQ1 STU 70.850.71 (0.13)
1DI8 DTQ 100033.07 (-0.07)
1E1X NW1 22003.343.89 (-0.54)
1H01 FAL 220004.344.74 (-0.40)
1W0X OLO 70.851.59 (-0.74)
3S2P PMU 681.831.36 (0.48)
3TIY TIY 170004.234.36 (-0.13)
3TNW F18 200004.33.52 (0.78)
3ULI 1N3 701.851.48 (0.37)
3UNJ 0BX 110004.043.41 (0.63)
3WBL PDY 230004.364.22 (0.14)
4BGH 3I6 40.61.24 (-0.63)
Average deviation 0.49
Note: The deviation between the predicted and experimental IC50 values is given in parenthesis.

v) Mitogen-activated protein kinase-inhibitor complexes

The multiple regression analysis of 16 mitogen-activated protein kinase-inhibitors gave a correlation of r = 0.94 using the regression equation (6) log (IC50) = – 0.13 NHA + 16.62 IE – 16.05 vdW – 17.09 Elect – 16.66 H-bonds – 16.62 Solv – 13.78 entropy – 18.19 H-bonds (water-mediated) + 0.40 LogP + 8.27 (6) The observed and computed values for a dataset of 16 mitogenactivated protein kinase-inhibitors are presented (Table 7) showing an average deviation value of 0.29. The predicted IC50 values were plotted against the experimental values (Figure 3e). In QSAR studies, usually a large number of physicochemical and structural properties (descriptors) of chemical compounds are calculated and the best combinations of descriptors that correlate maximally with the biological activity are chosen. QSAR models have been developed for predicting kinase selectivity profiles to provide understanding of structure selectivity relationships for kinase inhibitor design [33,34, 35]. The development of widely accepted ‘universal’ set of descriptors applicable for diverse datasets has also been a focus of QSAR-based analysis [36,37, 38]. In the present work, a uniform set of descriptors have been used across different kinase-inhibitor complexes for binding affinity prediction. Han et al. (2006) suggested that the calculated interaction energies highly depend on van der Waals contacts, electrostatic interactions, hydrogen bonds, metal-receptor binding, and solvation [28], which agrees with the observations of our present analysis. These observations are reflected in the interaction energy contributions of our present analysis. The protein-ligand interaction energies obtained using PEARLS server has been used in other studies on inhibitor discovery such as HIV-1 protease [39] and ribonuclease A inhibitors [40] to predict the binding affinity values using regression analysis. Log P, remains the main deterministic factor for the ligand’s affinity for the protein active site with reference to the surrounding solvent environment [41]. In the present study, we have used the various energetic components as independent variables along with logP values, to predict the experimental binding affinity. This set of descriptors developed from a small set of 25 kinase-inhibitor complexes were able to predict IC50 values for 93 test set complexes spanning 4 orders of magnitude of IC50 values. The same set of descriptors was also found to be suitable for family specific regression models as well. As docking methods improve to reproduce conformations observed through x-ray crystallographic and NMR determined structures, it will be possible to use our present approach to predict the IC50 values for various protein targets, more significantly for specific protein families. Alternatively, if IC50 values for kinase-inhibitor complexes are known, the method can also be used to predict the pose of a given ligand as well.

Conclusion:

Despite intensive research over more than two decades, accurate prediction of the binding affinities of large set of diverse protein ligand complexes remains one of the most important open problems in computational molecular biology [42]. The issues currently being addressed are the scoring of modelled protein conformations, and including the binding free energy due to presence of water molecules [43]. In the present work, we have addressed these issues by using energetic and solvent descriptors to predict the binding affinity of kinase-inhibitor complexes using multiple regression analysis. A high correlation value of 0.9 between the predicted and experimental binding affinity was obtained for a test set of kinase-inhibitor complexes. The method was validated by predicting a test of 93 kinase-inhibitor complexes covering five kinase families which has shown a good predictive ability. Our methodology can provide valuable insights for the prediction accuracy of molecular docking strategies. Further studies will be required to validate the general applicability of these set of descriptors to predict the binding affinity for a diverse set of enzyme-inhibitor complexes.
  37 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A widely applicable set of descriptors.

Authors:  P Labute
Journal:  J Mol Graph Model       Date:  2000 Aug-Oct       Impact factor: 2.518

3.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction.

Authors:  Renxiao Wang; Luhua Lai; Shaomeng Wang
Journal:  J Comput Aided Mol Des       Date:  2002-01       Impact factor: 3.686

4.  Native atom types for knowledge-based potentials: application to binding energy prediction.

Authors:  Brian N Dominy; Eugene I Shakhnovich
Journal:  J Med Chem       Date:  2004-08-26       Impact factor: 7.446

5.  Functional interrogation of the kinome using nucleotide acyl phosphates.

Authors:  Matthew P Patricelli; A Katrin Szardenings; Marek Liyanage; Tyzoon K Nomanbhoy; Min Wu; Helge Weissig; Arwin Aban; Doris Chun; Stephen Tanner; John W Kozarich
Journal:  Biochemistry       Date:  2007-01-16       Impact factor: 3.162

6.  Toward novel universal descriptors: charge fingerprints.

Authors:  Frank R Burden; Mitchell J Polley; David A Winkler
Journal:  J Chem Inf Model       Date:  2009-03       Impact factor: 4.956

7.  QSAR models for predicting the similarity in binding profiles for pairs of protein kinases and the variation of models between experimental data sets.

Authors:  Robert P Sheridan; Kiyean Nam; Vladimir N Maiorov; Daniel R McMasters; Wendy D Cornell
Journal:  J Chem Inf Model       Date:  2009-08       Impact factor: 4.956

8.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes.

Authors:  M D Eldridge; C W Murray; T R Auton; G V Paolini; R P Mee
Journal:  J Comput Aided Mol Des       Date:  1997-09       Impact factor: 3.686

Review 9.  Resistance to tyrosine kinase inhibitors: calling on extra forces.

Authors:  Jan Cools; Chantal Maertens; Peter Marynen
Journal:  Drug Resist Updat       Date:  2005-06       Impact factor: 18.500

10.  Does a more precise chemical description of protein-ligand complexes lead to more accurate prediction of binding affinity?

Authors:  Pedro J Ballester; Adrian Schreyer; Tom L Blundell
Journal:  J Chem Inf Model       Date:  2014-02-20       Impact factor: 4.956

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.