Literature DB >> 24521147

HADDOCK(2P2I): a biophysical model for predicting the binding affinity of protein-protein interaction inhibitors.

Panagiotis L Kastritis¹, João P G L M Rodrigues, Alexandre M J J Bonvin.

Abstract

The HADDOCK score, a scoring function for both protein-protein and protein-nucleic acid modeling, has been successful in selecting near-native docking poses in a variety of cases, including those of the CAPRI blind prediction experiment. However, it has yet to be optimized for small molecules, and in particular inhibitors of protein-protein interactions, that constitute an "unmined gold reserve" for drug design ventures. We describe here HADDOCK(2P2I), a biophysical model capable of predicting the binding affinity of protein-protein complex inhibitors close to experimental error (~2-fold larger). The algorithm was trained and 4-fold cross-validated against experimental data for 27 inhibitors targeting 7 protein-protein complexes of various functions and tested on an independent set of 24 different inhibitors for which K(d)/IC50 data are available. In addition, two popular ligand topology generation and parametrization methods (ACPYPE and PRODRG) were assessed. The resulting HADDOCK(2P2I) model, derived from the original HADDOCK score, provides insights into inhibition determinants: while the role of electrostatics and desolvation energies is case-dependent, the interface area plays a more critical role compared to protein-protein interactions.

Entities: Chemical Disease Gene Species

Mesh：

Substances：

Year: 2014 PMID： 24521147 PMCID： PMC3966529 DOI： 10.1021/ci4005332

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Introduction

Protein–protein interactions (PPIs) define most of the cellular processes in the cell,[1] such as differentiation, proliferation, signal transduction, and apoptosis. Being able to design PPI inhibitors will drastically catalyze the development of novel therapeutics for diseases, such as cancer.[2] Such inhibitors are currently considered “an unmined gold reserve”[3] for drug design in general. Consequently, novel software tools are being developed and made publicly available that target the design of inhibitor of PPIs, such as PocketQuery.[4−6] The chemical space of PPI inhibitors is, however, rather unique.[7−9] Protein–protein interfaces have been manually curated and collected in several dedicated databases, such as iPPi-DB,[10] a database that includes associated pharmacological data, and the 2P2I database.[11,12] Recent studies on a structure-based benchmark[11] of PPI inhibitors (collected from the 2P2I database[12]) showed that the size of the ligands targeting the interface of protein–protein complexes is substantially larger than that of normal inhibitors that target the active site of single molecules like enzymes[9] and suggest that PPI inhibitors are mainly large, lipophilic, and aromatic compounds. Because of the different nature of the target interface and ligands, dedicated biophysical models are needed to predict the binding affinity of PPI inhibitors. Although current biophysical models have proven to reasonably approximate the affinity of protein–ligand complexes,[9,13] these have yet to be optimized to predict the affinity of PPI inhibitors. A drop by more than 10% in docking success rates is reported when inhibitors of protein–protein interactions are put to test in comparison to normal inhibitors.[14] To stimulate the development of new models, data sets of binding affinities are required. Such a data set has recently been compiled for protein–protein complexes,[15,16] but no such data set is available for PPI inhibitors. Although docking and binding affinity prediction have been performed with success for specific systems,[17−19] a generic binding affinity prediction model for PPIs would be a welcome addition. In this work, we report the optimization and performance of the HADDOCK score[20,21] on the prediction of the binding affinity of PPI inhibitors. This was performed on a binding affinity benchmark consisting of Kd and Ki values for 27 complexes from the 2P2I database.[11,12] In addition, we also compiled an independent set of PPI inhibitors for further validation, consisting of 19 protein-inhibitor complexes with available IC50 data. Finally, we predicted the affinity of five different inhibitors that target the interaction between bromodomains and acetylated histones. We additionally investigate experimental ambiguity in affinity measurement and propose an acceptable prediction error on this basis. The original HADDOCK score, which is a linear combination of van der Waals and electrostatics energy terms calculated using the OPLS force field,[22] together with an additional empirical desolvation term,[23] was originally optimized for discriminating near-native poses in protein–protein docking.[21] We present here the PPI-optimized score, HADDOCK2P2I, which is shown to predict, close to experimental error (∼2-fold larger), the binding affinities of PPI inhibitors. Its components provide valuable new insights into the determinants of the inhibition of PPIs.

Materials and Methods

Benchmark Compilation

Information about available structural data for protein–protein interaction inhibitors was retrieved from the 2P2I database[11] (see Table 1) and the corresponding coordinates downloaded from the Protein Data Bank (www.pdb.org).[24] Binding affinity data (dissociation constants, Kd values, and inhibition constants, Ki values) were manually procured from literature (see Table 1 and associated references in Supporting Information Table S1). In total, data for 7 different protein–protein complexes were identified, for which 27 structures and binding affinity data of complexes with various PPI inhibitors were available.

Table 1

Ki and Kd Binding Affinity Data Set of Protein–Protein Interaction Inhibitorsa

	interaction	biological role		PDB (complex)	K_d (complex) (M)
	Bcl-x_L/Bak	programmed cell death		1bxl	3.4 × 10^–7

Original references for the affinity data are provided in the Supporting Information (Table S1).

Structure Refinement

The crystal structures of the inhibitors of PPIs in complex with their respective proteins were refined in HADDOCK[20,21,25] and subsequently scored. This ensured that all potentially missing side-chains were properly built and the interface of the complex was optimized using the OPLS force field.[22] For this, the standard water refinement setup in HADDOCK was used, which starts by solvating the complexes in an 8 Å shell of water molecules (TIP3P) and consists of the following steps: (1) Energy minimization (EM) of the water, 40 steps with the protein fixed (Powell minimizer), followed by 2 × 40 EM steps with harmonic position restraints on the protein heavy atoms (krest = 5 kcal mol–1 Å–2). (2) Gentle simulated annealing protocol using molecular dynamics in Cartesian space consisting of (a) a heating period with 500 MD steps at 100, 200, and 300 K with position restraints (krest = 5 kcal mol–1 Å–2) on the protein heavy atoms except for the side-chains at the interface, (b) a sampling stage with 1250 MD steps with weak (krest = 1 kcal mol–1 Å–2) position restraints on the protein heavy atoms except for the backbone and side chains at the interface, and (c) a cooling stage with 500 MD steps at 300, 200, and 100 K with weak (krest = 1 kcal mol–1 Å–2) position restraints on the protein backbone heavy atoms not at the interface. A time step of 2 fs was used for the integration of the equation of motions and the temperature was maintained constant by weak coupling to a reference temperature bath using the Berendsen thermostat.[26] (3) Final 200 EM steps without any position restraints. Nonbonded interactions were calculated with the OPLS force field[22] using a cutoff of 8.5 Å. The electrostatic energy (Eelec) was calculated using a shift function while a switching function (between 6.5 and 8.5 Å) was used for the van der Waals energy (Evdw). This procedure generated 20 models for each complex, starting from different random velocities. As is default in the HADDOCK protocol, the average score of the top 4 models was considered. All calculations were performed with HADDOCK, version 2.1/CNS,[27] version 1.2, through the refinement interface of the HADDOCK web server[25] (http://haddock.science.uu.nl/services/HADDOCK/haddockserver-refinement.html). The same protocol can be run manually on a local installation of HADDOCK by turning off randomization of starting orientations, rigid body minimization and random removal of restraints and setting all steps for the semiflexible refinement stage (it1) to 0. This was used for the refinement runs using the ACPYPE parametrization of the ligands (see Results).

HADDOCK2P2I Model Development and Evaluation

Multiple linear regression analysis was used to create the optimized HADDOCK2P2I score based on the following equation:where Evdw and EElec denote the intermolecular van der Waals and electrostatics energies, Edesolv an empirical desolvation energy,[23] and BSA the buried surface area. log Kpred denotes the logarithm of the predicted binding affinity of the inhibitors. The β and c coefficients were optimized using 4-fold cross-validation by minimizing the χ2 function (eq 2) for the 27 complexes shown in Table 1.where Kexp corresponds to the experimentally measured Kd and Ki constants. During optimization, the various HADDOCK score components were switched on and off to assess their usefulness in the final development of the model. Similar parametrization methods have already been employed successfully for various protein–ligand systems to date.[28−30] The final coefficients were taken as the average of the 4-fold cross-validation optimization runs.

Independent Sets for Assessing the Prediction Performance of the HADDOCK Score and HADDOCK2P2I

For compiling the training set and performing the 4-fold cross-validation, we carefully selected only competitive inhibitors, avoiding any noncompetitive modulators of the interactions. Data from interaction inhibition are often reported not as Ki (or, equivalently, Kd), but instead as IC50, the latter denoting the concentration of ligand that reduces interaction activity by 50%. IC50 measures inhibitor binding in competition with another binding partner; consequently, it depends on the concentration of the competitive molecule and its affinity for the target protein. IC50s are usually larger than Ki values, but when the concentration of the substrate is very low, they should become essentially equal to Ki.[31] Since data about protein concentration in the assays were rather scarce we chose to omit complexes with only IC50 values from the training/cross-validation set. These were, however, included as an independent set for prediction, because of their different physicochemical but related nature. A list of 19 inhibitors of protein–protein interactions with known molecular structures and IC50 values was manually procured from the literature using 2P2I database and iPPI-DB as starting points (Table 2).

Table 2

IC50 Binding Affinity Data Set of Protein–Protein Interaction Inhibitorsa

	interaction	biological role		PDB (complex)	K_d (complex) (M)
	Bcl-x_L/Bak	programmed cell death		1bxl	3.4 × 10^–7

Original references for the IC50 data were retrieved from the relevant PDB entries in the Protein Data Bank (www.pdb.org).

Kd data, retrieved from the Protein Data Bank (www.pdb.org).

Original references for the IC50 data were retrieved from the relevant PDB entries in the Protein Data Bank (www.pdb.org). Kd data, retrieved from the Protein Data Bank (www.pdb.org). Additionally, affinity predictions of inhibitors that target a protein–protein complex, not included in the training/cross-validation set were performed with HADDOCK2P2I. These inhibitors were designed to disrupt the interaction between bromodomains that specifically recognize ε-N-acetylation of lysine residues, a post-translational modification common on histone tails. All interactions include measured Kd data and have available molecular structures in the Protein Data Bank (www.pdb.org).

Estimation of Experimental Uncertainty and Qualitative Comparison to Prediction Error

To assess the physicochemical relevance of the prediction error in the cross-validated set and the test sets used, we collected from the iPPI-DB[10] all interactions that fulfill the following criteria: (1) The same small molecule must inhibit the same interaction. (2) Two or more binding affinity measurements using different experimental methods should be available, but affinity constants must be of the same nature (e.g., Kd, Ki, or IC50). We, therefore, compiled a data set of 72 interactions that match the above-mentioned criteria (Supporting Information Table S3). Estimation of experimental ambiguity of the derived data is performed using descriptive statistics and distribution analysis, and is empirically compared to the prediction accuracy.

Assessment of the Structural Variability between Ligands and Proteins in the Data Set

To verify that the protein–inhibitor complexes selected for both training and prediction were not structurally redundant, which meant the predictive ability of our biophysical model would be limited to a few classes of complexes, we carried out an analysis of the similarity between the ligands and the protein binders of each data sets. The protein similarity was assessed using sequence similarity following a pairwise global alignment of the sequences (as taken from the RCSB PDB Web site) calculated with the Needleman–Wunsch algorithm available at the EBI Web site (www.ebi.ac.uk/Tools/psa/emboss_needle/).[32,33] Features such as histidine tags from the purification protocol were not included in these calculations. Ligands were compared using a substructure key-based 2D Tanimoto similarity,[34] which was calculated via the PubChem score matrix web service (http://pubchem.ncbi.nlm.nih.gov//score_matrix/score_matrix.cgi).[35]

Results

We investigated the protein–protein complexes from the 2P2I database and collected from the literature affinity data for 27 inhibitors targeting 7 different protein–protein complexes. Ki and Kd data were combined into a single data set, excluding any IC50 measurements, which was manually curated and assembled from two different databases. These IC50 values and their respective complexes were used as a test set, to assess the performance of the developed functions in unknown cases, with fuzzier data.

Compilation of a Structure-Based Data Set of PPI Inhibitors with Known Ki and Kd Data

The resulting binding constants include Kd and K values describing the modulation/inhibition of the 7 protein–protein complexes (see Table 1). They span the range from very low (mM) to very high affinity (nM), covering thus a broad spectrum of potencies. For two of the complexes (Bcl-xL/Bak and SMAC-DIABLO/XIAP-BIR3), inhibitors with a large range of affinities have been cocrystallized. For example, the survival protein Bcl-xL in complex with the death-promoting region of the Bcl-2-related protein Bak[36] is a medium affinity complex (Kd = 340 nM), whereas designed PPI inhibitors against it exhibit binding affinities from 4.3 mM to 0.5 nM. Another complex reported in this study is Smac/DIABLO bound to the third BIR domain (BIR3) of XIAP, a complex critical for cellular apoptosis.[37] Its reported affinity is 420 nM, whereas inhibitors designed to disrupt this interaction cover a broad range of Ki, from 300 mM to 67 nM. For the other complexes reported in Table 1, fewer inhibitors with known Ki or Kd data have been reported; they exhibit various binding potencies.

Evaluation of Force Field Topologies and Parameters for PPI Inhibitors

The crystal structures of the PPI inhibitors in complex with their respective proteins were subjected to short refinement and scored using both the HADDOCK refinement web interface[25] and a local installation (see Methods). Force field topologies and parameters of all PPI inhibitors were generated using both PRODRG[38] and ACPYPE.[39] The main difference between the two tools is that, while PRODRG is based on a database search method to parametrize the molecules using OPLS-like parameters,[22] ACPYPE uses ANTECHAMBER[40] with a semiempirical quantum calculation method for the partial charges. A comparison of the original HADDOCK scores (HSorig = Evdw + 0.2Eelec + Edesol) of the complexes using these two parameter sets reveals differences (Figure 1A) (r2 = 0.73, N = 27, p-value < 0.0001), showing that ligand parameters play a substantial role in defining the interaction energy of each complex. While the desolvation (Edesolv) and van der Waals energies (EvdW) are essentially identical (r2 = 0.89, N = 27, p-value <0.0001 and r2 = 0.88, N = 27, p-value <0.0001, respectively) (Figure 1B and D), the electrostatic component (Eelec) changes dramatically (r2 = 0.33, N = 27, p-value = 0.0017) (Figure 1C) because of differences in partial charges. The buried surface area (BSA), which depends on the van der Waals radii, is however almost identical (r2 = 0.98, N = 27, p-value <0.0001, data not shown).

Figure 1

Correlation plots between energetic components calculated using ACPYPE and PRODRG-derived parameters. (A) Original HADDOCK score (Evdw + 0.2Eelec + Edesol), (B) van der Waals energy, (C) electrostatic energy, and (D) desolvation energy.

Performance of HADDOCK Score in PPI Inhibitor Binding Affinity Prediction. Training and Cross-Validation of HADDOCK2P2I

The individual components of HSorig show different contributions to the binding affinity with the van der Waals energy being the most significant contributor (r2 = 0.53, N = 27, p-value <0.0001 for EvdWACPYPE and r2 = 0.38, N = 27, p-value = 0.0006 for EvdWPRODRG) (Figure 2 A and B). The BSA also shows a strong positive correlation with binding affinity independently of the parametrization tool used (r2 = 0.51, N = 27, p-value <0.0001) (Figure 2C). The electrostatic energy, in contrast, does not correlate to binding affinity (r2 = 0.07, N = 27, p-value = 0.1823 for EelecACPYPE and r2 = 0.04, N = 27, p-value =0 .3172 for EelecPRODRG) (Figure 2D).

Figure 2

Correlation plots of experimentally determined binding affinities of PPI modulators with (A, B) the van der Waals energy calculated with the two ligand parametrization schemes, (C) the buried surface area (calculated using PRODRG parameters for the ligands), and (D) the electrostatic energy. In terms of binding affinity prediction using HSorig, both parametrization schemes exhibit comparable results (r2 = 0.40, N = 27, p-value = 0.0004) (Figure 3A and B).

Figure 3

Correlation plots of experimentally determined binding affinities with the original HADDOCK score (Evdw + 0.2Eelec + Edesol) calculated with (A and B) the two parametrization schemes and (C) the optimized HADDOCK2P2I score. (D and E) Cartoon representation of (D) the near-rigid Bcl-xL/Bak and (E) the flexible Xiap-BIR3/caspase-9 protein–protein complex. The components of the original HADDOCK score, together with the BSA, which is used in HADDOCK in the scoring of the initial models at the rigid body stage, were optimized (see Methods) leading to the optimized HADDOCK score, termed HADDOCK2P2I. It is described by the following equation:Note that the parameters are averages from the 4-fold cross-validation. As can been seen from the standard deviations given as subscripts, there can be quite some variation in some parameters, in particular the weight of the desolvation term. The values of these coefficients may very well vary depending on the data set used for cross-validation; only a much larger data set (unfortunately not available) would allow a better convergence. Still, the model selected (eq 3) outperforms all other models tested (see Supporting Information Table S2), reaching an r2 = 0.57, N = 27, p-value < 0.0001 (r2 = 0.53, N = 27, p-value < 0.0001 after cross-validation) for complexes parametrized with PRODRG (Figure 3C) with a mean absolute error (MAE) of 0.8 ± 0.6 kcal mol–1. It includes two of the original components of the HADDOCK score with an additional BSA term which substitutes the original van der Waals energy term. For complexes parametrized with ACPYPE, the model still retains its predictive capacity (r2 = 0.45, N = 27, p-value = 0.0001 after cross-validation), albeit to a lesser extent (see Supporting Information Table S2). Note that the HADDOCK2P2I scores from both parametrization schemes are very similar (r2 = 0.88, N = 27, p-value < 0.0001). The most notable difference compared to the original HADDOCK score, next to the difference in coefficients, is that the van der Waals energy term has been replaced by the Buried Surface Area term. A protein–protein complex of central medical relevance is the one formed between the survival protein Bcl-xL and the death-promoting region of the Bcl-2-related protein Bak (Figure 3D). Proteins in the Bcl family are critical regulators of apoptosis and contain members that inhibit programmed cell death, such as Bcl-xL and Bcl-2. These proteins are overexpressed in many cancers and contribute to tumor initiation, progression, and resistance to therapeutics. Successful inhibition of this protein–protein interaction will have major consequences in cancer therapeutics.[41] Both the original (r2 = 0.86, N = 6, p-value < 0.001) and optimized (r2 = 0.94, N = 6, p-value < 0.001) HADDOCK scores are able to relate the scores of all six designed inhibitors for this particular complex to their potency, with the optimized HADDOCK2P2I performing slightly better (Figure 3 solid circles, compare A–C). Although the model can, in general, efficiently predict affinity data for different inhibitors of protein–protein complexes, the prediction performance is directly influenced by conformational changes taking place upon binding: indeed, while for the rigid interface of the BcL-XL/Bak complex discussed above (Figure 3D) predicted affinities are lying close to the regression line (solid circles in Figure 3C), in contrast, for the more flexible interface region of the Xiap-BIR3, that is also flexible in the interaction with Caspase 9 (Figure 3E), predicted affinities show a poor, or even lack of, correlation to experimental values (solid triangles in Figure 3C). Last but not least, when we calculate the simple correlation between molecular weight of the inhibitor and its corresponding affinity, significant correlations emerge (r2 = 0.43, N = 27, p-value = 0.0002), albeit weaker compared to the derived HADDOCK2P2I score (r2 = 0.51, N = 27, p-value < 0.0001), indicating that 3D information is needed to more efficiently predict ligand affinities.

Compilation of a Structure-Based Data Set for PPI Inhibitors with IC50 Values. Prediction of IC50 Data with the HADDOCK Score and HADDOCK2P2I

To assess the performance of both the original HADDOCK score and HADDOCK2P2I, we compiled a data set of related affinity data that include a total of 19 protein-inhibitor complexes for 6 different protein–protein interactions (Table 2). Four complexes are new and have not been incorporated in the training/cross-validation set, including inhibitors of the interaction between human transcriptional coactivator p75 (LEDGF) and viral integrase. The derived inhibitors have corresponding IC50s ranging from 12 μM to 19 nM. Tighter binding is exhibited by inhibitors of another host–virus interaction, that between a viral integrin and CD54, also known as Intercellular Adhesion Molecule 1 (ICAM 1), where IC50s span from 69 nM to 400 pM. Overall, the derived test set comprises different complexes with different binding constants, making this data set significantly distinct compared to the test/cross-validation set. The best performance in predicting IC50 data is obtained for HADDOCK2P2I (r2 = 0.61, N = 19, p-value < 0.0001) but predictions with the original HADDOCK score are also reliable (r2 = 0.52, N = 19, p-value = 0.0005) (Figure 4A and B). Note, however, that these results should be treated with caution because the method was developed to reproduce Kd measurements in a quantitative manner and not IC50 values. Unfortunately, we could not compile an independent test set of K or K data due to the absence of combined structural and affinity data for protein–protein interaction inhibitors. There is however a clear trend for HADDOCK2P2I to relate to IC50 values for different complexes. The molecular weight of these ligands also correlates with affinity (r2 = 0.37, N = 19, p-value = 0.006), but the correlations with HADDOCK2P2I are much better (r2 = 0.61, N = 19, p-value < 0.0001), and those by the original HADDOCK score are also higher (r2 = 0.52, N = 19, p-value = 0.0005). This confirms the need of structural information to gain additional insights into ligand affinity.

Figure 4

Correlation plots of experimentally determined binding affinities with (A) the original HADDOCK score (Evdw + 0.2Eelec + Edesol) and (B) the optimized HADDOCK2P2I score. (C) Prediction of affinities for inhibitors of bromodomains that recognize acetylated histone tails for complexes 1–5 corresponding to PDB IDs 3ONI, 4FLP, 3MXF, 3P5O, and 3ZYU, respectively. HADDOCK2P2I was also used to predict Kd values for 5 known complexes of the same family and initially found in iPPI-DB. These data concern different bromodomains (BRD4, BRD2, BRDT), for which small-molecule inhibitors have been developed to disrupt their interaction with the ε-N-acetylated histone tails. Overall, the prediction of Kd for this set is reasonable, and the absolute mean error of the prediction is ∼1.6 kcal mol–1 (Figure 4C). Note that the highest prediction error (∼1.9 kcal mol–1) corresponds to the complex between BRD4 and I-BET151 (GSK1210151A) inhibitor. The complex buries an unusually large surface area (1351 Å2) that directly leads to higher predicted affinity. Note that large buried surface area in macromolecular complexation could be the outcome of extended conformational changes upon binding and, therefore, hamper accurate affinity estimation.[16]

Data Obtained from Different Experimental Methods Profoundly Limits the Prediction Accuracy

The iPPI-DB[10] has the largest collection of affinity data for modulators/inhibitors of protein–protein interactions to date. It also includes affinity data for compounds targeting the same protein–protein interaction that were measured using different experimental methods. Multiple Kd data are rather scarce. Inhibition of the bromodomain 3 activity (BRD3) by compound entry 1603 in iPPI-DB was measured both by isothermal titration calorimetry (ITC) and surface plasmon resonance (SPR). A 4-fold difference between affinities was observed (KdITC = 50 nM, KdSPR = 200 nM). For the same compound targeting the bromodomain 4 (BRD4) activity, a 6-fold difference in affinities was observed (KdITC = 55 nM, KdSPR = 324 nM). Even measurements with the same biophysical methods can vary – as an example, compound entry 713 in iPPI-DB inhibits the XIAP-Smac interaction in the μM range, but measured data with fluorescence polarization (FP) reach a 1.3-fold difference in Kd (KdFP1 = 0.275 μM, KdFP2 = 0.209 μM). IC50 values obtained from various cellular assays are more abundant in the database. We have compiled a list that contains multiple IC50 measurements for 72 interactions (Supporting Information Table S3). In particular, FP and fluorescence resonance energy transfer (TR-FRET) were used to measure inhibition of different bromodomains for a variety of ligands (Figure 5A). Correlation between methods is strong for individual bromodomains (BRD1 r2 = 0.76, N = 9, p-value = 0.002, BRD2 r2 = 0.80, N = 9, p-value = 0.001; BRD3 r2 = 0.88, N = 8, p-value = 0.001), but when considered together, the relation is less profound, explaining 72% of the data (r2 = 0.72, N = 26, p-value < 0.0001). What is also clear is that IC50s obtained with TR-FRET are always of better affinity, reaching a 5-fold decrease in IC50, corresponding to 0.7 in −log scale. Therefore, a correction of the relationship with an additional β parameter is needed (y = 0.7x + 1.9, β = 1.9) (Figure 5A); When modeling affinity, however, the biophysical data are not corrected by this additional parameter because we assume a 1:1 relation between affinities measured from different experimental methods. In that case, the derived relationships lead to much lower correlations, albeit still significant (r2 = 0.54, N = 26, p-value < 0.0001) (Figure 5A). Similar conclusions have been made by experimentalists themselves, showing that 36 sulfonamides aiming to inhibit BRD4 activity exhibit similar IC50 values, but not identical between cell-based and biochemical assays (r2 = 0.57, N = 36, p-value < 0.0001).[42]

Figure 5

Assessment of experimental uncertainty in the determination of IC50 for PPI inhibitors using different assays. (A) Inhibitors of various bromodomains, measured by both fluorescence polarization and fluorescence resonance energy transfer (TR-FRET). Two regression lines are shown (y = αx + β and y = αx), highlighting that absence of the β coefficient substantially lowers r2. (B) Inhibitors of the MDM2/p53 interaction, measured by both ELISA and cell proliferation assays. (C) Distribution of fold differences in IC50 for 72 inhibitors targeting various protein–protein interactions that have been measured by two (or more) different experimental methods. Another example is the inhibition of the MDM2/p53 interaction using different ligands and measured by both ELISA and cell proliferation assays (CPA) (Figure 5B); we observe that IC50 values do indeed correlate without adding a correction coefficient β to their linear relation (r2 = 0.68, N = 21, p-value < 0.0001), but measurements may differ by more than 2 orders of magnitude (e.g., for entry 1143 of iPPI-DB, IC50ELISA = 0.1 μM, and IC50CPA = 158 μM). The distribution of the overall ambiguity in affinity measurements is shown in Figure 5C for all 72 entries with multiple experimental measurements collected from iPPI-DB (Supporting Information Table S3). Nearly half of the interactions measured have an ambiguity of <0.4 in −log IC50 units, which corresponds to less than 2-fold changes in IC50. The other half of the data exhibit deviations in IC50s by more than 2-fold, reaching a maximum of 1650-fold change, the latter representing changes in −log IC50 by 3.2 units. Overall, the average experimental uncertainty for these data is rather low, 0.7 ± 0.7 −log IC50 units, with a median of 0.5 −log IC50 units and a maximum of 3.2 −log IC50 units. Considering such deviations in −log IC50 units for a single system using two independent measurements, the prediction error of HADDOCK2P2I on the validation set (0.8 ± 0.6 kcal mol–1), on the “fuzzy” IC50 test set (1.6 ± 0.9 kcal mol–1) and the Kd set concerning inhibition of different bromodomains (1.3 ± 0.5 kcal mol–1), is in the same range of accuracy as experimental uncertainty.

Discussion

HADDOCK2P2I is a biophysical model optimized to predict the binding affinities of inhibitors of PPIs. By optimizing the weights of the original HADDOCK score and including a BSA term, the resulting model is able to predict binding affinities of PPI inhibitors close to experimental error (∼2-fold larger). To test this, we have compiled from iPPI-DB and analyzed a set of binding affinities obtained with various experiments for the same system. We have estimated an experimental uncertainty for IC50 values of 0.7 ± 0.7 in −log IC50 units (based on 72 data points), with a maximum of 3.2 −log IC50 units, the latter seemingly rare. Experimental conditions can also influence binding affinity determination, especially changes in pH. However, modeling the effect of pH was outside the scope of this study; this might well, if properly accounted for, lower the prediction error. Although the effect of pH on the binding affinity of PPI inhibitors is unclear at this time, it is known that for protein–protein complexes, changing the pH by three units, changes the Kd by a factor of 10–50, and ΔG by 1.4–2.3 kcal mol–1.[16] Despite those limitations, our algorithm reasonably reproduces a variety of experimental affinities of different nature (IC50, Ki, Kd) for distinct protein–protein interaction inhibitors. Since new PPI inhibitors are regularly published and crystallized with associated biophysical measurements for their interaction, this leaves room for further optimization of our HADDOCK2P2I binding affinity predictor. One could possibly argue that, because of the limited size of the data set, the prediction capacity of HADDOCK2P2I is not generalizable. Previous studies on scoring functions for “classical” protein–ligand complexes have shown that such limited amount of training data leads to a bias, which could only be surpassed when more than 100 cases are available in a data set.[43] The diversity of the data as well as number of predictor variables used may also influence the results. To exclude a potential lack of diversity, we performed a similarity analysis of the proteins and ligands included in both training/cross-validation and test sets; this highlights the diversity of the studied systems and reflects their nonredundancy (Figure 6). Even for systems that have highly homologous protein structures, single mutations in the sequence, being directly at the interface or not, are often observed that could have implications in the binding energies of the ligands.

Figure 6

All-versus-all similarity analysis for the proteins[33] and their bound inhibitors[34] of the used data set, highlighting the diversity in the systems under study (shown at the left of the matrix). The upper-right and the lower-left halves of the matrix represent the all-versus-all similarity for the proteins and their bound inhibitors, respectively. Rows and columns 1–27 and 28–51 correspond to the training/cross-validation set and the independent test set, respectively (following numbering introduced in Tables 1 and 2). In this work, two different ligand parametrization tools were compared: a semiquantum mechanical approach (ACPYPE), and the faster, database-driven PRODRG. Both parametrization schemes yield similar performance in terms of binding affinity prediction using the optimized HADDOCK score, with PRODRG slightly outperforming ACPYPE. The main difference between the two sets of parameters resides in the electrostatics partial charges. While this might not affect much affinity prediction based on refined crystal structures, it might well have a much more profound impact on docking results, something that should be evaluated in the future. The prediction performance for PPI inhibitors seems somewhat better than that of the most recent protein–ligand/drug design program. The latter, when tested against new blind data sets, showed a predictive capacity ranging from r2 = 0.30 to 0.40.[44,45] For a fair comparison, HADDOCK2P2I and other small ligand binding affinity models should be tested against similar data sets. One test set used in this study contains IC50 data; these cannot be related to actual Kd or Ki measurements from biophysical methods, since substrate Michaelis constant (Km) and related concentration must be reported (S), assuming the inhibitor is competitive. If S ≪ Km, then Ki ≈ IC50, but again, verification from classical biophysical methods is advisable. PPI inhibitors differ in nature from small molecule inhibitors that target enzymes and it remains to be seen how well the optimized function presented here will predict small molecule binding affinities (which was outside the scope of this work). Next to yielding an optimized function for binding affinity prediction, this study also provides new insights into the determinants of the binding affinity of PPI inhibitors. In particular, affinity prediction deteriorates as a function of conformational freedom of the system under study, directly pointing to missing entropic contributions (Figure 3C–E). These results are similar in nature with the ones we derived for protein–protein complexes and with the impact of conformational change on binding affinity prediction.[15,16] Nevertheless, the buried surface area (BSA) and van der Waals interactions show moderate-to-high correlations with binding affinity data for all complexes tested. The two are of course directly correlated. The desolvation and electrostatic energies show more complicated profiles than just plain correlation. This does not mean that they do not contribute per se: both the original and optimized models for the HADDOCK score clearly show a direct contribution of these two components, albeit to a smaller extent than the BSA: The BSA contributes on average, 3.6 ± 1.3 kcal mol–1 to the overall HADDOCK2P2I score, whereas the contributions of electrostatics and desolvation only reach 0.5 ± 0.4 and 0.1 ± 0.2 kcal mol–1, respectively, when calculated over the entire validation data set. The BSA and van der Waals interactions are features that were already known to be critical factors for the affinity of protein–ligand[46] and protein–protein complexes.[47,48] The correlations calculated in this study for the BSA of inhibitors of protein–protein interactions are, however, substantially higher than the ones calculated for protein–protein complexes,[16] but smaller than for standard protein–ligand complexes.[46] This indicates that, in designing highly affine inhibitors of PPIs, one should aim at optimizing the complementarity of the inhibitors with the protein interface and always consider conformational changes as these are a limiting factor for accurate prediction.[49] In conclusion, the newly designed score, HADDOCK2P2I, should facilitate and guide the design of PPI inhibitors, especially for the less flexible interfaces.

44 in total

1. PRODRG: a tool for high-throughput crystallography of protein-ligand complexes.

Authors: Alexander W Schüttelkopf; Daan M F van Aalten
Journal: Acta Crystallogr D Biol Crystallogr Date: 2004-07-21

2. Molecular interaction fields and 3D-QSAR studies of p53-MDM2 inhibitors suggest additional features of ligand-target interaction.

Authors: Cristina Dezi; Andrea Carotti; Matteo Magnani; Massimo Baroni; Alessandro Padova; Gabriele Cruciani; Antonio Macchiarulo; Roberto Pellicciari
Journal: J Chem Inf Model Date: 2010-08-23 Impact factor: 4.956

Review 3. Computational identification of inhibitors of protein-protein interactions.

Authors: Shijun Zhong; Alba T Macias; Alexander D MacKerell
Journal: Curr Top Med Chem Date: 2007 Impact factor: 3.295

Review 4. BCL-2 family antagonists for cancer therapy.

Authors: Guillaume Lessene; Peter E Czabotar; Peter M Colman
Journal: Nat Rev Drug Discov Date: 2008-12 Impact factor: 84.694

5. Comparative assessment of scoring functions on a diverse test set.

Authors: Tiejun Cheng; Xun Li; Yan Li; Zhihai Liu; Renxiao Wang
Journal: J Chem Inf Model Date: 2009-04 Impact factor: 4.956

6. Are scoring functions in protein-protein docking ready to predict interactomes? Clues from a novel binding affinity benchmark.

Authors: Panagiotis L Kastritis; Alexandre M J J Bonvin
Journal: J Proteome Res Date: 2010-05-07 Impact factor: 4.466

7. Fragment-based discovery of bromodomain inhibitors part 2: optimization of phenylisoxazole sulfonamides.

Authors: Paul Bamborough; Hawa Diallo; Jonathan D Goodacre; Laurie Gordon; Antonia Lewis; Jonathan T Seal; David M Wilson; Michael D Woodrow; Chun-Wa Chung
Journal: J Med Chem Date: 2012-01-11 Impact factor: 7.446

8. Hot spots and transient pockets: predicting the determinants of small-molecule binding to a protein-protein interface.

Authors: Alexander Metz; Christopher Pfleger; Hannes Kopitz; Stefania Pfeiffer-Marek; Karl-Heinz Baringhaus; Holger Gohlke
Journal: J Chem Inf Model Date: 2011-12-27 Impact factor: 4.956

9. 2P2Idb: a structural database dedicated to orthosteric modulation of protein-protein interactions.

Authors: Marie Jeanne Basse; Stéphane Betzi; Raphaël Bourgeas; Sofia Bouzidi; Bernard Chetrit; Véronique Hamon; Xavier Morelli; Philippe Roche
Journal: Nucleic Acids Res Date: 2012-11-30 Impact factor: 16.971

10. PocketQuery: protein-protein interaction inhibitor starting points from protein-protein interaction structure.

Authors: David Ryan Koes; Carlos J Camacho
Journal: Nucleic Acids Res Date: 2012-04-20 Impact factor: 16.971

14 in total

Review 1. Hybrid methods for combined experimental and computational determination of protein structure.

Authors: Justin T Seffernick; Steffen Lindert
Journal: J Chem Phys Date: 2020-12-28 Impact factor: 3.488

Review 2. Computational Structure Prediction for Antibody-Antigen Complexes From Hydrogen-Deuterium Exchange Mass Spectrometry: Challenges and Outlook.

Authors: Minh H Tran; Clara T Schoeder; Kevin L Schey; Jens Meiler
Journal: Front Immunol Date: 2022-05-26 Impact factor: 8.786

3. Based on Network Pharmacology and Molecular Dynamics Simulations, Baicalein, an Active Ingredient of Yiqi Qingre Ziyin Method, Potentially Protects Patients With Atrophic Rhinitis From Cognitive Impairment.

Authors: Xueran Kang; Yuxing Sun; Bin Yi; Chenyan Jiang; Xiaojun Yan; Bin Chen; Lixing Lu; Fangze Shi; Yuanbo Luo; Yisheng Chen; Qian Wang; Runjie Shi
Journal: Front Aging Neurosci Date: 2022-06-10 Impact factor: 5.702

Review 4. Bioinformatics-Aided Venomics.

Authors: Quentin Kaas; David J Craik
Journal: Toxins (Basel) Date: 2015-06-11 Impact factor: 4.546

Review 5. On the energy components governing molecular recognition in the framework of continuum approaches.

Authors: Lin Li; Lin Wang; Emil Alexov
Journal: Front Mol Biosci Date: 2015-03-06

6. dMM-PBSA: A New HADDOCK Scoring Function for Protein-Peptide Docking.

Authors: Dimitrios Spiliotopoulos; Panagiotis L Kastritis; Adrien S J Melquiond; Alexandre M J J Bonvin; Giovanna Musco; Walter Rocchia; Andrea Spitaleri
Journal: Front Mol Biosci Date: 2016-08-31

7. Performance of HADDOCK and a simple contact-based protein-ligand binding affinity predictor in the D3R Grand Challenge 2.

Authors: Zeynep Kurkcuoglu; Panagiotis I Koukos; Nevia Citro; Mikael E Trellet; J P G L M Rodrigues; Irina S Moreira; Jorge Roel-Touris; Adrien S J Melquiond; Cunliang Geng; Jörg Schaarschmidt; Li C Xue; Anna Vangone; A M J J Bonvin
Journal: J Comput Aided Mol Des Date: 2017-08-22 Impact factor: 3.686