| Literature DB >> 18939967 |
Solène Grosdidier1, Juan Fernández-Recio.
Abstract
BACKGROUND: The study of protein-protein interactions is becoming increasingly important for biotechnological and therapeutic reasons. We can define two major areas therein: the structural prediction of protein-protein binding mode, and the identification of the relevant residues for the interaction (so called 'hot-spots'). These hot-spot residues have high interest since they are considered one of the possible ways of disrupting a protein-protein interaction. Unfortunately, large-scale experimental measurement of residue contribution to the binding energy, based on alanine-scanning experiments, is costly and thus data is fairly limited. Recent computational approaches for hot-spot prediction have been reported, but they usually require the structure of the complex.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18939967 PMCID: PMC2579439 DOI: 10.1186/1471-2105-9-447
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Initial dataset of complexes used in this work
| Complexa | Resb | Receptor | Ligand | Unbound receptor | Resb | Unbound ligand | Resb | Complex typec |
| 2.60 | Growth hormone receptor | Growth hormone | - | - | 1HGU | 2.50 | B/U | |
| 2.00 | Ribonuclease inhibitor | Angiogenin | - | - | 1UN3 | 1.70 | B/U | |
| 3.00 | Fab 5G9 | Tissue Factor | 1K6Q | 2.40 | 2HFT | 1.69 | U/U | |
| 1.50 | p53 | p53 | - | - | - | - | B/B | |
| 2.00 | Barnase | Barstar | 1A2P | 1.50 | - | - | U/B | |
| 2.05 | Colicin E9 | Immunity protein Im9 | 1FSJ | 1.80 | - | - | U/B | |
| 2.00 | Tissue Factor | Factor VII | 2HFT | 1.69 | - | - | U/B | |
| 2.30 | Ribonuclease A | Ribonuclease inhibitor | 1FS3 | 1.40 | 2BNH | 2.30 | U/U | |
| 2.70 | IgG1 Fc fragment | DCAWHLGELV WCT-NH2 | 1H3V | 3.10 | - | - | U/B | |
| 3.00 | HYHEL-10 | HEL | 1GPO | 1.95 | 3LZT | 0.92 | U/U | |
| 2.50 | CD4 | gp120 | 1CDJ | 2.50 | - | - | U/B | |
| 1.95 | Zipa | FTSZ fragment | 1F7W | NMR | - | - | U/B | |
| 2.80 | Fc fragment | Protein A | - | - | - | - | B/B | |
| 3.50 | Fc fragment | Protein G | 1H3V | 3.10 | - | - | U/B | |
| 2.60 | IL-4 receptor | IL-4 | - | - | 1HIK | 2.60 | B/U | |
| 3.50 | T-cell antigen receptor | SEC3 | 1BEC | 1.70 | 1CK1 | 2.60 | U/U | |
| 2.80 | Antibody A6 | Interferon-γ receptor | - | - | - | - | B/B | |
| 2.30 | TEM-1 β-lactamase | BLIP | 1ZG4 | 1.55 | - | - | U/B | |
| 2.50 | NC10 | Neuraminidase N9 | - | - | 7NN9 | 2.00 | B/U | |
| 1.90 | Trypsin | BPTI | 1S0Q | 1.02 | 1G6X | 0.86 | U/U | |
| 1.80 | Antibody D1.3 | HEL | - | - | 3LZT | 0.92 | B/U |
aPDB Code, bResolution in Å; B, Bound; U, Unbound
Figure 1Experimental binding energy vs. computer predictions. Distribution of ΔΔG data vs. NIP values from rigid-body docking (FTDock+ZDOCK).
Comparison between NIP predictions (cut-off 0.4) from rigid-body docking and the experimentally known hot-spot residues
| Total | |||
| ΔΔG ≥ 1a | 40 | 128 | 168 |
| ΔΔG < 1a | 11 | 407 | 418 |
| Total | 51 | 535 | 586 |
a in kcal.mol-1
Benchmarking NIP hot-spot predictions on different datasets
| Initial dataset (21 cases) | ||
| NIP ≥ 0.2 | 68% | 43% |
| NIP ≥ 0.4 | 78% | 24% |
| FOLDEF | 73% | 46% |
| ROBETTA | 71% | 69% |
| Li's dataset (15 cases) | ||
| NIP ≥ 0.2 | 75% | 42% |
| NIP ≥ 0.4 | 91% | 26% |
| FOLDEF | 70% | 45% |
| ROBETTA | 64% | 60% |
| Additional dataset (all 22 cases) | ||
| NIP ≥ 0.2 | 59% | 34% |
| NIP ≥ 0.4 | 78% | 15% |
| Additional dataset (X-ray subunits) | ||
| NIP ≥ 0.2 | 73% | 44% |
| NIP ≥ 0.4 | 80% | 19% |
| Additional dataset (NMR subunits) | ||
| NIP ≥ 0.2 | 0% | 0% |
| NIP ≥ 0.4 | 0% | 0% |
| Additional dataset (modeled subunits) | ||
| NIP ≥ 0.2 | 59% | 33% |
| NIP ≥ 0.4 | 75% | 15% |
Comparison to ROBETTA and FOLDEF
a Our initial dataset (Table 1); b Values from Kortemme and Baker,14 on a sub-set of 19 complexes from our initial dataset; c Dataset compiled by Li et al.;28 d Values from Li et al.;28 e Our additional dataset (Table 6).
Detailed results for hot-spot predicted residues (NIP ≥ 0.4) with available experimental data.
| Complex | Number of predicted residues( | hot-spot prediction success(PPV) | Number of near-native posesa |
| 5 | 100% | 25 | |
| 6 | 67% | 6 | |
| 3 | 100% | 8 | |
| 1 | 100% | 5 | |
| 3 | 67% | 1 | |
| 7 | 100% | 0b | |
| 2 | 50% | 10 | |
| 1 | 100% | 0c | |
| 9 | 67% | 1 | |
| 3 | 100% | 0d | |
| 7 | 71% | 1 | |
| 4 | 50% | 3 |
Number of near-native solutions (RMSD ≤ 10 Å) within the ensemble of 100 lowest-energy docking orientations used to calculate the NIP values; best RMSD within the ensemble is 22.0 Å; best RMSD is 20.9 Å; best RMSD is 12.2 Å.
Figure 2Global performance of hot-spot prediction. Evaluation of prediction results according to the NIP cut-off value: global PPV (open squares), global sensitivity (diamonds), and percentage of cases with prediction (triangles).
Comparison between NIP predictions (cut-off 0.2) from rigid-body docking and the experimentally known hot-spot residues
| NIP ≥ 0.2 | NIP < 0.2 | Total | |
| ΔΔG ≥ 1 | 73 | 95 | 168 |
| ΔΔG < 1 | 34 | 384 | 418 |
| Total | 107 | 479 | 586 |
a in kcal.mol-1
Figure 3Examples of hot-spot predictions. Selected complex examples: SEC3 super antigen/T-cell β-chain (A to C) and HEL/D1.3 IgG1 (D to F). (A, D) Experimental data: hot-spots in red (ΔΔG ≥ 1 kcal.mol-1); non hot-spots in blue (ΔΔG < 1 kcal.mol-1). Only residues with available experimental data have been shown with labels. L- and H- in IgG1 labels indicate residues belonging to the light and heavy antibody chain respectively. (B, E) Computer predictions. Residues predicted to be hot-spots (NIP ≥ 0.4) are shown in red; the remaining residues are shown in a scale from red to blue. Only residues with NIP ≥ 0.4 have been labelled. (C, F). Complex X-ray structures (PDB codes 1JCK and 1VFB, respectively). Receptor residues are coloured according to the NIP values. Ligand is represented as a grey ribbon.
Additional dataset of complexes used in this work
| Complex | Res | Receptor | Ligand | Receptor PDB | Res (Id) | Ligand PDB | Res(Id) | Complex type |
| X-ray subunits | ||||||||
| 2.60 | Chymotrypsin | BPTI | 1.68 | 1.50 | U/U | |||
| 1.70 | IL-6R | IL-6 | 2.40 | 1.90 | U/U | |||
| N/A | N/A | KDR | VEGF | 1.95 | 2.50 | U/U | ||
| N/A | N/A | trkC | Neurotrophin-3 | 1.90 | 2.40 | U/U | ||
| 2.55 | Rabbit actin | Bovine profilin I | 1.54 | 2.00 | U/U | |||
| 1.90 | E5.2 | D1.3 | - | - | 1.80 | B/U | ||
| N/A | N/A | sHIR | Insulin | 3.80 | 1.95 | U/U | ||
| NMR subunits | ||||||||
| N/A | N/A | GPIIbIIIa | Kistrin | 2.90 | NMR | U/U | ||
| N/A | N/A | bFGF | FGFR1b | 1.60 | NMR | U/U | ||
| N/A | N/A | IGF-1R | IGF-1 | 2.60 | NMR | U/U | ||
| N/A | N/A | IGF-1bp | IGF-1 | 1.80 | NMR | U/U | ||
| Homology-based modeled subunits | ||||||||
| N/A | N/A | E9 DNase | Im2 | 1.80 | (66%) | U/M | ||
| N/A | N/A | AChR | Erabutoxin | 4.00 | 2.00 | cryo- EM/U | ||
| N/A | N/A | AChR | NmmI | 4.00 | 0.87 | cryo- EM/U | ||
| 2.80 | IL-2 receptor | IL-2 (human) | (100%) | 1.99 | M/U | |||
| N/A | N/A | IL-2 beta receptor(human) | IL-2 (murine) | (99%) | (64%) | M/M | ||
| N/A | N/A | IL-2 alpha receptor(murine) | IL-2 (murine) | (22%) | (64%) | M/M | ||
| N/A | N/A | IL-4/IL-4bp | GammaC | 2.30 | (99%) | U/M | ||
| N/A | N/A | gp75 | Neurotrophin-3 | (18%) | 2.40 | M/U | ||
| N/A | N/A | CD48 | CD2 | (39%) | 2.00 | M/U | ||
| N/A | N/A | Calcineurin | CaM | (59%) | NMR | M/U | ||
| 2.80 | hG-CSFbp | hG-CSF | (99%) | 2.20 | M/U | |||
a PDB Code; N/A, Not Available; b Resolution in Å or (Id): sequence identity with template in case of model; c B, Bound; U, Unbound; M, Model; cryo-EM, cryoelectron microscopy structures classified as modeled structures because of the low resolution compared to crystallographic or NMR structures, d PDB code of the template structure used for modelling