| Literature DB >> 20231898 |
Raphaël Bourgeas1, Marie-Jeanne Basse, Xavier Morelli, Philippe Roche.
Abstract
BACKGROUND: In the last decade, the inhibition of protein-protein interactions (PPIs) has emerged from both academic and private research as a new way to modulate the activity of proteins. Inhibitors of these original interactions are certainly the next generation of highly innovative drugs that will reach the market in the next decade. However, in silico design of such compounds still remains challenging. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 20231898 PMCID: PMC2834754 DOI: 10.1371/journal.pone.0009598
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow chart indicating how the 2P2IDB dataset was build from data mining.
Two separate approaches were used to retrieve protein/protein complexes with known inhibitors for which structural information was available. The protein databank was search through the Dockground server [23] which led to 202 complexes that were filtered using an advanced query and manual inspection of the interface to give 9 protein/protein complexes and 25 protein/inhibitor complexes. Exhaustive search of the literature led to the discovery of 8 protein complexes corresponding to 31 protein/ligand complexes.
Complex families in 2P2IDB.
| Class | # | Family | Complex | Number of Inhibitors | Source | Affinity | Ref |
| I | 1 | BclXL/Bak | 1bxl | 8 | PubMed | 340 |
|
| I | 2 | MDM2/p53 | 1ycr 1ycq | 3 | PubMed | 600 |
|
| I | 3 | XIAP BIR3/CASPASE 9 | 1nw9 | 2 | PubMed | 20 |
|
| I | 4 | XIAP BIR3/SMAC | 1g73 | 5 | PubMed | 420 |
|
| I | 5 | ZipA/FtsZ | 1f47 | 4 | PubMed | 20,000 |
|
| II | 6 | Chagasin/Papain | 3e1z | 1 | PDB | 0.036 |
|
| II | 7 | E2/E1 | 1tue | 1 | PubMed | na |
|
| II | 8 | FKBP12/TGFR | 1b6c | 17 | PDB | na |
|
| II | 9 | IL-2/IL-2R | 1z92 | 8 | PubMed | 10 |
|
| II | 10 | MMP1/TIMP1 | 2j0t | 1 | PDB | 0.40 |
|
| II | 11 | MMP3/TIMP1 | 1oo9 | 1 | PDB | 0.22 |
|
| II | 12 | Subtilisin/Eglin C | 1cse 1r0r 1to2 | 1 | PDB | 0.029 |
|
| II | 13 | Thrombin/Protein C inhibitor | 3b9f | 1 | PDB | na |
|
| II | 14 | Trypsin/Trypsin inhibitor | 2uuy | 3 | PDB | 0.02 |
|
PPIs were subdivided into class I that correspond to protein/peptide interactions with less than six segments at the interface (families 1–5) and class II that represent more globular interacting domains with more segments (families 6–14).
PDB code of protein/protein complexes.
Number of inhibitors present in the database for a given protein/protein complex.
Structures were retrieved through exhaustive search of the protein databank (PDB) or literature data mining (PubMed).
Dissociation constant (KD) of the protein/protein complexes are indicated in nanomolar when available.
indicates Ki values.
Figure 2List of representative small molecule inhibitors for each protein/protein complex in 2P2IDB.
Only the inhibitor used to define the subset of the interface at 4.5 Å around the ligand is shown. For each inhibitor, the name of the protein family, the PDB code of the complex and the molecular weight of the ligand are indicated.
General interface parameters of PPIs in the 2P2IDB dataset.
| PDB | Plan (Å) | Ecc | SecS | GV (Å3) | GV_I (Å) | Hb | SB | ASA (Å2) | H_ASA (%) | RMSD (Å) | Pockets (Å3) | Seg | |
|
| 1bxl | 3,60 | 0,75 | H/H | 2892 | 1,75 | 0,05 | 0 | 825 | 45,6 | 2,3 | 97 | 5 |
|
| 1ycr | 2,99 | 0,86 | H/H | 786 | 0,60 | 0,34 | 1 | 660 | 54,2 | 1,9 | 351 | 3 |
|
| 1ycq | 2,14 | 0,62 | H/H | 1286 | 1,38 | 0,39 | 1 | 455 | 65,7 | 1,8 | 215 | 2 |
|
| 1nw9 | 2,32 | 0,78 | S/S | 3567 | 1,79 | 0,78 | 0 | 241 | 100,0 | 2,4 | 244 | 2 |
|
| 1g73 | 2,18 | 0,73 | S/S | 3500 | 5,35 | 0,88 | 0 | 472 | 88,9 | 2,3 | 140 | 3 |
|
| 1f47 | 2,76 | 0,75 | S/H | 3503 | 3,24 | 0,09 | 0 | 541 | 39,2 | 0,9 | 0 | 5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| 3e1z | 3,00 | 0,89 | C/C | 4286 | 2,26 | 0,53 | 0 | 947 | 55,4 | 0,4 | 279 | 9 |
|
| 1tue | 2,59 | 0,71 | H/H | 5042 | 2,86 | 0,55 | 3 | 946 | 32,6 | 1,4 | 202 | 7 |
|
| 1b6c | 2,82 | 0,42 | S/H | 5457 | 3,14 | 0,17 | 0 | 869 | 57,2 | 0,5 | 387 | 8 |
|
| 1z92 | 2,40 | 0,86 | H/C | 4431 | 2,47 | 0,7 | 5 | 898 | 39,8 | 1,3 | 146 | 8 |
|
| 2j0t | 2,76 | 0,55 | C/C | 5380 | 4,08 | 0,83 | 0 | 660 | 48,8 | 0,5 | 323 | 7 |
|
| 1oo9 | 3,02 | 0,78 | C/C | 5157 | 2,76 | 0,24 | 0 | 936 | 39,7 | 1,2 | 227 | 9 |
|
| 1cse | 2,67 | 0,65 | C/S | 3858 | 3,01 | 0,74 | 0 | 640 | 62,6 | 0,3 | 282 | 8 |
|
| 1r0r | 2,54 | 0,75 | C/C | 3763 | 2,99 | 0,78 | 0 | 630 | 66,3 | 0,3 | 230 | 9 |
|
| 1to2 | 3,11 | 0,61 | C/S | 3277 | 2,25 | 0,74 | 1 | 728 | 66,4 | 0,3 | 275 | 8 |
|
| 3b9f | 3,41 | 0,81 | C/C | 5538 | 4,33 | 0,72 | 2 | 639 | 38,0 | 0,6 | 350 | 9 |
|
| 2uuy | 2,57 | 0,73 | C/C | 3500 | 3,12 | 1,09 | 1 | 562 | 70,1 | 0,7 | 154 | 10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Planarity;
eccentricity;
secondary structure elements at the interface for the target and related partner;
Gap volume;
Gap volume index;
Hydrogen bonds per 100 Å2 of interface;
Salt bridges;
Accessible surface area buried at the interface of the protein/protein complex;
Accessible surface area hidden by the inhibitor;
Root mean square deviation (CA atoms) between unbound protein and complex;
when unbound protein was not available, rmsd between protein/protein and protein/ligand complexes was computed;
Total pocket volume at the interface;
Number of interface residue segments. For each parameter the mean and standard deviation are presented.
Figure 3Student t-test allowing the selection of three main discriminating parameters for the PCA analysis.
The t-test was calculated for the whole dataset and for each class (I and II) separately. Dotted lines indicate a threshold of confidence higher than 90% to differentiate 2P2IDB complexes from transient heterodimers. On the basis of this analysis, ASA, Gap volume, Number of interfacial segments, hydrogen bonds and the percentage of charged residues were selected for the PCA analysis.
Figure 4PCA analysis.
Six key parameters were selected to perform the PCA analysis to separate the complexes into different groups: The five parameters based on the t-test and defined in figure 3 as well as total pocket volume at the interface. A: Analysis on the whole interface. Three different PPI clusters were defined. Cluster 1 (green) regrouped all complexes from class I corresponding to targets interacting with a peptide. Subtilisin and trypsin complexes defined cluster 2 (pink). Cluster 3 (purple) regrouped all other protein/protein complexes. B: Same analysis done on the part of the interface that is 4.5 Å around the ligand. The protein/protein complexes were in three slightly different clusters. Four out of six class I complexes were grouped together. Subtilisin and trypsin complexes remained very closely associated.
Figure 52P2IDB Server (http://2p2idb.cnrs-mrs.fr).
A: Query procedures to retrieve data stored in the database for each PPI. The database can be searched by family, by PDB codes of the protein/protein complexes, protein/ligand complexes, or unbound proteins, ligand properties or cluster number. A list of PDB codes or Uniprot numbers is returned as well as links to relevant databases. A detailed report of properties of each PPI is also provided. B: Interactive menu to compare user defined PPI properties with the 2P2IDB dataset. The user can enter pre-calculated values for the six key descriptors that were used in the PCA analyses. In return, these parameters are compared to the 2P2IDB dataset and an estimation of the druggability of the target is proposed.