| Literature DB >> 21326236 |
Mark Nicholas Wass1, Gloria Fuentes, Carles Pons, Florencio Pazos, Alfonso Valencia.
Abstract
Deciphering the whole network of protein interactions for a given proteome ('interactome') is the goal of many experimental and computational efforts in Systems Biology. Separately the prediction of the structure of protein complexes by docking methods is a well-established scientific area. To date, docking programs have not been used to predict interaction partners. We provide a proof of principle for such an approach. Using a set of protein complexes representing known interactors in their unbound form, we show that a standard docking program can distinguish the true interactors from a background of 922 non-redundant potential interactors. We additionally show that true interactions can be distinguished from non-likely interacting proteins within the same structural family. Our approach may be put in the context of the proposed 'funnel-energy model'; the docking algorithm may not find the native complex, but it distinguishes binding partners because of the higher probability of favourable models compared with a collection of non-binders. The potential exists to develop this proof of principle into new approaches for predicting interaction partners and reconstructing biological networks.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21326236 PMCID: PMC3063693 DOI: 10.1038/msb.2011.3
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1Docking score distributions for benchmark complexes. The docking score distribution for a benchmark complex (red and white line) and one of the components docked with the background set (black lines) are shown for (A) Gt-α docked with RGS9 and (B) acetylcholinesterase complexed with fasciculin2. The native benchmark complexes are shown (red/blue structure) and selected models for the benchmark component docked with the background set (red/cyan).
Statistical significance of benchmark complex docking score distributions
| % Background better than | All (%) | Enzyme/ inhibitors (%) | Others (%) |
|---|---|---|---|
| The percentage of the background docking score distributions that the benchmark complex score distribution is significantly less than is shown. Cumulative counts and percentages are displayed. Statistical significance was tested using Wilcoxon rank-sum test at a 1% significance level. The results are also split for the two complex categories in the docking benchmark (enzyme/inhibitor and others). | |||
| 100 | 3 (5) | 1 (3) | 2 (7) |
| >99 | 14 (25) | 5 (17) | 10 (37) |
| >95 | 19 (34) | 6 (21) | 13 (48) |
| >90 | 25 (45) | 7 (24) | 18 (67) |
| >85 | 31 (55) | 10 (34) | 21 (78) |
| >80 | 36 (64) | 14 (48) | 22 (81) |
| >70 | 40 (71) | 17 (59) | 23 (85) |
| >60 | 46 (82) | 22 (76) | 24 (89) |
| >50 | 48 (85) | 23 (79) | 25 (93) |
| <50 | 56 (100) | 29 (100) | 27 (100) |
Figure 2Heat map of the docking model putative binding sites. The Heat map shows how often each residue is present in the binding site modelled by HEX for (A) Acetylcholinesterase/fasciculin2 and (B) transthyretin/retinol binding protein. The unbound structures are shown and they have been aligned with their equivalent component in the native complex. The colour scheme as shown in the key indicates the percentage of HEX poses that a residue formed part of the putative interface.
Figure 3Docking for a single superfamily. (A) Docking score distributions for the acetylcholinesterase and fasciculin2 complex (red) and for fasciculin2 docked to members of the α/β-hydrolases superfamily. The docking scores for the three most structurally similar proteins to acetylcholinesterase are shown in different colours with their structures. The score distributions for the remaining structures are shown in black. (B) Docking score distributions for Ras GTPase docked with PIP3 kinase (red line), and PIP3 kinase docked with other structures from the same SCOP superfamily as Ras GTPase. The native form of the GTPase/PIP3 kinase complex is shown.