| Literature DB >> 31697436 |
Mieczyslaw Torchala1,2, Tereza Gerguri1, Raphael A G Chaleil1, Patrick Gordon2, Francis Russell2, Miriam Keshani2, Paul A Bates1.
Abstract
The formation of specific protein-protein interactions is often a key to a protein's function. During complex formation, each protein component will undergo a change in the conformational state, for some these changes are relatively small and reside primarily at the sidechain level; however, others may display notable backbone adjustments. One of the classic problems in the protein-docking field is to be able to a priori predict the extent of such conformational changes. In this work, we investigated three protocols to find the most suitable input structure conformations for cross-docking, including a robust sampling approach in normal mode space. Counterintuitively, knowledge of the theoretically best combination of normal modes for unbound-bound transitions does not always lead to the best results. We used a novel spatial partitioning library, Aether Engine (see Supplementary Materials), to efficiently search the conformational states of 56 receptor/ligand pairs, including a recent CAPRI target, in a systematic manner and selected diverse conformations as input to our automated docking server, SwarmDock, a server that allows moderate conformational adjustments during the docking process. In essence, here we present a dynamic cross-docking protocol, which when benchmarked against the simpler approach of just docking the unbound components shows a 10% uplift in the quality of the top docking pose.Entities:
Keywords: Aether Engine; CAPRI; DFIRE2; SwarmDock; conformational selection; conformational states space sampling; cross-docking; induced fit; normal modes; protein-protein docking; protein-protein interactions
Mesh:
Substances:
Year: 2019 PMID: 31697436 PMCID: PMC7496321 DOI: 10.1002/prot.25851
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
The 55 targets from Benchmark 59 and the CAPRI target T131, PDB code 6GBG32 (in bold font), used in this analysis
| PDB complex | Type | IRMSD | Difficulty | Receptor | Ligand | Protocols | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M1 | M2 | M3 | M1 | M2 | M3 | P0 | P1 | P2 | P3 | ||||
| 3EOA | A | 0.39 | RB | 252.64 | 26.56 | 57.65 | 7.52 | −2.17 | 10.17 | A | A | A | A |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3MXW | A | 0.48 | RB | 50.73 | −18.51 | 1.23 | 3.48 | −1.58 | −2.00 | M | M | M | M |
| 4G6M | A | 0.49 | RB | −13.69 | 9.58 | −2.87 | −1.53 | −0.65 | 6.12 | M | M | M | M |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3PC8 | ER | 0.50 | RB | −0.14 | 2.99 | −1.36 | −13.63 | −9.09 | −0.55 | H | H | H | H |
| 3VLB | EI | 0.51 | RB | −8.10 | 13.12 | −22.29 | 4.21 | −10.68 | −2.61 | M | M | M | M |
| 3P57 | OX | 0.53 | RB | −2.48 | −0.09 | −0.67 | 19.82 | 16.01 | 21.28 | A | A | A | A |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2YVJ | ER | 0.60 | RB | 4.32 | 10.89 | −10.91 | −0.42 | 3.78 | 1.52 | M | M | M | M |
| 4G6J | A | 0.61 | RB | −166.10 | 12.60 | −21.70 | −3.83 | 0,76 | −0.60 | M | M | M | M |
| 1EXB | OX | 0.62 | RB | −4.45 | 12.67 | −13.45 | 23.87 | 4.53 | 10.04 | H | H | H | H |
| 3 K75 | ER | 0.64 | RB | −28.65 | −0.84 | −13.82 | 0.16 | −1.49 | −2.98 | M | M | M | M |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2GAF | ER | 0.69 | RB | −2.24 | −4.38 | 22.32 | 3.03 | 12.19 | −5.00 | M | M | M | M |
| 3A4S | EI | 0.72 | RB | 24.13 | 6.77 | 1.90 | −7.93 | 15.26 | −6.36 | M | M | M | M |
| 3HMX | A | 0.73 | RB | −176.10 | 47.67 | 6.96 | 30.28 | 36.90 | −4.80 | A | A | A | A |
| BP57 | OX | 0.74 | RB | 1.63 | 1.21 | 0.47 | 20.26 | 15.67 | 20.74 | I | I | I | I |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4DN4 | A | 0.81 | RB | −177.76 | −44.97 | −0.31 | 3.66 | 3.74 | 1.38 | M | M | M | M |
| 3LVK | ER | 0.81 | RB | −23.09 | −3.05 | 9.71 | −7.67 | 2.69 | −3.22 | M | M | M | M |
| 4HX3 | EI | 0.90 | RB | 21.85 | 6.44 | −15.03 | 3.11 | 4.15 | −1.40 | M | M | M | M |
| CP57 | OX | 0.91 | RB | −4.17 | 5.42 | −1.18 | −0.17 | 0.01 | 0.05 | M | M | M | M |
| 4FQI | A | 1.08 | RB | 142.03 | −39.70 | 26.35 | −118.89 | −15.29 | 9.26 | A | A | A | A |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2X9A | OR | 1.33 | RB | 1.94 | −0.75 | −10.04 | −0,71 | −1.19 | −2.42 | A | A | A | A |
| 2A1A | ES | 1.35 | RB | 0.05 | 0.08 | 0.96 | −15.94 | −11.30 | 5.92 | A | A | A | A |
| 3EO1 | A | 1.37 | M | 80.74 | −25.14 | −11.74 | 53.36 | 9.39 | 8.74 | A | A | A | A |
| 3DAW | OX | 1.49 | M | 50.29 | −1.68 | 23.33 | 20.86 | −4.81 | 5.40 | A | A | A | A |
| 4IZ7 | EI | 1.56 | M | 12.24 | 27.11 | 20.68 | 11.60 | 13.81 | −8.48 | I | I | I | I |
| 4LW4 | ES | 1.60 | M | −19.77 | 3.31 | −9.73 | −39.72 | −8.93 | 39.82 | A | A | A | A |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3BX7 | OX | 1.63 | M | 1.55 | 22.06 | 7.36 | −4.11 | −12.48 | −4.21 | A | A | A | A |
| 3HI6 | A | 1.65 | M | −140.53 | −41.11 | −7.63 | 16.36 | −15.11 | −18.27 | I | I | I | I |
| 3S9D | OR | 1.69 | M | 1.17 | −0.40 | 48.63 | 16.33 | 1.81 | −4.70 | M | M | M | M |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3V6Z | A | 1.83 | M | −21.01 | 16.58 | 6.74 | −56.12 | 0.19 | 50.15 | I | I | I | I |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3R9A | OR | 1.91 | M | −0.45 | −2.97 | −7.86 | 70.43 | −41.64 | 23.27 | A | A | A | A |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3F1P | OX | 2.52 | D | −0.97 | −5.62 | −3.15 | 3.48 | 4.08 | 4.59 | M | M | M | M |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3H11 | ER | 3.79 | D | 32.62 | −12.08 | −23.19 | −11.01 | 6.84 | −5.52 | I | I | I | I |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4GAM | ER | 5.79 | D | 15.65 | 25.17 | −3.58 | 8.57 | −67.77 | 43.11 | I | I | I | I |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Notes: Type: protein function categories as described in Vreven et al.9: A, antibody‐antigen; EI, enzyme‐inhibitor; ES, enzyme‐substrate; ER, enzyme complex with a regulatory or accessory chain; OG, others, G‐protein containing; OR, others, receptor containing; OX, others, miscellaneous. IRMSD: root‐mean‐squared deviation of Cα atoms of residues at the receptor/ligand interface, calculated after finding the best superposition of bound and unbound. Difficulty: three categories for expected docking difficulty as defined by IRMSD; rigid‐body (RB), IRMSD ≤ 1.35 Å; medium (M), IRMSD > 1.35 Å and IRMSD ≤ 2.5 Å; and difficult (D), IRMSD >2.5 Å. Next, amplitudes of the three lowest mode for the receptor, followed by the amplitudes of the first three lowest modes of the ligand, as calculated by Equation (1); values highlighted in bold if the first amplitude for a receptor or ligand is higher than the following two. Finally, columns P0‐P3 show the best obtained docked pose quality (incorrect, acceptable, medium, or high) for basic docking runs (from unbound; (Ru):(Lu) run from P1), naïve approach (P1), theoretical combination of normal modes input approach (P2), and sophisticated sampling approach (P3). When comparing with protocol P0, transitions with improving quality are highlighted in blue, and the ones worsening the quality are highlighted in red.
Figure 3Energy diagrams for linear combinations of the first three lowest frequency modes. Receptor, all samples (A) and DFIRE2 ≤ 5.0 (B), and ligand, all samples (C) and DFIRE2 ≤ 5.0 (D) for a CAPRI Target 131, 6GBG. DFIRE2 values were normalized to the range from 0.0 (blue) to 10.0 (red) as presented in (E). Yellow marks the origin (run P0) and green marks various starting combinations as labeled in the figure for protocols P1, P2, and P3 (see the Methods section and Table 1 for details). Arrows show the normal modes axes—red, green, blue for the first, second, and third nontrivial normal mode, respectively. The pictures were generated using VMD26
Figure 1Plot of the best percentage of native contacts vs unbound‐bound IRMSD for the 56 Target when various protocols, from P0 to P3, are applied
Figure 2Naïve approach example—9‐fold cross‐docking conformations. A, CAPRI Target T131 (6GBG), showing the variability in conformations that are presented to the SwarmDock server. Three receptor conformations of the left, (Ru) as green, (R‐) as blue, and (R+) as red, three ligand conformations on the right, (Lu) as green, (L‐) as blue, and (L+) as red. B, The best docked pose superimposed on bound receptor structure, red and blue are for bound receptor and ligand, gray and green are for the highest quality docked receptor and ligand pair from run P1 (9‐fold docking), and purple and pink are for the highest FNAT docked receptor and ligand pair from run P0 (basic single run from unbound). The pictures were generated using VMD26