| Literature DB >> 27535582 |
Lim Heo1, Hasup Lee1, Chaok Seok1.
Abstract
Protein-protein docking methods have been widely used to gain an atomic-level understanding of protein interactions. However, docking methods that employ low-resolution energy functions are popular because of computational efficiency. Low-resolution docking tends to generate protein complex structures that are not fully optimized. GalaxyRefineComplex takes such low-resolution docking structures and refines them to improve model accuracy in terms of both interface contact and inter-protein orientation. This refinement method allows flexibility at the protein interface and in the overall docking structure to capture conformational changes that occur upon binding. Symmetric refinement is also provided for symmetric homo-complexes. This method was validated by refining models produced by available docking programs, including ZDOCK and M-ZDOCK, and was successfully applied to CAPRI targets in a blind fashion. An example of using the refinement method with an existing docking method for ligand binding mode prediction of a drug target is also presented. A web server that implements the method is freely available at http://galaxy.seoklab.org/refinecomplex.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27535582 PMCID: PMC4989233 DOI: 10.1038/srep32153
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of the GalaxyRefineComplex method.
Performance comparison of refinement methods in terms of the CAPRI accuracy criterion.
| Hetero-complex refinement | ||
|---|---|---|
| Z | ||
| Initial models | 414/18***/148** | 19/1***/6** |
| GalaxyRefineComplex | 528/25***/175** (475/21***/148**) | 27/1***/8** (23/1***/7**) |
| RosettaDock | 482/36***/171** (296/23***/86**) | 26/2***/7** (19/2***/4**) |
| FiberDock | –/–/– (390/23***/127**) | –/–/– (22/1***/4**) |
| Homo-complex refinement | ||
| Initial models | 314/45***/134** | 58/30** |
| GalaxyRefineComplex | 389/90***/146** (364/69***/147**) | 57/35** (57/35**) |
| RosettaDock | 391/285***/52** (347/252***/39**) | 52/29** (48/23**) |
| SymmRef | –/–/– (361/283***/14**) | –/–/– (57/26**) |
The numbers of targets for which the best of 10 refined models were of acceptable or higher accuracy/high accuracy (***)/medium accuracy (**) are presented, and those for model 1’s are shown in parentheses.
1Data for 10 models are not provided for FiberDock and SymmRef because they generate only single models.
Performance comparison of different refinement methods in terms of mean improvement/percentage of improved cases in ligand RMSD (L-RMSD), interface RMSD (I-RMSD), fraction of native contact (Fnat), and MolProbity score (MolP).
| −ΔL-RMSD (Å) | −ΔI-RMSD (Å) | ΔFnat (%) | −ΔMolP | ||
| GalaxyRefineComplex | 0.68/82% (0.06/54%) <0.11/56%> | 0.37/85% (0.15/59%) <0.12/64%> | 10.3/91% (7.7/81%) <6.9/87%> | 1.41/100% (1.33/100%) <1.32/100%> | |
| RosettaDock | 0.40/56% (−3.73/29%) <−3.11/25%> | 0.30/61% (−1.61/32%) <−1.33/26%> | 2.2/58% (−8.4/30%) <−7.9/33%> | 0.35/95% (0.33/95%) <0.32/95%> | |
| FiberDock | –/– (−0.59/41%) | –/– (−0.19/44%) | –/– (−2.2/34%) | –/– (−0.02/44%) | |
| GalaxyRefineComplex | 0.48/74% (0.20/65%) <0.15/60%> | 0.47/90% (0.02/68%) <0.12/60%> | 11.0/82% (7.6/61%) <7.8/82%> | 0.95/100% (0.85/98%) <0.84/100%> | |
| RosettaDock | −2.38/48% (−3.95/30%) <−3.80/30%> | −3.38/45% (−3.99/22%) <−4.29/20%> | 7.2/63% (2.6/46%) <2.2/52%> | 0.16/92% (0.13/81%) <0.13/84%> | |
| FiberDock | –/– (−0.19/41%) | –/– (0.06/40%) | –/– (4.8/70%) | –/– (0.00/0%) | |
| GalaxyRefineComplex | 1.30/89% (0.78/73%) <0.63/76%> | 0.63/86% (0.37/72%) <0.32/73%> | 8.5/73% (6.1/64%) <4.5/64%> | 1.36/100% (1.25/100%) <1.25/100%> | |
| RosettaDock | 3.49/84% (0.82/73%) <0.38/69%> | 1.74/86% (0.17/73%) <−0.08/67%> | 17.1/74% (6.6/61%) <1.9/55%> | 0.66/99% (0.57/99%) <0.56/99%> | |
| SymmRef | –/– (2.90/80%) | –/– (1.14/79%) | –/– (14.6/64%) | –/– (−2.55/0%) | |
| GalaxyRefineComplex | 0.08/65% (−0.05/60%) <−0.04/60%> | 0.07/74% (0.02/71%) <0.01/65%> | 2.6/70% (0.8/57%) <0.9/57%> | 0.78/89% (0.73/83%) <0.72/83%> | |
| RosettaDock | −1.22/38% (−3.31/22%) <−3.97/20%> | −0.86/52% (−1.71/30%) <−2.19/25%> | −5.9/28% (−9.2/18%) <−11.6/15%> | 0.06/74% (0.03/70%) <0.03/73%> | |
| SymmRef | –/– (−0.43/38%) | –/– (−0.20/37%) | –/– (−4.3/39%) | –/– (0.00/0%) | |
Results for the best of 10 refined models are presented, and those for model 1’s and the mean of the 10 models are shown in parentheses and pointy brackets, respectively.
1Data for the 10 models and the mean of the 10 models are not provided for FiberDock and SymmRef because they generate only single models.
Figure 2Quality comparison of the best out of 10 refined models generated by GalaxyRefineComplex (red) and RosettaDock (green) and the single refined models generated by FiberDock (blue in (a,b)) and SymmRef (blue in (c,d)) when the initial models were (a) ZDOCK models and (b) CAPRI models for hetero-complexes and (c) M-ZDOCK models and (d) CAPRI models for homo-complexes. The refinement results for different target complexes are depicted in boxplots, which present first and third quartiles as boxes and the median as the band inside the boxes. The minimum and maximum data within 1.5 interquartile range of the lower and upper quartile are represented as the bottom and top ends of whiskers, respectively, and data points outside of this range are shown with black dots as outliers. Note that only single models, not top ten models, were evaluated for FiberDock and SymmRef because these programs generate only single models.
Figure 3Successful refinement examples for hetero-complexes, i.e., (a) a model for TA12 of the ZDOCK benchmark 4.0 and (b) the model for CAPRI round 26 T54 submitted as P38_M07, and for homo-complexes, i.e., (c) a model for 1MOQ of the PISA benchmark set and (d) a model for CAPRI round 30 T87 submitted as TS417_2. In each panel, experimentally resolved structures are shown on the left, and model structures before and after refinement are shown in the middle and on the right, respectively. Receptor protein structures are depicted in green, ligand protein structures in experimental structures and in models before and after refinement are shown in yellow, pink, and violet, respectively. Red arrows indicate directions of changes in relative orientation made by refinement (from the pink to violet structures).
Blind refinement results of GalaxyRefineComplex on the 13 initial models generated by GALAXY methods in CAPRI round 30 for the best of 10 submitted models.
| CAPRI criterion | |||
|---|---|---|---|
| Initial models | 13/7** | ||
| GalaxyRefineComplex | 13/7** | ||
| −ΔL-RMSD (Å) | −ΔI-RMSD (Å) | ΔFnat (%) | −ΔMolP |
| 0.12/62% | 0.09/85% | 0.075/85% | 0.83/100% |
Figure 4Ligand docking results for the dimer models of HIV-1 integrase generated with and without refinement by GalaxyRefineComplex.
The full structure of the HIV-1 integrase dimer (PDB ID: 4NYF) and the ligand binding site at the dimer interface are shown in (a). Ligand docking results obtained with the dimer models before and after refinement are presented in (b,c), respectively. In (b,c), the crystal structure is colored in yellow. The predicted complex structures are colored in pink and blue, respectively. It can be seen that the predicted hydrogen bonds (cyan lines, generated using UCSF Chimera43) match perfectly with those of crystal structure in (c) but not in (b).