| Literature DB >> 34324224 |
Minkyung Baek1,2, Ivan Anishchenko1,2, Hahnbeom Park1,2, Ian R Humphreys1,2, David Baker1,2,3.
Abstract
For CASP14, we developed deep learning-based methods for predicting homo-oligomeric and hetero-oligomeric contacts and used them for oligomer modeling. To build structure models, we developed an oligomer structure generation method that utilizes predicted interchain contacts to guide iterative restrained minimization from random backbone structures. We supplemented this gradient-based fold-and-dock method with template-based and ab initio docking approaches using deep learning-based subunit predictions on 29 assembly targets. These methods produced oligomer models with summed Z-scores 5.5 units higher than the next best group, with the fold-and-dock method having the best relative performance. Over the eight targets for which this method was used, the best of the five submitted models had average oligomer TM-score of 0.71 (average oligomer TM-score of the next best group: 0.64), and explicit modeling of inter-subunit interactions improved modeling of six out of 40 individual domains (ΔGDT-TS > 2.0).Entities:
Keywords: deep learning; interchain contact prediction; protein complex structure prediction; protein-protien docking
Mesh:
Substances:
Year: 2021 PMID: 34324224 PMCID: PMC8616806 DOI: 10.1002/prot.26197
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
FIGURE 1The oligomer structure modeling procedure used by the BAKER‐experimental group
Summary of modeling strategies and performances
| Target | Difficulty | Interchain contact | Modeling method | Model 1 | Best out of 5 (based on Z‐score) | ||||
|---|---|---|---|---|---|---|---|---|---|
| Z‐score | ICS | TM‐score (oligo) | Z‐score | ICS | TM‐score (oligo) | ||||
| H1036 | Medium | No | Template | 0.84 | 0.68 | 0.70 | 1.17 | 0.72 | 0.71 |
| H1036v0 | Medium | No | Template | 0.93 | 0.27 | 0.69 | 0.92 | 0.27 | 0.69 |
| H1045 | Medium | No | Template | 1.05 | 0.71 | 0.87 | 1.35 | 0.77 | 0.87 |
| H1047 | Hard | Yes (G) |
| 1.35 | 0.04 | 0.39 | 1.54 | 0.04 | 0.38 |
| H1060v1 | Medium | No |
| 0.94 | 0.06 | 0.31 | 1.09 | 0.08 | 0.31 |
| H1060v2 | Medium | No | Template | 0.38 | 0.09 | 0.86 | 0.34 | 0.10 | 0.88 |
| H1060v3 | Medium | No | Template | 0.43 | 0.01 | 0.75 | 0.85 | 0.12 | 0.84 |
| H1060v4 | Medium | No | Template | 0.74 | 0.22 | 0.75 | 0.93 | 0.21 | 0.73 |
| H1060v5 | Medium | Yes (G) | Template | 1.52 | 0.48 | 0.95 | 1.67 | 0.50 | 0.95 |
| H1065 | Hard | Yes (DL) | Fold‐and‐dock | 1.74 | 0.40 | 0.79 | 1.82 | 0.40 | 0.79 |
| H1072 | Medium | Yes (DL) | Fold‐and‐dock | 0.10 | 0.04 | 0.34 | 0.28 | 0.03 | 0.37 |
| H1081v0 | Medium | No |
| 1.46 | 0.35 | 0.97 | 1.59 | 0.35 | 0.97 |
| H1097 | Medium | Yes (T) | Fold‐and‐dock | 2.13 | 0.44 | 0.73 | 1.99 | 0.44 | 0.74 |
| T1032 | Easy | No | Template | 1.08 | 0.38 | 0.69 | 1.10 | 0.40 | 0.68 |
| T1034 | Medium | No | Template | −0.81 | 0.00 | 0.17 | −0.38 | 0.00 | 0.23 |
| T1038 | Hard | No |
| −0.58 | 0.00 | 0.17 | 0.36 | 0.01 | 0.20 |
| T1048 | Medium | Yes (DL) | Fold‐and‐dock | 3.09 | 0.50 | 0.59 | 4.29 | 0.58 | 0.83 |
| T1052 | Easy | No | Template | 0.63 | 0.51 | 0.69 | 0.72 | 0.51 | 0.69 |
| T1054 | Hard | No |
| 0.13 | 0.00 | 0.44 | 0.81 | 0.00 | 0.52 |
| T1061 | Hard | No | Template | 1.85 | 0.15 | 0.64 | 1.97 | 0.17 | 0.69 |
| T1070 | Hard | No | Template | 0.94 | 0.06 | 0.31 | 2.10 | 0.10 | 0.37 |
| T1078 | Medium | No |
| 0.19 | 0.00 | 0.54 | 2.50 | 0.25 | 0.67 |
| T1080 | Hard | Yes (DL) | Fold‐and‐dock | 1.92 | 0.12 | 0.55 | 2.60 | 0.13 | 0.61 |
| T1083 | Medium | Yes (DL) | Fold‐and‐dock | 1.48 | 0.23 | 0.63 | 1.60 | 0.23 | 0.63 |
| T1084 | Medium | Yes (DL) | Fold‐and‐dock | 2.17 | 0.81 | 0.92 | 2.20 | 0.84 | 0.91 |
| T1087 | Medium | Yes (DL) | Fold‐and‐dock | 2.27 | 0.36 | 0.79 | 2.86 | 0.36 | 0.79 |
| T1099v0 | Medium | No | Template | −0.23 | 0.03 | 0.24 | 0.22 | 0.02 | 0.45 |
| T1099v1 | Medium | No | Template | 0.15 | 0.00 | 0.55 | 0.27 | 0.00 | 0.55 |
| T1099v2 | Medium | No | Template | 0.75 | 0.13 | 0.60 | 0.89 | 0.16 | 0.60 |
Abbreviations: DL, Deep learning‐based methods; G, GREMLIN; T, Partial templates.
Calculated on model 1 submissions.
Calculated on all model submissions.
Having a completely wrong prediction for the antigen‐antibody interface.
FIGURE 2Deep learning‐based residue pairwise interaction prediction for (A) homo‐oligomers (trRosetta‐homo) and (B) hetero‐oligomers (trRosetta‐discont)
FIGURE 3Performance of the gradient‐based fold‐and‐dock method. (A) Schematic outline of the fold‐and‐dock procedure consisting of two stages: repetitive folding and docking in centroid representation followed by full‐atom docking and relaxation. (B) Correlation between the quality of predicted interchain contacts and that of modeled interfaces. (C,D) Examples of successful predictions using gradient‐based fold‐and‐dock methods with predicted interchain contacts. Predicted intrachain distances and interchain contacts are shown in the upper diagonal (colored in red) of 2D maps while those from native structures are shown in the lower diagonal (colored in blue). The correctly predicted interchain contacts are shown as blue lines in the model structures. Both native and model structures are colored by chains. (E) Native and the best prediction submitted as model 4 for H1097
FIGURE 4Oligomer modeling performance of BAKER‐experimental group. (A) The relative performance in terms of average Z‐score for the best out of five submissions for each target difficulty and modeling strategy we used. (B) A successful example (T1061) of template‐based approach by detecting a distant oligomer template based on structural similarity. Left; The subunit structure (colored in rainbow) used to search oligomer templates and the detected template (colored in gray, PDB ID: 3CDD) are shown. Right; The predicted structure (submitted as model 2) is shown with the native structure colored in gray. (C) A successful example (H1081) of ab initio docking with a constraint to match symmetry axes of two subunits. The native structure is colored in gray. (D) A failed example (T1054) to generate a correct binding pose by ab initio docking with the subunit structure (colored in rainbow colors from the N‐terminus in blue to the C‐terminus in red) having high GDT‐TS. The problematic N‐terminal helix is highlighted by an orange arrow. The correct binding pose is colored in pink while the predicted one is colored in dark gray
FIGURE 5Comparison of subunit structures modeled as complexes to those modeled as monomers. (A) Head‐to‐head comparison of the subunit qualities in terms of the evaluation unit‐wise GDT‐TS score. Dots are colored by the ICS score of predicted complex structures. (B and C) Two successful examples (T1065s1‐D1 and T1095‐D1) where modeling in oligomer contexts generated better subunit structures. The native structure of the target subunit and its binding partners are shown in green and gray, respectively. The subunit structures predicted as a monomer are shown in cyan (left), while those predicted in oligomer contexts are colored in magenta (right)