| Literature DB >> 28002624 |
Orly Marcu1, Emma-Joy Dodson1, Nawsad Alam1, Michal Sperber1, Dima Kozakov2, Marc F Lensink3, Ora Schueler-Furman1.
Abstract
CAPRI rounds 28 and 29 included, for the first time, peptide-receptor targets of three different systems, reflecting increased appreciation of the importance of peptide-protein interactions. The CAPRI rounds allowed us to objectively assess the performance of Rosetta FlexPepDock, one of the first protocols to explicitly include peptide flexibility in docking, accounting for peptide conformational changes upon binding. We discuss here successes and challenges in modeling these targets: we obtain top-performing, high-resolution models of the peptide motif for cases with known binding sites but there is a need for better modeling of flanking regions, as well as better selection criteria, in particular for unknown binding sites. These rounds have also provided us the opportunity to reassess the success criteria, to better reflect the quality of a peptide-protein complex model. Using all models submitted to CAPRI, we analyze the correlation between current classification criteria and the ability to retrieve critical interface features, such as hydrogen bonds and hotspots. We find that loosening the backbone (and ligand) RMSD threshold, together with a restriction on the side chain RMSD measure, allows us to improve the selection of high-accuracy models. We also suggest a new measure to assess interface hydrogen bond recovery, which is not assessed by the current CAPRI criteria. Finally, we find that surprisingly much can be learned from rather inaccurate models about binding hotspots, suggesting that the current status of peptide-protein docking methods, as reflected by the submitted CAPRI models, can already have a significant impact on our understanding of protein interactions. Proteins 2017; 85:445-462.Entities:
Keywords: CAPRI; FlexPepDock; Peptide docking; binding hotspots; impact of low-accuracy models; model assessment
Mesh:
Substances:
Year: 2017 PMID: 28002624 PMCID: PMC6618814 DOI: 10.1002/prot.25230
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Current CAPRI Success Criteria for Peptide–Protein Docking (Modified From27, 28)
| (A) Measures | |
| Interface residues | <8.0 Å between any two CB atoms (CA for Gly) across interface |
| Native contacts | <4.0 Å between any two atoms across the interface (residue‐based) |
| Clashes | <3.0 Å between any two atoms across the interface (atom‐based) |
| (B) Classification | |
| High quality (H) | fnat [0.8 . 1.0] & (L‐RMSD ≤ 1.0 Å ǁ I‐RMSD ≤ 0.5 Å) |
| Medium (M) | fnat [0.5 . 0.8] & (L‐RMSD ≤ 2.0 Å ǁ I‐RMSD ≤ 1.0 Å) ǁfnat [0.8 . 1.0] & (L‐RMSD > 1.0 Å & I‐RMSD > 0.5 Å) |
| Acceptable (A) | fnat [0.2 . 0.5] & (L‐RMSD ≤ 4.0 Å ǁ I‐RMSD ≤ 2.0 Å) ǁfnat [0.5 . 1.0] & (L‐RMSD > 2.0 Å & I‐RMSD > 1.0 Å) |
| Incorrect (I) | The rest |
Residues Involved in the Peptide–Receptor Interactions for CAPRI Targets T60‐64: Importin α − NLS of RNA Helicase Guα + Peptides Derived from mRNA Display (PDB ids 3zin, 3zio, 3zip, 3ziq, 3zir)39
| Peptide residue | Contacting receptor residues | Additional information |
|---|---|---|
| Ser P−3’(W,R) | S406 (HB:SM) | |
| Arg P−2' (I,A,S,V) | D325 (SB), A364 (HB:SM), G365, R366, S406, G407 | |
| Gly P−1' (H/Q) | A364, N403 (HB:MS), S406, G407 | |
| Gln P0' (R) | A364, W399, N403 (HB:MS) | |
|
|
| [K‐>R: <5% NLS‐GFP import] |
|
|
| [R‐>A: reduction in NLS‐GFP import] |
| [E396R: 100x less binding (E402)] | ||
| Ser P3' (G,T,K) | W357 | |
|
|
| [W‐>V: <5% NLS‐GFP import in mouse importin] |
| [R315A & Y277A: 10x less binding of bipartite NLS‐GFP (R321, Y283)] | ||
| Ser P5’ | – | |
| Lys (+) P6’ | – | |
|
|
| |
|
|
| [F‐>A: <5% NLS‐GFP cargo import] |
| Gly P9’ | – |
In parentheses: amino acids in other peptides (T61–T64), if not identical. +: basic residue; aro: aromatic residue.
Native contact between peptide and receptor residues are defined as by CAPRI: at least one atom pair across the interface within 4.0 Å distance.
Specific interactions with the peptide are indicated in parentheses next to the receptor residue: SB: salt bridge; HB: Hydrogen bond between peptide and receptor (involving side‐chain, S or main chain, M); +π: cation −π . Hydrogen bonds and Salt bridges are identified according to HBplus, see Methods.
Residues with conserved interactions are in italics.
Residue numbering in respective studies is indicated in parentheses.
Note that effect is observed only at 37°C, not at 25°C.
Highlighted in bold are the peptide residues that are part of the binding motif (first column), and the receptor residues for which experimental information on their contribution to binding is available (detailed in the last column).
Residues Involved in the Peptide–Receptor Interactions for CAPRI Target T67: Nedd4 Third WW Domain–ARRDC3 PY1 (PDB id 4n7h)41 (Legend as in Table II
| Peptide residue | Contacting receptor residues | Additional information |
|---|---|---|
| Glu P−2’ |
| |
| Ala P−1’ |
| |
|
| F438, T447, T448, |
[WWOX WW1‐ErbB4 PY3] P‐>A: NBD |
|
| A432, P433, F438, |
[WWOX WW1‐ErbB4 PY3] P‐>A: NBD |
| Ser P2’ | T447 | |
|
| I440, D441, |
[WWOX WW1‐ErbB4 PY3] P‐>A: NBD |
| Ala P+1, Glu P+2 | – | |
|
| E428, |
V‐>I: ∼2x less binding |
gNBD—no binding detected.
Highlighted in bold are the peptide residues that are part of the binding motif (first column), and the receptor residues for which experimental information on their contribution to binding is available (detailed in the last column).
Residues Involved in the Peptide–Receptor Interactions for CAPRI Targets T65/T66: Bacterial RNase/PriA Helicase—Single Strand Binding Protein (SSB) c‐Terminal Peptide (PDB id 4z0u45/4nl846)
| T65 | T66 | ||
|---|---|---|---|
| Peptide residue | Contacting receptor residues | Additional information45 | Contacting receptor residues |
| ASP P−3’ |
| K3A: ND; R29A: 8x less binding | R699 (HB:SS) |
| ILE P−2’ | Y28, | R31A: 10x less binding | S696, R697, V698 |
|
| V5, Y28, A58, L59, | K60A: 10x less binding | – |
|
| L26, Y28, | K33A: 3x less binding | V341, R697 (oxt‐SB/HB:MS |
Chains B–E were used for further evaluation since the A–D complex contains far less interactions. Underlined are interactions that appear only in chains B–E.
Chains B–D were used.
HB:FC’‐R697 taken from chains E–F.
Highlighted in bold are the peptide residues that are part of the binding motif (first column), and the receptor residues for which experimental information on their contribution to binding is available (detailed in the third column).
Performance of the Furman Group Blind Predictions in the CAPRI Peptide–Protein Docking Rounds (Round 28: T60–64; Round 29: T65–67), Using Rosetta FlexPepDock. CAPRI Format Report: # of Acceptable/High***/Medium** Accuracy Models
| Target | T67 | T60‐64a | T60 | T61 | T62 | T63 | T64 | T65/66 |
|---|---|---|---|---|---|---|---|---|
| Motif (PPSY; hexamer) |
10/ |
5/ | 10/8**medium | 10/5**medium | 10**medium | 10acceptable | 10/8**medium | – |
| Full peptide |
10/ |
| 0 | 0 | 1acceptable | 3acceptable | 5acceptable | 0 |
Number of targets for which at least one high***/medium**/acceptable model was submitted.
Top‐ranking results are shown in bold: Rosetta FlexPepDock is the top‐ranking approach for T67, and top‐ranking together with the groups of Guerois and Seok for T60‐64 (see other contributions to this issue).
Figure 1CAPRI performance of the Furman Group: (A) T6741: ARRDC3 rpEA AEVvt peptide bound to Nedd4 WW3 domain: High accuracy for peptide motif (shown in cyan; fnat = 0.83, L_rmsd = 039 Å, I_rmsd = 0.35 Å), medium accuracy for full peptide, including Valine at P +3’. Coloring in this and next figures, unless specifically noted: Blue/Cyan—model; Green—crystal structure. (B–D) T60‐6439: NLS peptides bound to importin minor binding site. (B) Best model of peptide motif (in sticks, fnat = 0.83, L_RMSD = 2.42 Å, I_RMSD = 0.79 Å), compared to solved structure of T64. (C,D) Arrangement of aromatic side chains at the interface in the crystal structure (C), and a model (D): In the model, a rotamer flip of the central tryptophan residue (highlighted by a black circle) allows to form a ladder of interface stacking interactions, involving both the receptor and the peptide. This results in incorrect positioning of F P4’ (highlighted by a black rectangle). Coloring in (D): yellow–receptor; red–peptide). (E–G) RNAse bound to SSB‐terminal peptide (T65).45 (E) Superposition of the bound conformation and a representative free conformation shows significant movement of an adjacent loop (bound conformation: white and magenta, 4z0u45; free conformation: green, 2rn2). (F) Peptimap misses the binding site on the free conformation that we used as template (2rn2), but mapping of SSB‐peptide fragments identifies the site (submission 10; peptide in spheres). (G) Mapping would identify the site and rank it 5th on the bound conformation (yellow mesh; additional predicted sites are on the back, not shown). (H) PriA helicase bound to SSB peptide (T66).46 Mapping of the full receptor structure using peptimap identifies the binding site and ranks it 5th. However, refinement of the crystal structure does not retain the peptide in the binding site (see Supporting Information Fig. S2), indicating poor resolution of the binding site.
Figure 2The effect of alanine mutations, as predicted by computational protocols applied to crystal structures. Shown are results for (A) T60, (B) T65, and (C) T67. Upper panels: results for all interface residues, colored from blue to orange for predicted values in the range of [0.00. 2.00] (values beyond are capped to this scale). Lower panels: results for hotspot residues only, colored orange, yellow and cyan for >1.45;>0.95 and the rest. X: value not calculated. Fx: FoldX (versions 2.5 and 3), Rob and RS: Robetta and Rosetta scripts. Residue coloring: Experimentally tested receptor residues that affect binding are colored in red (weaker effect in orange). Previously known peptide motif residues are highlighted in red, and those identified in the studies that report the target structures are in orange. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 3Correlation between different measures of model accuracy. Interface side‐chain RMSD (S‐RMSD) versus interface backbone RMSD (I‐RMSD) for models submitted to CAPRI (regression line in red) (A), and models generated by FlexPepDock refinement starting from the native structure (C). Models are colored according to CAPRI classification (medium accuracy—green; acceptable model—blue; incorrect models—magenta), and Target: 0 ‐ T60, X‐ T65, and □‐T67. (B,D) Models suggested for reclassification, based on their accurate side chains (see text for more details; coloring as in Fig. 1). [Color figure can be viewed at wileyonlinelibrary.com]
Figure 4Correlation between different measures of model accuracy. Hydrogen bond recovery (fnathb) versus native contact recovery (fnat) for models submitted to CAPRI (A) and models obtained from local refinement runs (C). (B) Example of model that recovers all hydrogen bonds, but is ranked as medium due to low native contact recovery (coloring as in Fig. 3). [Color figure can be viewed at wileyonlinelibrary.com]
Figure 5Even incorrect models can identify critical interface receptor and peptide hotspots. (A) Distribution of number of hotspots identified by incorrect models. Number of models for which Robetta (blue), FoldX2.5.2 (cyan) or mCSM (yellow) were able to identify a given number of receptor (upper) or peptide (lower row) binding residues for T60‐64 (left), T65 (center) and T67 (right). Stars indicate performance of the corresponding crystal structures. Performance based on crystal structures is highlighted as stars of the same respective color of the protocol. (B) Distribution for acceptable and medium accuracy models. (C) Example model with high hotspot recovery but poor modeling quality (T62_P36.M04; taken from the boxes outlined in red): While previously known hotspots KP1’RP2’ are correctly positioned (green and blue overlay), the rest of the peptide extends to form non‐native interactions using FP4’, which nevertheless still results in high hotspot recovery rate. [Color figure can be viewed at wileyonlinelibrary.com]
Figure 6Hitmaps reveal that incorrect models can identify important features of an interaction. Hitmaps show how often each residue on the receptor (and the peptide) is defined as hotspot in the ensemble of incorrect models submitted to CAPRI. (A) T65—blind prediction of a binding site: The hitmap of the receptor surface shows a few suggested binding sites, including the SSB binding site (circled), as well as a known dimerization site (square). (B,C) T60‐64—prediction of secondary motif for minor‐site specific binding: The hitmap recovers previously known, as well as new peptide motif residues (aroP4’ and FP8’) (B), but for the wrong reasons: A wrong binding site is mapped (square), while the pocket binding the FP8’ in the correct orientation is missed (circle) (C). The structures are colored from blue (0) to red (maximum) by the number of times a receptor residue was detected to be important for binding by FoldX2.5.2. Peptide is colored in salmon and represented as sticks. [Color figure can be viewed at wileyonlinelibrary.com]