Literature DB >> 28430871

ClusPro PeptiDock: efficient global docking of peptide recognition motifs using FFT.

Kathryn A Porter1, Bing Xia1, Dmitri Beglov1, Tanggis Bohnuud1, Nawsad Alam2, Ora Schueler-Furman2, Dima Kozakov3,4.   

Abstract

SUMMARY: We present an approach for the efficient docking of peptide motifs to their free receptor structures. Using a motif based search, we can retrieve structural fragments from the Protein Data Bank (PDB) that are very similar to the peptide's final, bound conformation. We use a Fast Fourier Transform (FFT) based docking method to quickly perform global rigid body docking of these fragments to the receptor. According to CAPRI peptide docking criteria, an acceptable conformation can often be found among the top-ranking predictions.
AVAILABILITY AND IMPLEMENTATION: The method is available as part of the protein-protein docking server ClusPro at https://peptidock.cluspro.org/nousername.php. CONTACT: midas@laufercenter.org or oraf@ekmd.huji.ac.il. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28430871      PMCID: PMC5860028          DOI: 10.1093/bioinformatics/btx216

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Fast Fourier Transform (FFT) based sampling approaches have been shown to be very effective for modeling protein–protein interactions. If we assume that proteins undergo minimal conformational changes upon binding, we can exhaustively sample mutual protein orientations using a molecular mechanics like energy function in an efficient manner (Chen ; Kozakov ; Tovchigrechko and Vakser, 2006; Viswanath ). This assumption has been adequate in the Critical Assessment of PRediction of Interactions (CAPRI), a community wide docking experiment (Lensink ) where FFT based approaches have consistently been among the top performers (Lensink and Wodak, 2013). The low cost of such techniques that do not require prior knowledge of the binding site makes them well suited for implementation as an automated server. In particular, our ClusPro server (Kozakov ) is consistently the most accurate server according to CAPRI evaluations, and performs comparably to the best human groups (Lensink ). ClusPro is heavily used and has more than 17 000 users, who have run more than 1 50 000 jobs in the last few years. FFT based approaches have been successfully applied to rigid protein systems, however, they are not directly applicable to flexible systems, such as peptide-protein interactions. These transient protein complexes are important as they make up nearly half of protein interactions in the cell. To account for increased flexibility of peptides, existing global peptide docking methods instead use Monte Carlo and Molecular Dynamics-based approaches (Ben-Shimon and Niv, 2015; Dagliyan ; Kurcinski ; Raveh ; Schindler ; Trellet ; Yan ). While useful, most of these approaches require inclusion of computationally expensive refinement to obtain CAPRI quality models, and thus are too resource demanding for efficient server implementation. Template based approaches on the other hand are much faster (Lee ), but are limited to cases with available templates. It turns out that peptides in transient complexes have particular properties that nevertheless renders the peptide-protein problem amenable to an FFT based approach. In the majority of peptide–protein interactions, a protein domain interacts with a short linear peptide motif. For example, cyclins recognize the RXL motif, where R (arginine) and L (leucine) are fixed as amino acids, and X represents any amino acid. It has been shown experimentally that motif regions are limited to a small ensemble of potential conformations within a protein environment (Wang ). This observation enables the use of FFT for our docking method, as we can dock each conformation in this small ensemble. The key idea of our approach is to generate this ensemble by searching for proteins containing the motif of interest in the Protein Data Bank (PDB) (Berman ) and extract the regions that match the motif. Conceptually similar approaches of mining fragments from the PDB have been previously used for protein structure prediction (Simons ). While these approaches were based on sequence and secondary structure similarity, we use motif based fragment extraction to focus on a relevant subset of conformations, taking advantage of the specific features of the motif docking challenge. By combining this resulting fragment library with systematic Fast Fourier Transform (FFT) grid based sampling using accurate molecular mechanics potentials (Kozakov , 2014), we efficiently sample and discriminate near native docked peptide models with success rates similar to domain–domain protein docking, despite significant peptide flexibility. We demonstrate this on a diverse set of domain–motif interactions, and make the method freely available as part of the protein–protein docking server ClusPro.

2 Materials and methods

(1) Preparing the input structures: The structure of the free receptor is represented as an independent binding unit that is defined as either a single domain, or repeated, non-decomposable domains (Lavi ). Unstructured terminal tails are removed. On the peptide side we start with a peptide sequence that covers the initial motif, and expand the sequence if the original motif is too short (less than 5 residues). We further generalize it by introducing wildcards based on motif information, or restrict it by further expansion, as detailed in SI methods. The generalization protocol is iterative, based on available PDB information, to ensure reasonable structural coverage (i.e. a library of 100–1000 conformations). (2) Clustering of fragments: In the next step the fragments are clustered using a greedy algorithm and a stringent clustering radius of 0.5 Å. The cluster center of each of the top 25 clusters is retained for docking in the next step [Similar to the 25 top-scoring fragments used in Rosetta ab initio modeling (Simons )]. (3) Docking of peptide fragments: Each of the (up to 25) fragments is docked to the receptor structure using an FFT based sampling protocol (see SI). The top 250 results of each run are combined into one ensemble. (4) Selection of models: The ensemble of models is clustered using a cluster radius of 3.5 Å, which was chosen to represent the resolution of basins of attraction, and the resulting clusters are ranked according to cluster size (see SI). Representatives of the top-ranking selected clusters are further locally minimized using CHARMM (Brooks ), and provided as predictions. Solutions that overlap with domain-domain interfaces are removed. Overview of the protocol is presented in Supplementary Figure S1.

3 Docking performance

To demonstrate the efficiency and accuracy of this approach, we model diverse protein domain-motif interactions from the PeptiDB v2 dataset for which the interacting motif has been reported (Lavi ; London ). Additionally, we validate the protocol on an independent set of domain-motif complexes recently published in the PDB distinct from PeptiDB v2. We note that to prevent inherent bias in modeling, structures of proteins with more than 30% sequence identity to the target are excluded from the search for motif backbones, and only information about sequence motifs available before publication of the solved complex structure is included. Using a CAPRI-inspired threshold for success, namely defining a near-native conformation if the peptide lies within 4.0 Å backbone RMSD of the native peptide bound to the receptor (i.e. the CAPRI criterion for an acceptable peptide-protein docking prediction), a near-native peptide conformation is found among the top 10 PeptiDock predictions for 11 of the 16 complexes, and all apart from two cases are identified among the top 20 clusters. Similar performance is obtained for the additional validation set: for 4 out of 5 complexes, a conformation similar to bound is extracted using the motif based search, and for 3 out of the 5 cases a near-native structure is ranked first. The overall detailed assessment of PeptiDock performance is provided in Supplementary Table S1, and comparison of docked poses to crystal conformations are shown in Supplementary Figure S2. Comparison with another global peptide docking protocol, CABS-dock (Kurcinski ) which uses ab-initio folded peptide conformations rather than PDB fragments, indicates that fragment based backbone sampling provides docking models of higher accuracy (Supplementary Table S2). Figure 1 shows an example of a successful case and a challenging case. For the latter, the native complex forms hydrogen bonds between the peptide backbone and protein side chains, but lacks strong hydrophobic interactions with the aromatic side chains. The hydrophobic valine points into the solvent (forming crystal contacts with a symmetry mate in the solved structure). Interestingly, in this and in the one additional case for which no near-native structure was sampled (2YNNA), considerable improvement was obtained by using an electrostatic-driven potential (Supplementary Table S3), indicating that scoring rather than sampling limits performance in these interactions that are dominated by electrostatic attraction (and crystal contacts).
Fig. 1

Examples for models generated by PeptiDock rigid body docking of peptides to a receptor. Receptor structures are shown in light grey. Silver structures represent the native peptide pose. (A) A peptide derived from CDC6 with the sequence motif KGRRL is successfully docked to cyclin. The third ranked prediction (dark grey) produces an acceptable accuracy result (1.9Å backbone RMSD; apo/holo PDB IDs: 1H1R/2CCH). (B) No near native structure is sampled using the standard energy function weight set when a peptide derived from synaptojanin is docked to the ap2 adaptor (apo/holo PDB IDs: 1B9K/2VJ0). Nevertheless, a 4.0Å RMSD model (gray) is produced when an electrostatics-favored coefficient set is used in place of the standard weight set. This can be explained by the fact that this interaction is dominated by several hydrogen bonds of the peptide backbone (dotted line) in the native complex, but lacks strong hydrophobic interactions with the aromatic side chains, as well as by crystal contacts in the bound conformation (shown in opaque gray sticks) that stabilize the solved peptide conformation. The hydrophobic V6 points into the “solvent”, but actually contacts the symmetry mate (interaction is marked with *; see Figure S3 for more details) (Color version of this figure is available at Bioinformatics online.)

Examples for models generated by PeptiDock rigid body docking of peptides to a receptor. Receptor structures are shown in light grey. Silver structures represent the native peptide pose. (A) A peptide derived from CDC6 with the sequence motif KGRRL is successfully docked to cyclin. The third ranked prediction (dark grey) produces an acceptable accuracy result (1.9Å backbone RMSD; apo/holo PDB IDs: 1H1R/2CCH). (B) No near native structure is sampled using the standard energy function weight set when a peptide derived from synaptojanin is docked to the ap2 adaptor (apo/holo PDB IDs: 1B9K/2VJ0). Nevertheless, a 4.0Å RMSD model (gray) is produced when an electrostatics-favored coefficient set is used in place of the standard weight set. This can be explained by the fact that this interaction is dominated by several hydrogen bonds of the peptide backbone (dotted line) in the native complex, but lacks strong hydrophobic interactions with the aromatic side chains, as well as by crystal contacts in the bound conformation (shown in opaque gray sticks) that stabilize the solved peptide conformation. The hydrophobic V6 points into the “solvent”, but actually contacts the symmetry mate (interaction is marked with *; see Figure S3 for more details) (Color version of this figure is available at Bioinformatics online.)

4 Conclusions

We demonstrate remarkable performance in deterministic global ab-initio docking of peptides to protein domains using an approach that capitalizes on the relationship between sequence and structural motifs and employs accurate FFT based sampling. Results are shown for a set of domain-motif interactions. It is important to note that this set was derived from the generic diverse peptide-protein benchmark set PeptiDB v2 (Lavi ), which was designed with no motif information in mind. Nevertheless, we were able to find motif information in the literature for 16 of 21 cases in the PeptiDB v2 set, which demonstrates that motif information is generally available for a broad class of protein peptide interactions. The resulting protocol takes roughly 500 CPU hours per simulation, which translates to less than 2 hours on a cluster of 20 16-core machines. The method is freely available as part of the ClusPro protein-docking server.

Funding

This work was supported by National Science Foundation (NSF) Grants AF 1527292 and DBI 1458509, US Israel Binational Science Foundation Grant 2009418,2015207, and European Research Council (ERCstG 310873). Conflict of Interest: none declared. Click here for additional data file.
  22 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  ZDOCK predictions for the CAPRI challenge.

Authors:  Rong Chen; Weiwei Tong; Julian Mintseris; Li Li; Zhiping Weng
Journal:  Proteins       Date:  2003-07-01

3.  How good is automated protein docking?

Authors:  Dima Kozakov; Dmitri Beglov; Tanggis Bohnuud; Scott E Mottarella; Bing Xia; David R Hall; Sandor Vajda
Journal:  Proteins       Date:  2013-10-17

Review 4.  CHARMM: the biomolecular simulation program.

Authors:  B R Brooks; C L Brooks; A D Mackerell; L Nilsson; R J Petrella; B Roux; Y Won; G Archontis; C Bartels; S Boresch; A Caflisch; L Caves; Q Cui; A R Dinner; M Feig; S Fischer; J Gao; M Hodoscek; W Im; K Kuczera; T Lazaridis; J Ma; V Ovchinnikov; E Paci; R W Pastor; C B Post; J Z Pu; M Schaefer; B Tidor; R M Venable; H L Woodcock; X Wu; W Yang; D M York; M Karplus
Journal:  J Comput Chem       Date:  2009-07-30       Impact factor: 3.376

5.  Detection of peptide-binding sites on protein surfaces: the first step toward the modeling and targeting of peptide-mediated interactions.

Authors:  Assaf Lavi; Chi Ho Ngan; Dana Movshovitz-Attias; Tanggis Bohnuud; Christine Yueh; Dmitri Beglov; Ora Schueler-Furman; Dima Kozakov
Journal:  Proteins       Date:  2013-10-17

6.  The structural basis of peptide-protein binding strategies.

Authors:  Nir London; Dana Movshovitz-Attias; Ora Schueler-Furman
Journal:  Structure       Date:  2010-02-10       Impact factor: 5.006

7.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions.

Authors:  K T Simons; C Kooperberg; E Huang; D Baker
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

8.  Structural and dynamic determinants of protein-peptide recognition.

Authors:  Onur Dagliyan; Elizabeth A Proctor; Kevin M D'Auria; Feng Ding; Nikolay V Dokholyan
Journal:  Structure       Date:  2011-12-07       Impact factor: 5.006

9.  CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site.

Authors:  Mateusz Kurcinski; Michal Jamroz; Maciej Blaszczyk; Andrzej Kolinski; Sebastian Kmiecik
Journal:  Nucleic Acids Res       Date:  2015-05-05       Impact factor: 16.971

10.  A unified conformational selection and induced fit approach to protein-peptide docking.

Authors:  Mikael Trellet; Adrien S J Melquiond; Alexandre M J J Bonvin
Journal:  PLoS One       Date:  2013-03-13       Impact factor: 3.240

View more
  25 in total

1.  From Proteomic Mapping to Invasion-Metastasis-Cascade Systemic Biomarkering and Targeted Drugging of Mutant BRAF-Dependent Human Cutaneous Melanomagenesis.

Authors:  Aikaterini F Giannopoulou; Athanassios D Velentzas; Athanasios K Anagnostopoulos; Adamantia Agalou; Nikos C Papandreou; Stamatia A Katarachia; Dimitra G Koumoundourou; Eumorphia G Konstantakou; Vasiliki I Pantazopoulou; Anastasios Delis; Maria T Michailidi; Dimitrios Valakos; Dimitris Chatzopoulos; Popi Syntichaki; Vassiliki A Iconomidou; Ourania E Tsitsilonis; Issidora S Papassideri; Gerassimos E Voutsinas; Polydefkis Hatzopoulos; Dimitris Thanos; Dimitris Beis; Ema Anastasiadou; George Th Tsangaris; Dimitrios J Stravopodis
Journal:  Cancers (Basel)       Date:  2021-04-22       Impact factor: 6.639

2.  Methods for Molecular Modelling of Protein Complexes.

Authors:  Tejashree Rajaram Kanitkar; Neeladri Sen; Sanjana Nair; Neelesh Soni; Kaustubh Amritkar; Yogendra Ramtirtha; M S Madhusudhan
Journal:  Methods Mol Biol       Date:  2021

3.  Flexible docking of peptides to proteins using CABS-dock.

Authors:  Mateusz Kurcinski; Aleksandra Badaczewska-Dawid; Michal Kolinski; Andrzej Kolinski; Sebastian Kmiecik
Journal:  Protein Sci       Date:  2019-11-11       Impact factor: 6.725

4.  Harnessing protein folding neural networks for peptide-protein docking.

Authors:  Tomer Tsaban; Julia K Varga; Orly Avraham; Ziv Ben-Aharon; Alisa Khramushin; Ora Schueler-Furman
Journal:  Nat Commun       Date:  2022-01-10       Impact factor: 14.919

5.  HMI-PRED: A Web Server for Structural Prediction of Host-Microbe Interactions Based on Interface Mimicry.

Authors:  Emine Guven-Maiorov; Asma Hakouz; Sukejna Valjevac; Ozlem Keskin; Chung-Jung Tsai; Attila Gursoy; Ruth Nussinov
Journal:  J Mol Biol       Date:  2020-02-13       Impact factor: 5.469

6.  Inhibition of protein interactions: co-crystalized protein-protein interfaces are nearly as good as holo proteins in rigid-body ligand docking.

Authors:  Saveliy Belkin; Petras J Kundrotas; Ilya A Vakser
Journal:  J Comput Aided Mol Des       Date:  2018-07-12       Impact factor: 3.686

7.  ClusPro in rounds 38 to 45 of CAPRI: Toward combining template-based methods with free docking.

Authors:  Dzmitry Padhorny; Kathryn A Porter; Mikhail Ignatov; Andrey Alekseenko; Dmitri Beglov; Sergei Kotelnikov; Ryota Ashizawa; Israel Desta; Nawsad Alam; Zhuyezi Sun; Emiliano Brini; Ken Dill; Ora Schueler-Furman; Sandor Vajda; Dima Kozakov
Journal:  Proteins       Date:  2020-03-23

8.  PepPro: A Nonredundant Structure Data Set for Benchmarking Peptide-Protein Computational Docking.

Authors:  Xianjin Xu; Xiaoqin Zou
Journal:  J Comput Chem       Date:  2019-12-02       Impact factor: 3.376

9.  Protein-ligand docking using FFT based sampling: D3R case study.

Authors:  Dzmitry Padhorny; David R Hall; Hanieh Mirzaei; Artem B Mamonov; Mohammad Moghadasi; Andrey Alekseenko; Dmitri Beglov; Dima Kozakov
Journal:  J Comput Aided Mol Des       Date:  2017-11-03       Impact factor: 3.686

10.  Modeling beta-sheet peptide-protein interactions: Rosetta FlexPepDock in CAPRI rounds 38-45.

Authors:  Alisa Khramushin; Orly Marcu; Nawsad Alam; Orly Shimony; Dzmitry Padhorny; Emiliano Brini; Ken A Dill; Sandor Vajda; Dima Kozakov; Ora Schueler-Furman
Journal:  Proteins       Date:  2020-01-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.