Literature DB >> 31141126

PRODIGY-crystal: a web-tool for classification of biological interfaces in protein complexes.

Brian Jiménez-García1, Katarina Elez1, Panagiotis I Koukos1, Alexandre Mjj Bonvin1, Anna Vangone1.   

Abstract

SUMMARY: Distinguishing biologically relevant interfaces from crystallographic ones in biological complexes is fundamental in order to associate cellular functions to the correct macromolecular assemblies. Recently, we described a detailed study reporting the differences in the type of intermolecular residue-residue contacts between biological and crystallographic interfaces. Our findings allowed us to develop a fast predictor of biological interfaces reaching an accuracy of 0.92 and competitive to the current state of the art. Here we present its web-server implementation, PRODIGY-CRYSTAL, aimed at the classification of biological and crystallographic interfaces. PRODIGY-CRYSTAL has the advantage of being fast, accurate and simple. This, together with its user-friendly interface and user support forum, ensures its broad accessibility.
AVAILABILITY AND IMPLEMENTATION: PRODIGY-CRYSTAL is freely available without registration requirements at https://haddock.science.uu.nl/services/PRODIGY-CRYSTAL.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31141126      PMCID: PMC9186318          DOI: 10.1093/bioinformatics/btz437

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Interactions between proteins are at the origin of essential function in cells. Correct identification of biologically relevant assemblies is therefore fundamental for a proper understanding of their function and mode of action. X-ray crystallography is the most widely used technique to experimentally determine structures of protein–protein complexes. However, distinguishing biologically relevant interfaces from crystallographic ones, which are non-specific and mere artifacts of the crystallization process, remains a non-trivial problem. We have previously demonstrated the fundamental role of interfacial residue–residue contacts in describing and predicting binding properties. These have led to high-performing binding affinity predictors for protein–protein (Vangone and Bonvin, 2015) and protein–small ligand binding affinities (Kurkcuoglu ) implemented into the PRotein binDIng enerGY prediction (PRODIGY) web server available at http://haddock.science.uu.nl/services/PRODIGY (Vangone and Bonvin, 2017; Xue ) and PRODIGY-LIGand (PRODIGY-LIG), available at http://haddock.science.uu.nl/services/PRODIGY-LIG (Vangone ), respectively. We have recently further extended the concept of intermolecular contacts into a new methodology for the classification of biological/crystallographic interfaces in protein–protein complexes, reported in Elez ). Our predictor is based on simple structural properties of the interface: The residue–residue contacts classified by their polar/apolar/charged character and the type of amino acid involved in the contact. By using the Many dataset (Baskaran ), we trained a machine learning predictor which reaches an accuracy of 0.92 (Elez ). Our method is competitive method with respect to the current state of the art PISA (Krissinel and Henrick, 2007) and EPPIC (Duarte ) which reach accuracies of 0.83 and 0.88 on the same dataset, respectively. A few web-tools for interface classification have been made available to the scientific community (Bernauer ; Duarte ; Krissinel and Henrick, 2007; Mitra ; Ponstingl ). Here we present PRODIGY-CRYSTAL, a fast and user-friendly online tool which implements our contact-based methodology to classify interfaces into biological or crystallographic ones.

2 The web server

The input page

PRODIGY-CRYSTAL has been implemented as a user-friendly web server within our PRODIGY collection of web-tools, which are aimed at predicting various properties in biomolecules complexes. The server is freely available without registration at http://haddock.science.uu.nl/services/PRODIGY-CRYSTAL. It provides a fast prediction of the protein–protein interface nature, classifying it as biological or crystallographic. Users are required to provide the following information: Structures: coordinates file in PDB or mmCIF format. The file can be uploaded or automatically retrieved from the PDB by specifying a PDB ID. A compressed archive file (tar, tgz, zip, bz2 or tar.gz) containing multiple complexes can also be provided for classification of multiple interfaces in one submission. Chain IDs: the chain identifiers for the interface to be analyzed. These must be provided in the two boxes named ‘Interactor 1’ and ‘Interactor 2’, respectively. One interface can also be formed by more than one chain (e.g. chains A and B). In that case the chains should be reported as a comma separated list (‘A, B’). Job ID (Optional): personalized job name. Email (Optional): in the case an email is provided, a link to the results page will be sent to the user. Upon successful validation of the input data, users are redirected to the job page, which displays the status of the job during execution and the results upon completion.

The output page

Prediction is usually completed within a few seconds and the results displayed online. If the user closes the page, the calculation will still be carried out and the results will remain accessible for 2 weeks provided the user saved the URL or provided an email address. The results page displays the following information: Class of the interface predicted, reported as ‘biological’ or ‘crystallographic’ together with the probability values, which range between 0 and 1 and indicate how likely is the interface to belong to each of the two classes; Number of residue contacts found at the interface, classified according to their polar/apolar/charged character; Link Density value (Elez ); List of residue contacts, reported as a downloadable table in .txt format. If an interface is predicted as ‘biological’, the server will run the PRODIGY module (Xue ) in order to provide the binding affinity prediction (reported in kcal mol−1), which is then displayed in the main results page. Online visualization of the query interfaces is also provided, powered by the NGL Viewer (Rose ) JavaScript software. The interacting chains are colored in blue/pink for Interactor 1 and Interactor 2, respectively. The interface, composed of the residues within 5.0 Å distance from the other chain, is highlighted in green for a predicted biological interface or in red for a crystallographic one. Finally, it is possible to download an archive folder of the PRODIGY-CRYSTAL output (in .tgz format). An overview of PRODIGY-CRYSTAL output page is reported in Figure 1.
Fig. 1.

Example of the PRODIGY-CRYSTAL output page for the complex 1PPE, chains E versus I. The four sections of the output pages are shown (Prediction results, Prediction details, Interface visualization and Download outputs). A three-dimensional representation of the complex interface is shown in the gray box, powered by NGL Viewer. The interacting chains will be reported in pink/blue ribbons, while the interface will be marked in green spheres (will be reported in red for interfaces predicted as crystallographic)

Example of the PRODIGY-CRYSTAL output page for the complex 1PPE, chains E versus I. The four sections of the output pages are shown (Prediction results, Prediction details, Interface visualization and Download outputs). A three-dimensional representation of the complex interface is shown in the gray box, powered by NGL Viewer. The interacting chains will be reported in pink/blue ribbons, while the interface will be marked in green spheres (will be reported in red for interfaces predicted as crystallographic)

The web-tool

As for the other PRODIGY tools, the home page is organized with a top banner displaying the several available sections. Dedicated tabs help the user read the content of each page and easily navigate through them. Besides the Home page, which reports general information and presents the form for job submission, Manual, Method, Dataset, References and Help pages are also provided. Explanations regarding the usage of the web-tool, the required input and output formats are reported in Manual. The implemented methodology and dataset used for training and testing (also available for download through the SBGRID repository at https://data.sbgrid.org/dataset/566/) can be found in Method and Dataset, respectively. A typical output page can be found by clicking on the Example tab. It is also possible to run a test job from scratch in the Home input page, by clicking on the Load Example Data button. The References page provides the relevant literature related to the PRODIGY services and the Help page provides links to the dedicated user support forum (http://ask.bioexcel.eu/c/prodigy) and the standalone code freely available from GitHub. Indeed, in addition to the web-server we also provide a standalone version of the PRODIGY-CRYSTAL code for local use. Information about use, installation and download are reported in our GitHub repository at https://github.com/haddocking/prodigy-cryst. The PRODIGY-CRYSTAL web server was written in Python 3.6.5 based on the Flask framework (version 1.0.2). The server is containerized with Docker (1.20.1) and deployed on a dedicated GNU/Linux server.

3 Conclusion

We have presented here an expansion of PRODIGY, namely PRODIGY-CRYSTAL, for the classification of protein–protein interfaces in crystallographic complexes. The web-tool takes a PDB or mmCIF file as input and applies our contact-based prediction method to classify interfaces as biological or crystallographic, not requiring any additional data. Its user-friendly interface, free availability and extensive online documentation and support will target a large audience, reaching researchers with different background and limited computational resources.

Funding

This work was supported by the European Union’s Horizon 2020 e-Infrastructure grants West-Life [grant number 675858]; BioExcel [grant numbers 675728, 823830] and EOSC-Hub [grant number 777536]; and by the Dutch Foundation for Scientific Research (NWO) [TOP-PUNT grant 718.015.001]. A.V. was supported by the Marie Skłodowska-Curie Individual Fellowship H2020 MSCA-IF-2015 [BAP-659025]. Conflict of Interest: none declared.
  12 in total

1.  Inference of macromolecular assemblies from crystalline state.

Authors:  Evgeny Krissinel; Kim Henrick
Journal:  J Mol Biol       Date:  2007-05-13       Impact factor: 5.469

2.  DiMoVo: a Voronoi tessellation-based method for discriminating crystallographic and biological protein-protein interactions.

Authors:  Julie Bernauer; Ranjit Prasad Bahadur; Francis Rodier; Joël Janin; Anne Poupon
Journal:  Bioinformatics       Date:  2008-01-19       Impact factor: 6.937

3.  NGL viewer: web-based molecular graphics for large complexes.

Authors:  Alexander S Rose; Anthony R Bradley; Yana Valasatava; Jose M Duarte; Andreas Prlic; Peter W Rose
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

4.  Combining Bayes classification and point group symmetry under Boolean framework for enhanced protein quaternary structure inference.

Authors:  Pralay Mitra; Debnath Pal
Journal:  Structure       Date:  2011-03-09       Impact factor: 5.006

5.  Large-scale prediction of binding affinity in protein-small ligand complexes: the PRODIGY-LIG web server.

Authors:  Anna Vangone; Joerg Schaarschmidt; Panagiotis Koukos; Cunliang Geng; Nevia Citro; Mikael E Trellet; Li C Xue; Alexandre M J J Bonvin
Journal:  Bioinformatics       Date:  2019-05-01       Impact factor: 6.937

6.  PRODIGY: a web server for predicting the binding affinity of protein-protein complexes.

Authors:  Li C Xue; João Pglm Rodrigues; Panagiotis L Kastritis; Alexandre Mjj Bonvin; Anna Vangone
Journal:  Bioinformatics       Date:  2016-08-08       Impact factor: 6.937

7.  PRODIGY: A Contact-based Predictor of Binding Affinity in Protein-protein Complexes.

Authors:  Anna Vangone; Alexandre M J J Bonvin
Journal:  Bio Protoc       Date:  2017-02-05

8.  A PDB-wide, evolution-based assessment of protein-protein interfaces.

Authors:  Kumaran Baskaran; Jose M Duarte; Nikhil Biyani; Spencer Bliven; Guido Capitani
Journal:  BMC Struct Biol       Date:  2014-10-18

9.  Performance of HADDOCK and a simple contact-based protein-ligand binding affinity predictor in the D3R Grand Challenge 2.

Authors:  Zeynep Kurkcuoglu; Panagiotis I Koukos; Nevia Citro; Mikael E Trellet; J P G L M Rodrigues; Irina S Moreira; Jorge Roel-Touris; Adrien S J Melquiond; Cunliang Geng; Jörg Schaarschmidt; Li C Xue; Anna Vangone; A M J J Bonvin
Journal:  J Comput Aided Mol Des       Date:  2017-08-22       Impact factor: 3.686

10.  Protein interface classification by evolutionary analysis.

Authors:  Jose M Duarte; Adam Srebniak; Martin A Schärer; Guido Capitani
Journal:  BMC Bioinformatics       Date:  2012-12-22       Impact factor: 3.169

View more
  5 in total

1.  Accurate Classification of Biological and non-Biological Interfaces in Protein Crystal Structures using Subtle Covariation Signals.

Authors:  Yoshinori Fukasawa; Kentaro Tomii
Journal:  Sci Rep       Date:  2019-08-30       Impact factor: 4.379

Review 2.  QSalignWeb: A Server to Predict and Analyze Protein Quaternary Structure.

Authors:  Sucharita Dey; Jaime Prilusky; Emmanuel D Levy
Journal:  Front Mol Biosci       Date:  2022-01-05

3.  DeepRank: a deep learning framework for data mining 3D protein-protein interfaces.

Authors:  Nicolas Renaud; Cunliang Geng; Sonja Georgievska; Francesco Ambrosetti; Lars Ridder; Dario F Marzella; Manon F Réau; Alexandre M J J Bonvin; Li C Xue
Journal:  Nat Commun       Date:  2021-12-03       Impact factor: 14.919

4.  Staphylococcus aureus Exfoliative Toxin E, Oligomeric State and Flip of P186: Implications for Its Action Mechanism.

Authors:  Carolina Gismene; Jorge Enrique Hernández González; Angela Rocio Niño Santisteban; Andrey Fabricio Ziem Nascimento; Lucas Dos Santos Cunha; Fábio Rogério de Moraes; Cristiano Luis Pinto de Oliveira; Caio C Oliveira; Paola Jocelan Scarin Provazzi; Pedro Geraldo Pascutti; Raghuvir Krishnaswamy Arni; Ricardo Barros Mariutti
Journal:  Int J Mol Sci       Date:  2022-08-30       Impact factor: 6.208

5.  PDB-wide identification of physiological hetero-oligomeric assemblies based on conserved quaternary structure geometry.

Authors:  Sucharita Dey; Emmanuel D Levy
Journal:  Structure       Date:  2021-09-13       Impact factor: 5.006

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.