Literature DB >> 18467424

MultiBind and MAPPIS: webservers for multiple alignment of protein 3D-binding sites and their interactions.

Alexandra Shulman-Peleg¹, Maxim Shatsky, Ruth Nussinov, Haim J Wolfson.

Abstract

Analysis of protein-ligand complexes and recognition of spatially conserved physico-chemical properties is important for the prediction of binding and function. Here, we present two webservers for multiple alignment and recognition of binding patterns shared by a set of protein structures. The first webserver, MultiBind (http://bioinfo3d.cs.tau.ac.il/MultiBind), performs multiple alignment of protein binding sites. It recognizes the common spatial chemical binding patterns even in the absence of similarity of the sequences or the folds of the compared proteins. The input to the MultiBind server is a set of protein-binding sites defined by interactions with small molecules. The output is a detailed list of the shared physico-chemical binding site properties. The second webserver, MAPPIS (http://bioinfo3d.cs.tau.ac.il/MAPPIS), aims to analyze protein-protein interactions. It performs multiple alignment of protein-protein interfaces (PPIs), which are regions of interaction between two protein molecules. MAPPIS recognizes the spatially conserved physico-chemical interactions, which often involve energetically important hot-spot residues that are crucial for protein-protein associations. The input to the MAPPIS server is a set of protein-protein complexes. The output is a detailed list of the shared interaction properties of the interfaces.

Entities: Chemical Species

Mesh：

Substances：
Multiprotein Complexes

Year: 2008 PMID： 18467424 PMCID： PMC2447750 DOI： 10.1093/nar/gkn185

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Proteins, which are essential to all biological systems, function by interacting with other molecules. Consequently, multiple alignment of the protein binding regions can help in defining the properties that are essential for the interaction with certain binding partners and in inferring the function. Here, we consider two related problems of multiple alignments: that of protein-binding sites and of protein–protein interfaces, PPIs, which are formed between pairs of interacting binding sites. Multiple sequence and structural alignment have become a common practice (1). Yet, a dissimilarity in the global properties does not necessarily imply different functions; indeed, it has been shown that convergent evolution of binding sites is not a rare phenomenon (2). Several methods have been developed to identify specific 3D patterns of protein catalytic residues (3–7). However, many binding sites of small molecules such as ATP and estradiol do not share common patterns of amino acids (8–10); rather, they present a set of surface regions with similar physico-chemical properties and shapes. While several approaches have been proposed for recognition and pairwise alignment of such functional sites (10–13), no multiple alignment methods are available. Since pairwise alignments may contain a large number of features that are not essential for the binding, multiple alignment methods are required to determine the smallest set of features, a consensus, that is necessary to achieve a desired biological consequence. Consideration of pairs of interacting binding sites, which form PPIs, provide additional valuable information of the actual interactions formed between the molecules. Analysis of a set of protein–protein complexes helped in gaining important insights toward deciphering the principles of protein–protein interactions (14–20) and their modular architecture (21). Previous PPI alignment methods, which considered the backbone C atoms (22) or the physico-chemical binding patterns (23,24) aligned only pairs of PPIs. However, multiple alignment methods are required for recognition of conservation of the spatial interaction patterns formed between the molecules. In this article, we present two webservers for multiple spatial alignment of protein-binding sites and PPIs. The difference between the two methods is in the input representation. While the first method, MultiBind (25), looks at the binding site surface of one molecule, the second method, MAPPIS (26), constructs interaction edges between two interacting proteins. MultiBind aligns a set of binding sites and recognizes the common spatial arrangements of their physicochemical properties. On the other hand, MAPPIS performs multiple alignments of PPIs and recognizes the spatially conserved interaction patterns. Both methods consider the physico-chemical properties formed by groups of atoms and are independent of the overall similarity in the protein sequences or folds.

MULTIBIND: MULTIPLE ALIGNMENT OF PROTEIN-BINDING SITES

Given a set of binding sites that bind the same small molecule, our goal is to reveal the common physico-chemical pattern that may be responsible for the binding. Figure 1A illustrates the binding site representation, which is crucial for the description of the chemistry of the recognized patterns. Each binding site is determined by the solvent accessible surface points (27) that are located < 4Å from the surface of the binding partner. Following the definition of Schmitt et al. (28), each amino acid in a binding site is represented by points in 3D space termed pseudocenters. Each pseudocenter represents one of the following properties important for protein–ligand interactions: hydrogen-bond donor (DON), hydrogen-bond acceptor (ACC), mixed donor/acceptor (DAC), hydrophobic aliphatic (ALI) and aromatic contacts (PI). We considered all the pseudocenters with at least one surface exposed atom. The pseudocenters and the surfaces are assigned such attributes as charge, normal vectors of the surface direction, ring plane orientation as well as surface patch size and curvature (25,29).

Figure 1.

Physico-chemical representation of binding sites and PPIs. (A) Representation of a binding site by its pseudocenters (balls). Hydrogen bond donors are blue, acceptors are red, donors/acceptors are green, hydrophobic aliphatic are orange and aromatic are white/gray. The surface is represented as dots and are colored according to the property of the corresponding, surface exposed, pseudocenters. (B) An interface as a pair of interacting binding sites. The surfaces and the pseudocenters are colored as in (A). The rightmost figure illustrates the definition of pseudocenters and the bar at the bottom illustrates the complementarity of the pseudocenter properties. MultiBind (25) is an efficient method, which achieves this goal by local multiple alignment of protein binding sites, which are not assumed to share any sequence or fold similarity. MultiBind utilizes a time efficient Geometric Hashing method (30), which allows recognition of the candidate 3D transformations that align pairs of structures. Then, by applying a branch-and-bound procedure, MultiBind recognizes a combination of multiple 3D transformations that give the highest scoring common 3D pattern. The score of a pattern is the sum of similarity scores of the matched pseudocenters. These are measured by a scoring function that compares properties like spatial proximity (after the superimposition), charge, surface curvature as well as aromatic ring plane orientation. The input to the MultiBind webserver consists of set of protein—small molecule complexes (defined by the PDB codes or uploaded files). The binding sites of each complex are automatically extracted according to the ligands bound to the input structure. The output of MultiBind is a set of physico-chemical properties shared by all the input binding sites. We provide the details of the properties and the amino acids that contribute to these as well as the 3D transformation that superimposes the binding sites in 3D space. We provide a PDB file with the spatial superimposition of the input complexes, which can be viewed online with a Jmol script that visualizes the shared patterns (Figure 2).

Figure 2.

Web interface and output of MultiBind. (A) Entrance webserver page. (B) Selection of the binding sites of interest by the description of the bound ligands. Only ligands listed as HETATM records with more than seven non-hydrogen atoms are considered. (C) Example of an output page, which details the matched pseudocenters of the common pattern. Each three columns present the details of a specific pseudocenter: (i) chain identifier and residue number; (ii) residue type; (iii) pseudocenter type. Although the pseudocenters are not required to have the same amino acid identity or origin (backbone or side chain), we indicate the conservation of these (b/s or *, respectively). (D) A default Jmol visualization of the superimposed complexes. (E) A Jmol visualization of the common pattern. The shared pseudocenters are represented as balls, colored as in Figure 1. The ligands, represented as sticks, are not considered by MultiBind, but their spatial alignment supports the correctness of the solution. The buttons at the bottom detail the web page options that should be selected to obtain this visualization automatically.

MAPPIS: MULTIPLE ALIGNMENT OF PPIs

Given a set of protein–protein complexes, our goal is to align them in 3D space and recognize the shared spatially conserved interaction patterns. Similarly to multiple sequence and structure alignment, the main motivation is the assumption that an interaction common to a number of interfaces is functionally more significant than a similar interaction found in a single or a pair of PPIs. The uniqueness of MAPPIS lies in its ability to detect spatially conserved patterns of interactions even when there is no sequence or fold similarity between the corresponding proteins. Recently, we have applied MAPPIS to different families of PPIs and observed that most of the conserved physico-chemical interactions are contributed by the hot spot residues, and consequently, MAPPIS predicts hot spots with a high success rate (29). Figure 1B, illustrates the PPI representation by its physico-chemical properties and interactions. Specifically, an interaction across PPIs is defined by a pair of close enough pseudocenters, one from each side of the interface, possessing complementary physico-chemical properties (hydrogen bond donors are complementary to acceptors, while hydrophobic aliphatic and aromatic centers can interact with similar ones). MAPPIS calculates a set of transformations, which superimpose the PPIs according to their similar interactions that can be of the following three types: hydrogen bonds, hydrophobic aliphatic and aromatic (π) contacts. Two interactions are considered similar if they are created by similar pseudocenters that are superimposed to nearby spatial locations (e.g. within 3Å). The similarity of interactions from two different PPIs is scored according to the similarity of the corresponding pseudocenters and the complementarity of their properties. Specifically, we measured the complementarity in terms of the pseudocenter proximity, charge complementarity, surface fit as well as aromatic ring orientations (favoring perpendicular and parallel π stacking). MAPPIS finds a set of transformations that superimpose the input PPIs in 3D space in a way that maximizes the spatial and chemical similarity of their interactions and pseudocenters. The input to MAPPIS is a set of protein–protein complexes with at least one pair of interacting protein chains. The interacting chains, which define the PPI, can be either specified by the user or selected from a list of automatically recognized interactions. The chain definition is followed by the automatic construction of the PPIs and their multiple alignment with MAPPIS. The output of MAPPIS is a set of the physico-chemical interactions shared by all the PPIs. We provide a PDB file with the superimposed complexes, which can be viewed online with a Jmol script that visualizes the shared properties and interactions (Figure 3).

Figure 3.

Web interface and output of MAPPIS. (A) Entrance webserver page. (B) An option for the automatic selection of the interacting protein chains (if not specified manually). Two protein chains are considered as interacting if there are at least five atoms of one chain that are within the distance of 6.0 Å from the other. (C) Example of an output page, which details the common pattern of matched interactions. Each pair of rows details a shared interaction, which is defined by the interacting pseudocenters from two different chains of a PPI. As in MultiBind each three columns present the details of a specific pseudocenter of a PPI. (D) A default Jmol visualization of the aligned complexes. The shared physico-chemical properties are represented as in Figure 1. The corresponding common interactions are represented as yellow sticks.

PERFORMANCE AND AVAILABILITY

The webservers of MultiBind and MAPPIS are available from http://bioinfo3d.cs.tau.ac.il. Although the running times of each algorithm are several minutes, the server overload may lead to longer running times. Consequently, the user has an option to supply an email address to which the link to the output page will be sent upon the completion. Users who are interested in performing large scale database analysis and classification are advised to download the freely available software packages. The packages contain the Linux executable programs as well as user manuals and a set of scripts for the extraction of binding sites and PPIs.

26 in total

Review 1. Electrostatic aspects of protein-protein interactions.

Authors: F B Sheinerman; R Norel; B Honig
Journal: Curr Opin Struct Biol Date: 2000-04 Impact factor: 6.809

2. Dissecting protein-protein recognition sites.

Authors: Pinak Chakrabarti; Joël Janin
Journal: Proteins Date: 2002-05-15

3. A new method to detect related function among proteins independent of sequence and fold homology.

Authors: Stefan Schmitt; Daniel Kuhn; Gerhard Klebe
Journal: J Mol Biol Date: 2002-10-18 Impact factor: 5.469

4. Searching for patterns of amino acids in 3D protein structures.

Authors: Ruth V Spriggs; Peter J Artymiuk; Peter Willett
Journal: J Chem Inf Comput Sci Date: 2003 Mar-Apr

Review 5. Diversity of protein-protein interactions.

Authors: Irene M A Nooren; Janet M Thornton
Journal: EMBO J Date: 2003-07-15 Impact factor: 11.598

6. A new bioinformatic approach to detect common 3D sites in protein structures.

Authors: Martin Jambon; Anne Imberty; Gilbert Deléage; Christophe Geourjon
Journal: Proteins Date: 2003-08-01

7. A new, structurally nonredundant, diverse data set of protein-protein interfaces and its implications.

Authors: Ozlem Keskin; Chung-Jung Tsai; Haim Wolfson; Ruth Nussinov
Journal: Protein Sci Date: 2004-04 Impact factor: 6.725

8. Inferring functional relationships of proteins from local sequence and spatial surface patterns.

Authors: T Andrew Binkowski; Larisa Adamian; Jie Liang
Journal: J Mol Biol Date: 2003-09-12 Impact factor: 5.469

9. Adenine recognition: a motif present in ATP-, CoA-, NAD-, NADP-, and FAD-dependent proteins.

Authors: K A Denessiouk; V V Rantanen; M S Johnson
Journal: Proteins Date: 2001-08-15

10. Spatial chemical conservation of hot spot interactions in protein-protein complexes.

Authors: Alexandra Shulman-Peleg; Maxim Shatsky; Ruth Nussinov; Haim J Wolfson
Journal: BMC Biol Date: 2007-10-09 Impact factor: 7.431

45 in total

1. Human proteome-scale structural modeling of E2-E3 interactions exploiting interface motifs.

Authors: Gozde Kar; Ozlem Keskin; Ruth Nussinov; Attila Gursoy
Journal: J Proteome Res Date: 2012-01-10 Impact factor: 4.466

2. PESDserv: a server for high-throughput comparison of protein binding site surfaces.

Authors: Sourav Das; Michael P Krein; Curt M Breneman
Journal: Bioinformatics Date: 2010-06-10 Impact factor: 6.937

3. PatchSearch: a web server for off-target protein identification.

Authors: Julien Rey; Inès Rasolohery; Pierre Tufféry; Frédéric Guyon; Gautier Moroy
Journal: Nucleic Acids Res Date: 2019-07-02 Impact factor: 16.971

4. Computational structural analysis of proteins of Mycobacterium tuberculosis and a resource for identifying off-targets.

Authors: Sameer Hassan; Abhimita Debnath; Vasantha Mahalingam; Luke Elizabeth Hanna
Journal: J Mol Model Date: 2012-04-27 Impact factor: 1.810