ANCHOR: web server for predicting protein binding regions in disordered proteins.

Zsuzsanna Dosztányi, Bálint Mészáros, István Simon.

Abstract

ANCHOR is a web-based implementation of an original method that takes a single amino acid sequence as input and predicts protein binding regions that are disordered in isolation but can undergo a disorder-to-order transition upon binding. The server incorporates the result of a general disorder prediction method, IUPred, and can also carry out simple motif searches. AVAILABILITY: The web server is available at http://anchor.enzim.hu. The program package is freely available for academic users.

Year:  2009        PMID: 19717576      PMCID: PMC2759549          DOI: 10.1093/bioinformatics/btp518

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Many disordered proteins contain important functional elements involved in protein–protein interactions. Disordered binding regions play a critical role in various biological processes, including regulation and signaling (Dyson and Wright, 2002). These segments differ from the protein interaction sites of globular proteins in their distinct structural properties (Mészáros et al., 2007). Such regions exist as a highly flexible structural ensemble in isolation and adopt a well-defined conformation only upon binding to their specific partner molecules. It was suggested that certain disorder prediction methods can be indicative of disordered binding regions (Garner et al., 1999). Specialized methods have been developed to predict regions that adopt an α-helical conformation in their bound state (Cheng et al., 2007) or for the binding partners of calmodulin (Radivojac et al., 2006). In contrast, ANCHOR is a general method for recognizing disordered binding regions. ANCHOR aims to capture the basic biophysical properties of disordered binding regions using estimated energy calculations (Mészáros et al., 2009). Estimated energies can be assigned to each residue in a sequence and were shown to approximate well the corresponding energies calculated from known structures of globular proteins (Dosztányi et al., 2005b). Generally, disordered regions can be discriminated from ordered proteins by their unfavorable estimated energies. This concept is utilized in the IUPred server for the prediction of protein disorder (Dosztányi et al., 2005a). The estimated energies can also detect regions that are likely to gain energetically by interacting with globular proteins. Predictions in ANCHOR combine the general disorder tendency with the sensitivity to the structural environment (Mészáros et al., 2009). Because of this additional property, ANCHOR scores are relatively independent of IUPred scores.
The developed method was able to recognize disordered binding regions with almost 70% accuracy at the segment level on various datasets. We also ensured that disordered binding regions could be discriminated from generally disordered regions and that the false positive rate on a dataset of globular proteins was <5%. Since the publication of the original paper (Mészáros et al., 2009), we have found that the false positive rate can be further reduced by eliminating segments with IUPred scores too low to be compatible with disordered binding regions. Additionally, predicted segments shorter than six residues are filtered out. ANCHOR predicts disordered binding regions without any information about the partner protein(s). A complementary approach identifies protein binding regions using motif searches. It was suggested that interactions with certain proteins or protein families are mediated through specific linear motifs that capture the key residues responsible for binding. A growing number of such linear motifs are now being categorized in the ELM server (Puntervoll et al., 2003). The presence of sequence motifs reduces the complex task of finding putative protein binding sites to a simple pattern matching problem. However, such matches can contain many false positives, suggesting that the definition of the binding motif should include information about the specific structural context. Since several instances of linear motifs occur within disordered regions, the prediction of disordered binding regions could help to filter out false positive matches. Therefore, complementing the prediction of disordered binding regions with specific motif searches can prove useful in many cases and help to explore other motifs.
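As a rough illustration of the post-processing described above, the following Python sketch turns per-residue scores into predicted segments. It uses the 0.5 score cutoff applied by the server and the six-residue minimum length. The exact IUPred criterion is not specified here, so the mean-score test and its `min_mean_iupred` value are hypothetical placeholders, as are the function and parameter names. This is not the server's own implementation.

```python
from typing import List, Sequence, Tuple


def call_binding_regions(
    anchor_scores: Sequence[float],
    iupred_scores: Sequence[float],
    score_cutoff: float = 0.5,      # residues with a score above this are candidates
    min_length: int = 6,            # segments shorter than six residues are dropped
    min_mean_iupred: float = 0.1,   # ASSUMPTION: hypothetical cutoff, not from the paper
) -> List[Tuple[int, int]]:
    """Return predicted segments as 1-based, inclusive (start, end) pairs."""
    if len(anchor_scores) != len(iupred_scores):
        raise ValueError("score profiles must have the same length")

    regions: List[Tuple[int, int]] = []
    start = None  # 0-based start of the segment currently being extended

    # A sentinel below any cutoff closes a segment that reaches the C-terminus.
    for i, score in enumerate(list(anchor_scores) + [float("-inf")]):
        if score > score_cutoff:
            if start is None:
                start = i
        elif start is not None:
            end = i  # exclusive, 0-based
            length = end - start
            # ASSUMPTION: the IUPred filter is modeled as a mean over the segment.
            mean_iupred = sum(iupred_scores[start:end]) / length
            if length >= min_length and mean_iupred >= min_mean_iupred:
                regions.append((start + 1, end))
            start = None
    return regions
```

Both filters only remove candidate segments, so they can lower the false positive rate but cannot add predictions.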

2 THE ANCHOR SERVER

The minimum input of the web server is a single amino acid sequence. Sequences can also be specified by their corresponding UniProt IDs or ACs. A list of motifs can also be submitted, specified as regular expressions with or without their names. A few examples, including known eukaryotic linear motifs, are given in the help to guide the user with the format. The motif search, however, is not restricted to known linear motifs; any kind of regular expression can be specified. The basic output of our prediction method is a probability score for each position in the sequence, indicating the likelihood that the residue is part of a disordered binding region. Regions that have a score >0.5 and pass the filtering criteria are predicted as disordered binding regions. The returned plot shows the prediction profile calculated by ANCHOR, the disordered binding region prediction method, together with that of IUPred, a general disorder prediction method. Predicted disordered binding regions and matched motifs are also indicated underneath the profile as horizontal bars. The graphical output is followed by a simple text output, summarizing the predicted and filtered binding regions, the locations of the motifs found and the returned prediction profile. An example of the graphical output is presented in Figure 1. The core program of ANCHOR is written in C, while motif searches are carried out by a Perl wrapper. This Perl program is called by the web server, which is written in PHP. The graphical output is generated by the JpGraph software (JpGraph, 2005; http://www.aditus.nu/jpgraph/). The default option for graphical/text output is automatically determined by the browser type, but it can be changed by the user. Additionally, a list of sequences can be submitted to generate simple text output on a larger scale.
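The motif search described above amounts to matching named regular expressions against the sequence. The Python sketch below illustrates the idea only. The server itself uses a Perl wrapper, and the function name, the toy sequence and the two patterns in the test are hypothetical examples rather than ELM definitions. Each pattern is wrapped in a lookahead so that overlapping matches are also reported, which is a design choice of this sketch.

```python
import re
from typing import Dict, List, Tuple


def find_motifs(sequence: str, motifs: Dict[str, str]) -> List[Tuple[str, int, int, str]]:
    """Return (name, start, end, matched_text) hits with 1-based, inclusive positions."""
    hits: List[Tuple[str, int, int, str]] = []
    for name, pattern in motifs.items():
        # The zero-width lookahead lets finditer advance one residue at a time,
        # so matches that overlap each other are all found.
        regex = re.compile(f"(?=({pattern}))")
        for match in regex.finditer(sequence):
            matched = match.group(1)
            if not matched:  # ignore patterns that can match the empty string
                continue
            start = match.start(1)
            hits.append((name, start + 1, start + len(matched), matched))
    # Order hits along the sequence, then by motif name.
    return sorted(hits, key=lambda hit: (hit[1], hit[0]))
```

Hits reported this way can be compared with the predicted disordered binding regions to flag matches that fall outside them as likely false positives.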
Fig. 1.

An example of ANCHOR graphical output for the Wiskott–Aldrich Syndrome protein (WASp) with various motif searches. The N-terminal region of the protein contains an ordered domain; otherwise the protein is largely disordered. Multiple disordered binding regions were predicted, and several of these can be confirmed experimentally [see Mészáros et al. (2009) for more details]. The results of the motif searches show regions containing various SH3 binding sites as specified in the ELM database. Additionally, proline-rich regions and the CRIB motif implicated in binding to Cdc42 can also be located.

Funding: Hungarian Scientific Research Fund (OTKA-K72569); the National Office for Research and Technology, Hungary (NKTH07a-TB_INTER).

Conflict of Interest: none declared.
REFERENCES

1.  Coupling of folding and binding for unstructured proteins.

Authors:  H Jane Dyson; Peter E Wright
Journal:  Curr Opin Struct Biol       Date:  2002-02       Impact factor: 6.809

2.  ELM server: A new resource for investigating short functional sites in modular eukaryotic proteins.

Authors:  Pål Puntervoll; Rune Linding; Christine Gemünd; Sophie Chabanis-Davidson; Morten Mattingsdal; Scott Cameron; David M A Martin; Gabriele Ausiello; Barbara Brannetti; Anna Costantini; Fabrizio Ferrè; Vincenza Maselli; Allegra Via; Gianni Cesareni; Francesca Diella; Giulio Superti-Furga; Lucjan Wyrwicz; Chenna Ramu; Caroline McGuigan; Rambabu Gudavalli; Ivica Letunic; Peer Bork; Leszek Rychlewski; Bernhard Küster; Manuela Helmer-Citterich; William N Hunter; Rein Aasland; Toby J Gibson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content.

Authors:  Zsuzsanna Dosztányi; Veronika Csizmok; Peter Tompa; István Simon
Journal:  Bioinformatics       Date:  2005-06-14       Impact factor: 6.937

4.  Mining alpha-helix-forming molecular recognition features with cross species sequence alignments.

Authors:  Yugong Cheng; Christopher J Oldfield; Jingwei Meng; Pedro Romero; Vladimir N Uversky; A Keith Dunker
Journal:  Biochemistry       Date:  2007-11-01       Impact factor: 3.162

5.  Molecular principles of the interactions of disordered proteins.

Authors:  Bálint Mészáros; Peter Tompa; István Simon; Zsuzsanna Dosztányi
Journal:  J Mol Biol       Date:  2007-07-12       Impact factor: 5.469

6.  The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins.

Authors:  Zsuzsanna Dosztányi; Veronika Csizmók; Péter Tompa; István Simon
Journal:  J Mol Biol       Date:  2005-04-08       Impact factor: 5.469

7.  Predicting Binding Regions within Disordered Proteins.

Authors:  Garner et al.
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  1999

8.  Calmodulin signaling: analysis and prediction of a disorder-dependent molecular recognition.

Authors:  Predrag Radivojac; Slobodan Vucetic; Timothy R O'Connor; Vladimir N Uversky; Zoran Obradovic; A Keith Dunker
Journal:  Proteins       Date:  2006-05-01

9.  Prediction of protein binding regions in disordered proteins.

Authors:  Bálint Mészáros; István Simon; Zsuzsanna Dosztányi
Journal:  PLoS Comput Biol       Date:  2009-05-01       Impact factor: 4.475
