Literature DB >> 19508719

Combining specificity determining and conserved residues improves functional site prediction.

Olga V Kalinina1, Mikhail S Gelfand, Robert B Russell.   

Abstract

BACKGROUND: Predicting the location of functionally important sites from protein sequence and/or structure is a long-standing problem in computational biology. Most current approaches make use of sequence conservation, assuming that amino acid residues conserved within a protein family are most likely to be functionally important. Most often these approaches do not consider many residues that act to define specific sub-functions within a family, or they make no distinction between residues important for function and those more relevant for maintaining structure (e.g. in the hydrophobic core). Many protein families bind and/or act on a variety of ligands, meaning that conserved residues often only bind a common ligand sub-structure or perform general catalytic activities.
RESULTS: Here we present a novel method for functional site prediction based on identification of conserved positions, as well as those responsible for determining ligand specificity. We define Specificity-Determining Positions (SDPs), as those occupied by conserved residues within sub-groups of proteins in a family having a common specificity, but differ between groups, and are thus likely to account for specific recognition events. We benchmark the approach on enzyme families of known 3D structure with bound substrates, and find that in nearly all families residues predicted by SDPsite are in contact with the bound substrate, and that the addition of SDPs significantly improves functional site prediction accuracy. We apply SDPsite to various families of proteins containing known three-dimensional structures, but lacking clear functional annotations, and discusse several illustrative examples.
CONCLUSION: The results suggest a better means to predict functional details for the thousands of protein structures determined prior to a clear understanding of molecular function.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19508719      PMCID: PMC2709924          DOI: 10.1186/1471-2105-10-174

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  59 in total

1.  Structure of YciI from Haemophilus influenzae (HI0828) reveals a ferredoxin-like alpha/beta-fold with a histidine/aspartate centered catalytic site.

Authors:  Mark A Willis; Feng Song; Zhihao Zhuang; Wojciech Krajewski; Vani Rao Chalamasetty; Prasad Reddy; Andrew Howard; Debra Dunaway-Mariano; Osnat Herzberg
Journal:  Proteins       Date:  2005-05-15

2.  Prediction of functional specificity determinants from protein sequences using log-likelihood ratios.

Authors:  Jimin Pei; Wei Cai; Lisa N Kinch; Nick V Grishin
Journal:  Bioinformatics       Date:  2005-11-08       Impact factor: 6.937

3.  Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites.

Authors:  Alasdair T R Laurie; Richard M Jackson
Journal:  Bioinformatics       Date:  2005-02-08       Impact factor: 6.937

4.  BADASP: predicting functional specificity in protein families using ancestral sequences.

Authors:  Richard J Edwards; Denis C Shields
Journal:  Bioinformatics       Date:  2005-09-13       Impact factor: 6.937

Review 5.  The impact of structural genomics: expectations and outcomes.

Authors:  John-Marc Chandonia; Steven E Brenner
Journal:  Science       Date:  2006-01-20       Impact factor: 47.728

6.  Automated discovery of 3D motifs for protein function annotation.

Authors:  Benjamin J Polacco; Patricia C Babbitt
Journal:  Bioinformatics       Date:  2006-01-12       Impact factor: 6.937

7.  PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees.

Authors:  Simon Whelan; Paul I W de Bakker; Emmanuel Quevillon; Nicolas Rodriguez; Nick Goldman
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  Linking enzyme sequence to function using Conserved Property Difference Locator to identify and annotate positions likely to control specific functionality.

Authors:  Kimberly M Mayer; Sean R McCorkle; John Shanklin
Journal:  BMC Bioinformatics       Date:  2005-11-30       Impact factor: 3.169

9.  Pfam: clans, web tools and services.

Authors:  Robert D Finn; Jaina Mistry; Benjamin Schuster-Böckler; Sam Griffiths-Jones; Volker Hollich; Timo Lassmann; Simon Moxon; Mhairi Marshall; Ajay Khanna; Richard Durbin; Sean R Eddy; Erik L L Sonnhammer; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Predicting specificity-determining residues in two large eukaryotic transcription factor families.

Authors:  Jason E Donald; Eugene I Shakhnovich
Journal:  Nucleic Acids Res       Date:  2005-08-05       Impact factor: 16.971

View more
  16 in total

1.  Molecular dynamics and docking simulations as a proof of high flexibility in E. coli FabH and its relevance for accurate inhibitor modeling.

Authors:  Yunierkis Pérez-Castillo; Matheus Froeyen; Miguel Angel Cabrera-Pérez; Ann Nowé
Journal:  J Comput Aided Mol Des       Date:  2011-04-23       Impact factor: 3.686

Review 2.  Emerging methods in protein co-evolution.

Authors:  David de Juan; Florencio Pazos; Alfonso Valencia
Journal:  Nat Rev Genet       Date:  2013-03-05       Impact factor: 53.242

3.  pocketZebra: a web-server for automated selection and classification of subfamily-specific binding sites by bioinformatic analysis of diverse protein families.

Authors:  Dmitry Suplatov; Eugeny Kirilin; Mikhail Arbatsky; Vakil Takhaveev; Vytas Svedas
Journal:  Nucleic Acids Res       Date:  2014-05-22       Impact factor: 16.971

4.  Functionally important positions can comprise the majority of a protein's architecture.

Authors:  Sudheer Tungtur; Daniel J Parente; Liskin Swint-Kruse
Journal:  Proteins       Date:  2011-03-04

5.  Multi-Harmony: detecting functional specificity from sequence alignment.

Authors:  Bernd W Brandt; K Anton Feenstra; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2010-06-04       Impact factor: 16.971

6.  An automated stochastic approach to the identification of the protein specificity determinants and functional subfamilies.

Authors:  Pavel V Mazin; Mikhail S Gelfand; Andrey A Mironov; Aleksandra B Rakhmaninova; Anatoly R Rubinov; Robert B Russell; Olga V Kalinina
Journal:  Algorithms Mol Biol       Date:  2010-07-15       Impact factor: 1.405

7.  Comparing the functional roles of nonconserved sequence positions in homologous transcription repressors: implications for sequence/function analyses.

Authors:  Sudheer Tungtur; Sarah Meinhardt; Liskin Swint-Kruse
Journal:  J Mol Biol       Date:  2009-10-08       Impact factor: 5.469

8.  Principal components analysis of protein sequence clusters.

Authors:  Bo Wang; Michael A Kennedy
Journal:  J Struct Funct Genomics       Date:  2014-02-05

9.  Comparative Bioinformatic Analysis of Active Site Structures in Evolutionarily Remote Homologues of α,β-Hydrolase Superfamily Enzymes.

Authors:  D A Suplatov; V K Arzhanik; V K Svedas
Journal:  Acta Naturae       Date:  2011-01       Impact factor: 1.845

10.  CLIPS-1D: analysis of multiple sequence alignments to deduce for residue-positions a role in catalysis, ligand-binding, or protein structure.

Authors:  Jan-Oliver Janda; Markus Busch; Fabian Kück; Mikhail Porfenenko; Rainer Merkl
Journal:  BMC Bioinformatics       Date:  2012-04-05       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.