Literature DB >> 21671455

Internal organization of large protein families: relationship between the sequence, structure, and function-based clustering.

Xiao-Hui Cai1, Lukasz Jaroszewski, John Wooley, Adam Godzik.   

Abstract

The protein universe can be organized in families that group proteins sharing common ancestry. Such families display variable levels of structural and functional divergence, from homogenous families, where all members have the same function and very similar structure, to very divergent families, where large variations in function and structure are observed. For practical purposes of structure and function prediction, it would be beneficial to identify sub-groups of proteins with highly similar structures (iso-structural) and/or functions (iso-functional) within divergent protein families. We compared three algorithms in their ability to cluster large protein families and discuss whether any of these methods could reliably identify such iso-structural or iso-functional groups. We show that clustering using profile-sequence and profile-profile comparison methods closely reproduces clusters based on similarities between 3D structures or clusters of proteins with similar biological functions. In contrast, the still commonly used sequence-based methods with fixed thresholds result in vast overestimates of structural and functional diversity in protein families. As a result, these methods also overestimate the number of protein structures that have to be determined to fully characterize structural space of such families. The fact that one can build reliable models based on apparently distantly related templates is crucial for extracting maximal amount of information from new sequencing projects.
Copyright © 2011 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21671455      PMCID: PMC3132221          DOI: 10.1002/prot.23049

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  36 in total

1.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

2.  Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.

Authors:  Golan Yona; Michael Levitt
Journal:  J Mol Biol       Date:  2002-02-01       Impact factor: 5.469

3.  Completeness in structural genomics.

Authors:  D Vitkup; E Melamud; J Moult; C Sander
Journal:  Nat Struct Biol       Date:  2001-06

4.  Profile-profile alignment: a powerful tool for protein structure prediction.

Authors:  Niklas von Ohsen; Ingolf Sommer; Ralf Zimmer
Journal:  Pac Symp Biocomput       Date:  2003

5.  Using evolutionary information for the query and target improves fold recognition.

Authors:  Björn Wallner; Huisheng Fang; Tomas Ohlson; Johannes Frey-Skött; Arne Elofsson
Journal:  Proteins       Date:  2004-02-01

Review 6.  Studying genomes through the aeons: protein families, pseudogenes and proteome evolution.

Authors:  Paul M Harrison; Mark Gerstein
Journal:  J Mol Biol       Date:  2002-05-17       Impact factor: 5.469

7.  Finding weak similarities between proteins by sequence profile comparison.

Authors:  Anna R Panchenko
Journal:  Nucleic Acids Res       Date:  2003-01-15       Impact factor: 16.971

8.  Flexible structure alignment by chaining aligned fragment pairs allowing twists.

Authors:  Yuzhen Ye; Adam Godzik
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

9.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life.

Authors:  J J Ward; J S Sodhi; L J McGuffin; B F Buxton; D T Jones
Journal:  J Mol Biol       Date:  2004-03-26       Impact factor: 5.469

10.  Profile-profile comparisons by COMPASS predict intricate homologies between protein families.

Authors:  Ruslan I Sadreyev; David Baker; Nick V Grishin
Journal:  Protein Sci       Date:  2003-10       Impact factor: 6.725

View more
  3 in total

1.  Basis for substrate recognition and distinction by matrix metalloproteinases.

Authors:  Boris I Ratnikov; Piotr Cieplak; Kosi Gramatikoff; James Pierce; Alexey Eroshkin; Yoshinobu Igarashi; Marat Kazanov; Qing Sun; Adam Godzik; Andrei Osterman; Boguslaw Stec; Alex Strongin; Jeffrey W Smith
Journal:  Proc Natl Acad Sci U S A       Date:  2014-09-22       Impact factor: 11.205

2.  Evolutionary dynamics on protein bi-stability landscapes can potentially resolve adaptive conflicts.

Authors:  Tobias Sikosek; Erich Bornberg-Bauer; Hue Sun Chan
Journal:  PLoS Comput Biol       Date:  2012-09-13       Impact factor: 4.475

Review 3.  State-of-the-Art Biocatalysis.

Authors:  Joshua B Pyser; Suman Chakrabarty; Evan O Romero; Alison R H Narayan
Journal:  ACS Cent Sci       Date:  2021-06-25       Impact factor: 14.553

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.