Literature DB >> 11967367

Structural similarity to link sequence space: new potential superfamilies and implications for structural genomics.

Patrick Aloy1, Baldomero Oliva, Enrique Querol, Francesc X Aviles, Robert B Russell.   

Abstract

The current pace of structural biology now means that protein three-dimensional structure can be known before protein function, making methods for assigning homology via structure comparison of growing importance. Previous research has suggested that sequence similarity after structure-based alignment is one of the best discriminators of homology and often functional similarity. Here, we exploit this observation, together with a merger of protein structure and sequence databases, to predict distant homologous relationships. We use the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases. We thus provide new alignments that could not be constructed easily in the absence of known three-dimensional structures. We then extend the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggest the best link between diverse sequence families. We find that several distantly related protein sequence families can be linked with confidence, showing the approach to be a means for inferring homologous relationships and thus possible functions when proteins are of known structure but of unknown function. The analysis also finds several new potential superfamilies, where inspection of the associated alignments and superimpositions reveals conservation of unusual structural features or co-location of conserved amino acids and bound substrates. We discuss implications for Structural Genomics initiatives and for improvements to sequence comparison methods.

Mesh:

Substances:

Year:  2002        PMID: 11967367      PMCID: PMC2373547          DOI: 10.1110/ps.3950102

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  57 in total

1.  Evolution of function in protein superfamilies, from a structural perspective.

Authors:  A E Todd; C A Orengo; J M Thornton
Journal:  J Mol Biol       Date:  2001-04-06       Impact factor: 5.469

2.  Identification of homology in protein structure classification.

Authors:  S Dietmann; L Holm
Journal:  Nat Struct Biol       Date:  2001-11

3.  Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels.

Authors:  R B Russell; G J Barton
Journal:  Proteins       Date:  1992-10

4.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

Review 5.  The P-loop--a common motif in ATP- and GTP-binding proteins.

Authors:  M Saraste; P R Sibbald; A Wittinghofer
Journal:  Trends Biochem Sci       Date:  1990-11       Impact factor: 13.807

6.  Structural resemblance between the families of bacterial signal-transduction proteins and of G proteins revealed by graph theoretical techniques.

Authors:  P J Artymiuk; D W Rice; E M Mitchell; P Willett
Journal:  Protein Eng       Date:  1990-10

7.  Structural design and molecular evolution of a cytokine receptor superfamily.

Authors:  J F Bazan
Journal:  Proc Natl Acad Sci U S A       Date:  1990-09       Impact factor: 11.205

8.  ALSCRIPT: a tool to format multiple sequence alignments.

Authors:  G J Barton
Journal:  Protein Eng       Date:  1993-01

9.  Roles of the highly conserved aspartate and lysine residues in the response regulator of bacterial chemotaxis.

Authors:  G S Lukat; B H Lee; J M Mottonen; A M Stock; J B Stock
Journal:  J Biol Chem       Date:  1991-05-05       Impact factor: 5.157

10.  Three-dimensional structure of the bifunctional enzyme phosphoribosylanthranilate isomerase: indoleglycerolphosphate synthase from Escherichia coli refined at 2.0 A resolution.

Authors:  M Wilmanns; J P Priestle; T Niermann; J N Jansonius
Journal:  J Mol Biol       Date:  1992-01-20       Impact factor: 5.469

View more
  5 in total

1.  Structural similarity to bridge sequence space: finding new families on the bridges.

Authors:  Parantu K Shah; Patrick Aloy; Peer Bork; Robert B Russell
Journal:  Protein Sci       Date:  2005-05       Impact factor: 6.725

2.  Prediction of a new class of RNA recognition motif.

Authors:  Núria Cerdà-Costa; Jaume Bonet; M Rosario Fernández; Francesc X Avilés; Baldomero Oliva; Sandra Villegas
Journal:  J Mol Model       Date:  2010-11-17       Impact factor: 1.810

3.  Protein-protein interaction hotspots carved into sequences.

Authors:  Yanay Ofran; Burkhard Rost
Journal:  PLoS Comput Biol       Date:  2007-07       Impact factor: 4.475

4.  SUPFAM: a database of sequence superfamilies of protein domains.

Authors:  Shashi B Pandit; Rana Bhadra; V S Gowri; S Balaji; B Anand; N Srinivasan
Journal:  BMC Bioinformatics       Date:  2004-03-15       Impact factor: 3.169

5.  Profile-profile comparisons by COMPASS predict intricate homologies between protein families.

Authors:  Ruslan I Sadreyev; David Baker; Nick V Grishin
Journal:  Protein Sci       Date:  2003-10       Impact factor: 6.725

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.