Literature DB >> 10211816

CORA--topological fingerprints for protein structural families.

C A Orengo1.   

Abstract

CORA is a suite of programs for multiply aligning and analyzing protein structural families to identify the consensus positions and capture their most conserved structural characteristics (e.g., residue accessibility, torsional angles, and global geometry as described by inter-residue vectors/contacts). Knowledge of these structurally conserved positions, which are mostly in the core of the fold and of their properties, significantly improves the identification and classification of newly-determined relatives. Information is encoded in a consensus three-dimensional (3D) template and relatives found by a sensitive alignment method, which employs a new scoring scheme based on conserved residue contacts. By encapsulating these critical "core" features, templates perform more reliably in recognizing distant structural relatives than searches with representative structures. Parameters for 3D-template generation and alignment were optimized for each structural class (mainly-alpha, mainly-beta, alpha-beta), using representative superfold families. For all families selected, the templates gave significant improvements in sensitivity and selectivity in recognizing distant structural relatives. Furthermore, since templates contain less than 70% of fold positions and compare fewer positions when aligning structures, scans are at least an order of magnitude faster than scans using selected structures. CORA was subsequently tested on eight other broad structural families from the CATH database. Diagnostics plots are generated automatically and provide qualitative assistance for classifying newly determined relatives. They are demonstrated here by application to the large globin-like fold family. CORA templates for both homologous superfamilies and fold families will be stored in CATH and used to improve the classification and analysis of newly determined structures.

Mesh:

Year:  1999        PMID: 10211816      PMCID: PMC2144314          DOI: 10.1110/ps.8.4.699

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  35 in total

1.  Multiple protein sequence alignment from tertiary structure comparison: assignment of global and residue confidence levels.

Authors:  R B Russell; G J Barton
Journal:  Proteins       Date:  1992-10

2.  Fast structure alignment for protein databank searching.

Authors:  C A Orengo; N P Brown; W R Taylor
Journal:  Proteins       Date:  1992-10

3.  A rapid method of protein structure alignment.

Authors:  C A Orengo; W R Taylor
Journal:  J Theor Biol       Date:  1990-12-21       Impact factor: 2.691

4.  Database of homology-derived protein structures and the structural meaning of sequence alignment.

Authors:  C Sander; R Schneider
Journal:  Proteins       Date:  1991

5.  A database of protein structure families with common folding motifs.

Authors:  L Holm; C Ouzounis; C Sander; G Tuparev; G Vriend
Journal:  Protein Sci       Date:  1992-12       Impact factor: 6.725

6.  Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming.

Authors:  A Sali; T L Blundell
Journal:  J Mol Biol       Date:  1990-03-20       Impact factor: 5.469

7.  Identification of protein folds: matching hydrophobicity patterns of sequence sets with solvent accessibility patterns of known structures.

Authors:  J U Bowie; N D Clarke; C O Pabo; R T Sauer
Journal:  Proteins       Date:  1990

8.  Alignment and searching for common protein folds using a data bank of structural templates.

Authors:  M S Johnson; J P Overington; T L Blundell
Journal:  J Mol Biol       Date:  1993-06-05       Impact factor: 5.469

9.  Protein structure comparison by alignment of distance matrices.

Authors:  L Holm; C Sander
Journal:  J Mol Biol       Date:  1993-09-05       Impact factor: 5.469

10.  OB(oligonucleotide/oligosaccharide binding)-fold: common structural and functional solution for non-homologous sequences.

Authors:  A G Murzin
Journal:  EMBO J       Date:  1993-03       Impact factor: 11.598

View more
  17 in total

1.  Assigning genomic sequences to CATH.

Authors:  F M Pearl; D Lee; J E Bray; I Sillitoe; A E Todd; A P Harrison; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A rapid classification protocol for the CATH Domain Database to support structural genomics.

Authors:  F M Pearl; N Martin; J E Bray; D W Buchan; A P Harrison; D Lee; G A Reeves; A J Shepherd; I Sillitoe; A E Todd; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  Amino acid-base interactions: a three-dimensional analysis of protein-DNA interactions at an atomic level.

Authors:  N M Luscombe; R A Laskowski; J M Thornton
Journal:  Nucleic Acids Res       Date:  2001-07-01       Impact factor: 16.971

4.  FoldMiner: structural motif discovery using an improved superposition algorithm.

Authors:  Jessica Shapiro; Douglas Brutlag
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

5.  A consensus view of fold space: combining SCOP, CATH, and the Dali Domain Dictionary.

Authors:  Ryan Day; David A C Beck; Roger S Armen; Valerie Daggett
Journal:  Protein Sci       Date:  2003-10       Impact factor: 6.725

6.  Detailed analysis of function divergence in a large and diverse domain superfamily: toward a refined protocol of function classification.

Authors:  Benoit H Dessailly; Oliver C Redfern; Alison L Cuff; Christine A Orengo
Journal:  Structure       Date:  2010-11-10       Impact factor: 5.006

7.  Protein-RNA interactions: a structural analysis.

Authors:  S Jones; D T Daley; N M Luscombe; H M Berman; J M Thornton
Journal:  Nucleic Acids Res       Date:  2001-02-15       Impact factor: 16.971

8.  Automatic generation and evaluation of sparse protein signatures for families of protein structural domains.

Authors:  Matthew J Blades; Jon C Ison; Ranjeeva Ranasinghe; John B C Findlay
Journal:  Protein Sci       Date:  2005-01       Impact factor: 6.725

9.  Assessing strategies for improved superfamily recognition.

Authors:  Ian Sillitoe; Mark Dibley; James Bray; Sarah Addou; Christine Orengo
Journal:  Protein Sci       Date:  2005-06-03       Impact factor: 6.725

10.  Protein superfamily evolution and the last universal common ancestor (LUCA).

Authors:  Juan A G Ranea; Antonio Sillero; Janet M Thornton; Christine A Orengo
Journal:  J Mol Evol       Date:  2006-10-04       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.