CATH--a hierarchic classification of protein domain structures.

C A Orengo, A D Michie, S Jones, D T Jones, M B Swindells, J M Thornton.

Abstract

BACKGROUND: Protein evolution gives rise to families of structurally related proteins, within which sequence identities can be extremely low. As a result, structure-based classifications can be effective at identifying unanticipated relationships in known structures and in optimal cases function can also be assigned. The ever-increasing number of known protein structures is too large for all proteins to be classified manually; automatic methods are therefore needed for fast evaluation of protein structures.
RESULTS: We present a semi-automatic procedure for deriving a novel hierarchical classification of protein domain structures (CATH). The four main levels of our classification are protein class (C), architecture (A), topology (T) and homologous superfamily (H). Class is the simplest level, and it essentially describes the secondary structure composition of each domain. In contrast, architecture summarises the shape revealed by the orientations of the secondary structure units, such as barrels and sandwiches. At the topology level, sequential connectivity is considered, such that members of the same architecture might have quite different topologies. When structures belonging to the same T-level have suitably high similarities combined with similar functions, the proteins are assumed to be evolutionarily related and put into the same homologous superfamily.
CONCLUSIONS: Analysis of the structural families generated by CATH reveals the prominent features of protein structure space. We find that nearly a third of the homologous superfamilies (H-levels) belong to ten major T-levels, which we call superfolds, and furthermore that nearly two-thirds of these H-levels cluster into nine simple architectures. A database of well-characterised protein structure families, such as CATH, will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
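The four-level hierarchy described in the abstract (class, architecture, topology, homologous superfamily) can be pictured as a path of nested labels. The following is a minimal Python sketch of that idea only, not the authors' procedure: the names `DomainLabel`, `group_by_level` and `shared_depth` are hypothetical, and the domain identifiers and level labels (`d1`, `C1`, `A1`, ...) are invented placeholders rather than real CATH entries.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainLabel:
    """One domain with its four labels (all values here are placeholders)."""
    domain: str
    protein_class: str            # C: secondary structure composition
    architecture: str             # A: overall shape, e.g. barrel or sandwich
    topology: str                 # T: sequential connectivity
    homologous_superfamily: str   # H: assumed evolutionary relationship

    def path(self, depth):
        """Return the first `depth` levels (1 = C ... 4 = H) as a tuple."""
        full = (self.protein_class, self.architecture,
                self.topology, self.homologous_superfamily)
        return full[:depth]


def group_by_level(labels, depth):
    """Group domain names by their classification path down to `depth`."""
    groups = defaultdict(list)
    for label in labels:
        groups[label.path(depth)].append(label.domain)
    return dict(groups)


def shared_depth(a, b):
    """Count how many levels two domains share, starting from class."""
    depth = 0
    for x, y in zip(a.path(4), b.path(4)):
        if x != y:
            break
        depth += 1
    return depth


# Invented example data: d1 and d2 share a topology but not a superfamily.
labels = [
    DomainLabel("d1", "C1", "A1", "T1", "H1"),
    DomainLabel("d2", "C1", "A1", "T1", "H2"),
    DomainLabel("d3", "C1", "A1", "T2", "H3"),
    DomainLabel("d4", "C2", "A2", "T3", "H4"),
]

by_topology = group_by_level(labels, 3)
```

Grouping at depth 3 collects domains that share a T-level, which is the level at which the abstract counts superfolds. `shared_depth` reports how far down the hierarchy two domains agree.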

Year:  1997        PMID: 9309224     DOI: 10.1016/s0969-2126(97)00260-8

Source DB:  PubMed          Journal:  Structure        ISSN: 0969-2126            Impact factor:   5.006


  707 in total

1.  SCOP: a structural classification of proteins database.

Authors:  L Lo Conte; B Ailey; T J Hubbard; S E Brenner; A G Murzin; C Chothia
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Assigning genomic sequences to CATH.

Authors:  F M Pearl; D Lee; J E Bray; I Sillitoe; A E Todd; A P Harrison; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

4.  Diversity of functions of proteins with internal symmetry in spatial arrangement of secondary structural elements.

Authors:  K Kinoshita; A Kidera; N Go
Journal:  Protein Sci       Date:  1999-06       Impact factor: 6.725

5.  PDB-REPRDB: a database of representative protein chains from the Protein Data Bank (PDB).

Authors:  T Noguchi; H Matsuda; Y Akiyama
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

6.  PDBsum: summaries and analyses of PDB structures.

Authors:  R A Laskowski
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

7.  A rapid classification protocol for the CATH Domain Database to support structural genomics.

Authors:  F M Pearl; N Martin; J E Bray; D W Buchan; A P Harrison; D Lee; G A Reeves; A J Shepherd; I Sillitoe; A E Todd; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

8.  PALI-a database of Phylogeny and ALIgnment of homologous protein structures.

Authors:  S Balaji; S Sujatha; S S Kumar; N Srinivasan
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

9.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

10.  The Hans Neurath Award lecture of The Protein Society: proteins-- a testament to physics, chemistry, and evolution.

Authors:  J M Thornton
Journal:  Protein Sci       Date:  2001-01       Impact factor: 6.725
