Literature DB >> 12520050

The CATH database: an extended protein family resource for structural and functional genomics.

F M G Pearl1, C F Bennett, J E Bray, A P Harrison, N Martin, A Shepherd, I Sillitoe, J Thornton, C A Orengo.   

Abstract

The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath_new) currently contains 34 287 domain structures classified into 1383 superfamilies and 3285 sequence families. Each structural family is expanded with domain sequence relatives recruited from GenBank using a variety of efficient sequence search protocols and reliable thresholds. This extended resource, known as the CATH-protein family database (CATH-PFDB) contains a total of 310 000 domain sequences classified into 26 812 sequence families. New sequence search protocols have been designed, based on these intermediate sequence libraries, to allow more regular updating of the classification. Further developments include the adaptation of a recently developed method for rapid structure comparison, based on secondary structure matching, for domain boundary assignment. The philosophy behind CATHEDRAL is the recognition of recurrent folds already classified in CATH. Benchmarking of CATHEDRAL, using manually validated domain assignments, demonstrated that 43% of domains boundaries could be completely automatically assigned. This is an improvement on a previous consensus approach for which only 10-20% of domains could be reliably processed in a completely automated fashion. Since domain boundary assignment is a significant bottleneck in the classification of new structures, CATHEDRAL will also help to increase the frequency of CATH updates.

Mesh:

Substances:

Year:  2003        PMID: 12520050      PMCID: PMC165509          DOI: 10.1093/nar/gkg062

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  18 in total

1.  Using the CATH domain database to assign structures and functions to the genome sequences.

Authors:  F Pearl; A E Todd; J E Bray; A C Martin; A A Salamov; M Suwa; M B Swindells; J M Thornton; C A Orengo
Journal:  Biochem Soc Trans       Date:  2000-02       Impact factor: 5.407

2.  Protein folds and families: sequence and structure alignments.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

3.  Hidden Markov models for detecting remote protein homologies.

Authors:  K Karplus; C Barrett; R Hughey
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

4.  Domain assignment for protein structures using a consensus approach: characterization and analysis.

Authors:  S Jones; M Stewart; A Michie; M B Swindells; C Orengo; J M Thornton
Journal:  Protein Sci       Date:  1998-02       Impact factor: 6.725

5.  Predicting protein structure using hidden Markov models.

Authors:  K Karplus; K Sjölander; C Barrett; M Cline; D Haussler; R Hughey; L Holm; C Sander
Journal:  Proteins       Date:  1997

6.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.

Authors:  S E Brenner; C Chothia; T J Hubbard
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

7.  CATH--a hierarchic classification of protein domain structures.

Authors:  C A Orengo; A D Michie; S Jones; D T Jones; M B Swindells; J M Thornton
Journal:  Structure       Date:  1997-08-15       Impact factor: 5.006

8.  Use of techniques derived from graph theory to compare secondary structure motifs in proteins.

Authors:  E M Mitchell; P J Artymiuk; D W Rice; P Willett
Journal:  J Mol Biol       Date:  1990-03-05       Impact factor: 5.469

9.  Protein structure alignment.

Authors:  W R Taylor; C A Orengo
Journal:  J Mol Biol       Date:  1989-07-05       Impact factor: 5.469

10.  Parser for protein folding units.

Authors:  L Holm; C Sander
Journal:  Proteins       Date:  1994-07
View more
  75 in total

1.  HOMSTRAD: recent developments of the Homologous Protein Structure Alignment Database.

Authors:  Lucy A Stebbings; Kenji Mizuguchi
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  PIRSF: family classification system at the Protein Information Resource.

Authors:  Cathy H Wu; Anastasia Nikolskaya; Hongzhan Huang; Lai-Su L Yeh; Darren A Natale; C R Vinayaka; Zhang-Zhi Hu; Raja Mazumder; Sandeep Kumar; Panagiotis Kourtesis; Robert S Ledley; Baris E Suzek; Leslie Arminski; Yongxing Chen; Jian Zhang; Jorge Louie Cardenas; Sehee Chung; Jorge Castro-Alvear; Georgi Dinkov; Winona C Barker
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  EyeSite: a semi-automated database of protein families in the eye.

Authors:  David A Lee; Sandrine Fefeu; Adrian A Edo-Ukeh; Christine A Orengo; Christine Slingsby
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  DomIns: a web resource for domain insertions in known protein structures.

Authors:  R Aroul Selvam; Rajkumar Sasidharan
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

5.  Integrated databanks access and sequence/structure analysis services at the PBIL.

Authors:  Guy Perrière; Christophe Combet; Simon Penel; Christophe Blanchet; Jean Thioulouse; Christophe Geourjon; Julien Grassot; Céline Charavay; Manolo Gouy; Laurent Duret; Gilbert Deléage
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  Tools for comparative protein structure modeling and analysis.

Authors:  Narayanan Eswar; Bino John; Nebojsa Mirkovic; Andras Fiser; Valentin A Ilyin; Ursula Pieper; Ashley C Stuart; Marc A Marti-Renom; M S Madhusudhan; Bozidar Yerkovich; Andrej Sali
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

7.  CHOP: parsing proteins into structural domains.

Authors:  Jinfeng Liu; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

8.  Functional classification of proteins and protein variants.

Authors:  Albert Y Lau; Daniel I Chasman
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-15       Impact factor: 11.205

9.  Orientational potentials extracted from protein structures improve native fold recognition.

Authors:  Nicolae-Viorel Buchete; John E Straub; Devarajan Thirumalai
Journal:  Protein Sci       Date:  2004-04       Impact factor: 6.725

10.  LEON: multiple aLignment Evaluation Of Neighbours.

Authors:  Julie D Thompson; Véronique Prigent; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2004-02-24       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.