Literature DB >> 10592246

Assigning genomic sequences to CATH.

F M Pearl1, D Lee, J E Bray, I Sillitoe, A E Todd, A P Harrison, J M Thornton, C A Orengo.   

Abstract

We report the latest release (version 1.6) of the CATH protein domains database (http://www.biochem.ucl. ac.uk/bsm/cath ). This is a hierarchical classification of 18 577 domains into evolutionary families and structural groupings. We have identified 1028 homo-logous superfamilies in which the proteins have both structural, and sequence or functional similarity. These can be further clustered into 672 fold groups and 35 distinct architectures. Recent developments of the database include the generation of 3D templates for recognising structural relatives in each fold group, which has led to significant improvements in the speed and accuracy of updating the database and also means that less manual validation is required. We also report the establishment of the CATH-PFDB (Protein Family Database), which associates 1D sequences with the 3D homologous superfamilies. Sequences showing identifiable homology to entries in CATH have been extracted from GenBank using PSI-BLAST. A CATH-PSIBLAST server has been established, which allows you to scan a new sequence against the database. The CATH Dictionary of Homologous Superfamilies (DHS), which contains validated multiple structural alignments annotated with consensus functional information for evolutionary protein superfamilies, has been updated to include annotations associated with sequence relatives identified in GenBank. The DHS is a powerful tool for considering the variation of functional properties within a given CATH superfamily and in deciding what functional properties may be reliably inherited by a newly identified relative.

Mesh:

Substances:

Year:  2000        PMID: 10592246      PMCID: PMC102424          DOI: 10.1093/nar/28.1.277

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  19 in total

Review 1.  Evolution of protein function, from a structural perspective.

Authors:  A E Todd; C A Orengo; J M Thornton
Journal:  Curr Opin Chem Biol       Date:  1999-10       Impact factor: 8.822

2.  CORA--topological fingerprints for protein structural families.

Authors:  C A Orengo
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

3.  Proteins. One thousand families for the molecular biologist.

Authors:  C Chothia
Journal:  Nature       Date:  1992-06-18       Impact factor: 49.962

4.  Raster3D: photorealistic molecular graphics.

Authors:  E A Merritt; D J Bacon
Journal:  Methods Enzymol       Date:  1997       Impact factor: 1.600

5.  The Protein Data Bank: a computer-based archival file for macromolecular structures.

Authors:  F C Bernstein; T F Koetzle; G J Williams; E F Meyer; M D Brice; J R Rodgers; O Kennard; T Shimanouchi; M Tasumi
Journal:  J Mol Biol       Date:  1977-05-25       Impact factor: 5.469

6.  Protein structure alignment.

Authors:  W R Taylor; C A Orengo
Journal:  J Mol Biol       Date:  1989-07-05       Impact factor: 5.469

7.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

8.  A procedure for detecting structural domains in proteins.

Authors:  M B Swindells
Journal:  Protein Sci       Date:  1995-01       Impact factor: 6.725

9.  OWL--a non-redundant composite protein sequence database.

Authors:  A J Bleasby; D Akrigg; T K Attwood
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

10.  Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility.

Authors:  R B Russell; G J Barton
Journal:  J Mol Biol       Date:  1994-12-02       Impact factor: 5.469

View more
  39 in total

1.  PDBsum: summaries and analyses of PDB structures.

Authors:  R A Laskowski
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  The CATH extended protein-family database: providing structural annotations for genome sequences.

Authors:  Frances M G Pearl; David Lee; James E Bray; Daniel W A Buchan; Adrian J Shepherd; Christine A Orengo
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

3.  Streptococcus pneumonia YlxR at 1.35 A shows a putative new fold.

Authors:  J Osipiuk; P Górnicki; L Maj; I Dementieva; R Laskowski; A Joachimiak
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2001-10-25

4.  An NMR approach to structural proteomics.

Authors:  Adelinda Yee; Xiaoqing Chang; Antonio Pineda-Lucena; Bin Wu; Anthony Semesi; Brian Le; Theresa Ramelot; Gregory M Lee; Sudeepa Bhattacharyya; Pablo Gutierrez; Aleksej Denisov; Chang-Hun Lee; John R Cort; Guennadi Kozlov; Jack Liao; Grzegorz Finak; Limin Chen; David Wishart; Weontae Lee; Lawrence P McIntosh; Kalle Gehring; Michael A Kennedy; Aled M Edwards; Cheryl H Arrowsmith
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-19       Impact factor: 11.205

5.  Construction and characterization of protein libraries composed of secondary structure modules.

Authors:  Tomoaki Matsuura; Andreas Ernst; Andreas Plückthun
Journal:  Protein Sci       Date:  2002-11       Impact factor: 6.725

6.  EyeSite: a semi-automated database of protein families in the eye.

Authors:  David A Lee; Sandrine Fefeu; Adrian A Edo-Ukeh; Christine A Orengo; Christine Slingsby
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

7.  Structural basis for recruitment of the ATPase activator Aha1 to the Hsp90 chaperone machinery.

Authors:  Philippe Meyer; Chrisostomos Prodromou; Chunyan Liao; Bin Hu; S Mark Roe; Cara K Vaughan; Ignacija Vlasic; Barry Panaretou; Peter W Piper; Laurence H Pearl
Journal:  EMBO J       Date:  2004-01-22       Impact factor: 11.598

8.  Toward predicting protein topology: an approach to identifying beta hairpins.

Authors:  Xavier de la Cruz; E Gail Hutchinson; Adrian Shepherd; Janet M Thornton
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-12       Impact factor: 11.205

9.  Orientational potentials extracted from protein structures improve native fold recognition.

Authors:  Nicolae-Viorel Buchete; John E Straub; Devarajan Thirumalai
Journal:  Protein Sci       Date:  2004-04       Impact factor: 6.725

10.  PreSPI: a domain combination based prediction system for protein-protein interaction.

Authors:  Dong-Soo Han; Hong-Soog Kim; Woo-Hyuk Jang; Sung-Doke Lee; Jung-Keun Suh
Journal:  Nucleic Acids Res       Date:  2004-12-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.