Domain assignment for protein structures using a consensus approach: characterization and analysis.

S Jones, M Stewart, A Michie, M B Swindells, C Orengo, J M Thornton.

Abstract

A consensus approach for the assignment of structural domains in proteins is presented. The approach combines a number of previously published algorithms, and takes advantage of the elevated accuracy obtained when assignments from the individual algorithms are in agreement. The consensus approach is tested on a data set of 55 protein chains, for which domain assignments from four automated methods were known, and for which crystallographers' assignments had been reported in the literature. Accuracy was found to increase in this test from 72% using individual algorithms to 100% when all four methods were in agreement. However, a consensus prediction using all four methods was only possible for 52% of the data set. The consensus approach [using three publicly available domain assignment algorithms (PUU, DETECTIVE, DOMAK)] was then used to make domain assignments for a data set of 787 protein chains from the Protein Data Bank. Analysis of the assignments showed that 55.7% of assignments could be made automatically, and of these, 13.5% were multi-domain proteins. Of the remaining 44.3% that could not be assigned by the consensus procedure, 90.4% had their domain boundaries assigned correctly by at least one of the algorithms. Once identified, these domains were analyzed for trends in their size and secondary structure class. In addition, the discontinuity of each domain along the protein chain was considered.
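The idea of accepting an assignment only when the individual algorithms agree can be illustrated with a minimal Python sketch. This is not the authors' procedure: the abstract does not state the agreement criterion, so the function name `consensus_boundaries`, the representation of an assignment as a list of boundary residue numbers, and the `tolerance` of 10 residues are all assumptions made for illustration. Discontinuous domains are not handled here.

```python
from typing import Dict, List, Optional


def consensus_boundaries(
    assignments: Dict[str, List[int]],
    tolerance: int = 10,  # assumed agreement window, in residues
) -> Optional[List[int]]:
    """Return consensus domain boundaries, or None if the methods disagree.

    `assignments` maps an algorithm name to the residue positions at which
    it places domain boundaries (an empty list means a single domain).
    """
    boundary_lists = [sorted(b) for b in assignments.values()]
    if not boundary_lists:
        return None

    # All methods must predict the same number of domains.
    expected = len(boundary_lists[0])
    if any(len(b) != expected for b in boundary_lists):
        return None

    consensus = []
    # Compare the i-th boundary of every method.
    for positions in zip(*boundary_lists):
        if max(positions) - min(positions) > tolerance:
            return None  # boundaries too far apart: no consensus
        consensus.append(round(sum(positions) / len(positions)))
    return consensus


# Hypothetical boundary positions, not data from the paper.
example = {"PUU": [120], "DETECTIVE": [125], "DOMAK": [118]}
print(consensus_boundaries(example))
```

A chain for which the function returns `None` would fall into the group that cannot be assigned automatically and needs manual inspection.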

Year:  1998        PMID: 9521098      PMCID: PMC2143930          DOI: 10.1002/pro.5560070202

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  45 in total

1.  Assigning genomic sequences to CATH.

Authors:  F M Pearl; D Lee; J E Bray; I Sillitoe; A E Todd; A P Harrison; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A rapid classification protocol for the CATH Domain Database to support structural genomics.

Authors:  F M Pearl; N Martin; J E Bray; D W Buchan; A P Harrison; D Lee; G A Reeves; A J Shepherd; I Sillitoe; A E Todd; J M Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  Distinguishing between sequential and nonsequentially folded proteins: implications for folding and misfolding.

Authors:  C J Tsai; J V Maizel; R Nussinov
Journal:  Protein Sci       Date:  1999-08       Impact factor: 6.725

4.  Genome analysis: Assigning protein coding regions to three-dimensional structures.

Authors:  A A Salamov; M Suwa; C A Orengo; M B Swindells
Journal:  Protein Sci       Date:  1999-04       Impact factor: 6.725

5.  The CATH extended protein-family database: providing structural annotations for genome sequences.

Authors:  Frances M G Pearl; David Lee; James E Bray; Daniel W A Buchan; Adrian J Shepherd; Christine A Orengo
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

6.  Classification of protein folds. (Review)

Authors:  Robert B Russell
Journal:  Mol Biotechnol       Date:  2002-01       Impact factor: 2.695

7.  Low free energy cost of very long loop insertions in proteins.

Authors:  Michelle Scalley-Kim; Philippe Minard; David Baker
Journal:  Protein Sci       Date:  2003-02       Impact factor: 6.725

8.  The CATH database: an extended protein family resource for structural and functional genomics.

Authors:  F M G Pearl; C F Bennett; J E Bray; A P Harrison; N Martin; A Shepherd; I Sillitoe; J Thornton; C A Orengo
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  Rapid protein domain assignment from amino acid sequence using predicted secondary structure.

Authors:  Russell L Marsden; Liam J McGuffin; David T Jones
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

10.  Improving the performance of DomainParser for structural domain partition using neural network.

Authors:  Jun-tao Guo; Dong Xu; Dongsup Kim; Ying Xu
Journal:  Nucleic Acids Res       Date:  2003-02-01       Impact factor: 16.971
