Literature DB >> 10081963

Evaluation and improvement of multiple sequence methods for protein secondary structure prediction.

J A Cuff1, G J Barton.   

Abstract

A new dataset of 396 protein domains is developed and used to evaluate the performance of the protein secondary structure prediction algorithms DSC, PHD, NNSSP, and PREDATOR. The maximum theoretical Q3 accuracy for combination of these methods is shown to be 78%. A simple consensus prediction on the 396 domains, with automatically generated multiple sequence alignments gives an average Q3 prediction accuracy of 72.9%. This is a 1% improvement over PHD, which was the best single method evaluated. Segment Overlap Accuracy (SOV) is 75.4% for the consensus method on the 396-protein set. The secondary structure definition method DSSP defines 8 states, but these are reduced by most authors to 3 for prediction. Application of the different published 8- to 3-state reduction methods shows variation of over 3% on apparent prediction accuracy. This suggests that care should be taken to compare methods by the same reduction method. Two new sequence datasets (CB513 and CB251) are derived which are suitable for cross-validation of secondary structure prediction methods without artifacts due to internal homology. A fully automatic World Wide Web service that predicts protein secondary structure by a combination of methods is available via http://barton.ebi.ac.uk/.

Entities:  

Mesh:

Year:  1999        PMID: 10081963     DOI: 10.1002/(sici)1097-0134(19990301)34:4<508::aid-prot10>3.0.co;2-4

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  132 in total

1.  Environment-dependent residue contact energies for proteins.

Authors:  C Zhang; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2000-03-14       Impact factor: 11.205

2.  HOBACGEN: database system for comparative genomics in bacteria.

Authors:  G Perrière; L Duret; M Gouy
Journal:  Genome Res       Date:  2000-03       Impact factor: 9.043

3.  A highly conserved domain of the maize activator transposase is involved in dimerization.

Authors:  L Essers; R H Adolphs; R Kunze
Journal:  Plant Cell       Date:  2000-02       Impact factor: 11.277

4.  Cascaded multiple classifiers for secondary structure prediction.

Authors:  M Ouali; R D King
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

5.  Probability-based protein secondary structure identification using combined NMR chemical-shift data.

Authors:  Yunjun Wang; Oleg Jardetzky
Journal:  Protein Sci       Date:  2002-04       Impact factor: 6.725

6.  Environmental features are important in determining protein secondary structure.

Authors:  J R Macdonald; W C Johnson
Journal:  Protein Sci       Date:  2001-06       Impact factor: 6.725

7.  Environmentally induced reversible conformational switching in the yeast cell adhesion protein alpha-agglutinin.

Authors:  H Zhao; M H Chen; Z M Shen; P C Kahn; P N Lipke
Journal:  Protein Sci       Date:  2001-06       Impact factor: 6.725

8.  The crystal structure of the C-terminal fragment of striated-muscle alpha-tropomyosin reveals a key troponin T recognition site.

Authors:  Yu Li; Suet Mui; Jerry H Brown; James Strand; Ludmilla Reshetnikova; Larry S Tobacman; Carolyn Cohen
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-28       Impact factor: 11.205

9.  The transcriptional switch of bacteriophage WPhi, a P2-related but heteroimmune coliphage.

Authors:  T Liu; E Haggård-Ljungquist
Journal:  J Virol       Date:  1999-12       Impact factor: 5.103

10.  The membrane trafficking protein calpactin forms a complex with bluetongue virus protein NS3 and mediates virus release.

Authors:  Andrew R Beaton; Javier Rodriguez; Y Krishnamohan Reddy; Polly Roy
Journal:  Proc Natl Acad Sci U S A       Date:  2002-09-16       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.