Literature DB >> 8356056

Improved prediction of protein secondary structure by use of sequence profiles and neural networks.

B Rost1, C Sander.   

Abstract

The explosive accumulation of protein sequences in the wake of large-scale sequencing projects is in stark contrast to the much slower experimental determination of protein structures. Improved methods of structure prediction from the gene sequence alone are therefore needed. Here, we report a substantial increase in both the accuracy and quality of secondary-structure predictions, using a neural-network algorithm. The main improvements come from the use of multiple sequence alignments (better overall accuracy), from "balanced training" (better prediction of beta-strands), and from "structure context training" (better prediction of helix and strand lengths). This method, cross-validated on seven different test sets purged of sequence similarity to learning sets, achieves a three-state prediction accuracy of 69.7%, significantly better than previous methods. In addition, the predicted structures have a more realistic distribution of helix and strand segments. The predictions may be suitable for use in practice as a first estimate of the structural type of newly sequenced proteins.

Mesh:

Substances:

Year:  1993        PMID: 8356056      PMCID: PMC47181          DOI: 10.1073/pnas.90.16.7558

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  26 in total

1.  Improvements in the prediction of protein backbone topography by reduction of statistical errors.

Authors:  F R Maxfield; H A Scheraga
Journal:  Biochemistry       Date:  1979-02-20       Impact factor: 3.162

2.  A database of protein structure families with common folding motifs.

Authors:  L Holm; C Ouzounis; C Sander; G Tuparev; G Vriend
Journal:  Protein Sci       Date:  1992-12       Impact factor: 6.725

3.  Predicting the secondary structure of globular proteins using neural network models.

Authors:  N Qian; T J Sejnowski
Journal:  J Mol Biol       Date:  1988-08-20       Impact factor: 5.469

4.  Prediction of protein secondary structure and active sites using the alignment of homologous sequences.

Authors:  M J Zvelebil; G J Barton; W R Taylor; M J Sternberg
Journal:  J Mol Biol       Date:  1987-06-20       Impact factor: 5.469

5.  Improvements in a secondary structure prediction method based on a search for local sequence homologies and its use as a model building tool.

Authors:  J M Levin; J Garnier
Journal:  Biochim Biophys Acta       Date:  1988-08-10

6.  Protein secondary structure and homology by neural networks. The alpha-helices in rhodopsin.

Authors:  H Bohr; J Bohr; S Brunak; R M Cotterill; B Lautrup; L Nørskov; O H Olsen; S B Petersen
Journal:  FEBS Lett       Date:  1988-12-05       Impact factor: 4.124

7.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

8.  Theory of protein secondary structure and algorithm of its prediction.

Authors:  O B Ptitsyn; A V Finkelstein
Journal:  Biopolymers       Date:  1983-01       Impact factor: 2.505

Review 9.  The anatomy and taxonomy of protein structure.

Authors:  J S Richardson
Journal:  Adv Protein Chem       Date:  1981

10.  The relation between the divergence of sequence and structure in proteins.

Authors:  C Chothia; A M Lesk
Journal:  EMBO J       Date:  1986-04       Impact factor: 11.598

View more
  134 in total

1.  The zinc finger-associated SCAN box is a conserved oligomerization domain.

Authors:  A J Williams; S C Blacklow; T Collins
Journal:  Mol Cell Biol       Date:  1999-12       Impact factor: 4.272

2.  CS5 pilus biosynthesis genes from enterotoxigenic Escherichia coli O115:H40.

Authors:  T G Duthy; L H Staendner; P A Manning; M W Heuzenroeder
Journal:  J Bacteriol       Date:  1999-09       Impact factor: 3.490

3.  The active form of the steroidogenic acute regulatory protein, StAR, appears to be a molten globule.

Authors:  H S Bose; R M Whittal; M A Baldwin; W L Miller
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-22       Impact factor: 11.205

4.  Glutamine synthetase inactivation by protein-protein interaction.

Authors:  M García-Domínguez; J C Reyes; F J Florencio
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-22       Impact factor: 11.205

5.  Poly(A)-binding protein I of Leishmania: functional analysis and localisation in trypanosomatid parasites.

Authors:  E J Bates; E Knuepfer; D F Smith
Journal:  Nucleic Acids Res       Date:  2000-03-01       Impact factor: 16.971

6.  A thermostable platform for transcriptional regulation: the DNA-binding properties of two Lrp homologs from the hyperthermophilic archaeon Methanococcus jannaschii.

Authors:  M Ouhammouch; E P Geiduschek
Journal:  EMBO J       Date:  2001-01-15       Impact factor: 11.598

7.  TOUCHSTONE: an ab initio protein structure prediction method that uses threading-based tertiary restraints.

Authors:  D Kihara; H Lu; A Kolinski; J Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

8.  A sequence and structural study of transmembrane helices.

Authors:  R P Bywater; D Thomas; G Vriend
Journal:  J Comput Aided Mol Des       Date:  2001-06       Impact factor: 3.686

9.  Characterization of Ad5 E3-14.7K, an adenoviral inhibitor of apoptosis: structure, oligomeric state, and metal binding.

Authors:  Hee-Jung Kim; Mark P Foster
Journal:  Protein Sci       Date:  2002-05       Impact factor: 6.725

10.  Novel folded protein domains generated by combinatorial shuffling of polypeptide segments.

Authors:  L Riechmann; G Winter
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.