Literature DB >> 1593625

Predicting protein secondary structure using neural net and statistical methods.

P Stolorz1, A Lapedes, Y Xia.   

Abstract

A comparison of neural network methods and Bayesian statistical methods is presented for prediction of the secondary structure of proteins given their primary sequence. The Bayesian method makes the unphysical assumption that the probability of an amino acid occurring in each position in the protein is independent of the amino acids occurring elsewhere. However, we find the predictive accuracy of the Bayesian method to be only minimally less than the accuracy of the most sophisticated methods used to date. We present the relationship of neural network methods to Bayesian statistical methods and show that, in principle, neural methods offer considerable power, although apparently they are not particularly useful for this problem. In the process, we derive a neural formalism in which the output neurons directly represent the conditional probabilities of structure class. The probabilistic formalism allows introduction of a new objective function, the mutual information, which translates the notion of correlation as a measure of predictive accuracy into a useful training measure. Although a similar accuracy to other approaches (utilizing a mean-square error) is achieved using this new measure, the accuracy on the training set is significantly and tantalizingly higher, even though the number of adjustable parameters remains the same. The mutual information measure predicts a greater fraction of helix and sheet structures correctly than the mean-square error measure, at the expense of coil accuracy, precisely as it was designed to do. By combining the two objective functions, we obtain a marginally improved accuracy of 64.4%, with Matthews coefficients C alpha, C beta and Ccoil of 0.40, 0.32 and 0.42, respectively. However, since all methods to date perform only slightly better than the Bayes algorithm, which entails the drastic assumption of independence of amino acids, one is forced to conclude that little progress has been made on this problem, despite the application of a variety of sophisticated algorithms such as neural networks, and that further advances will require a better understanding of the relevant biophysics.

Mesh:

Substances:

Year:  1992        PMID: 1593625     DOI: 10.1016/0022-2836(92)90927-c

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  15 in total

1.  Environmental features are important in determining protein secondary structure.

Authors:  J R Macdonald; W C Johnson
Journal:  Protein Sci       Date:  2001-06       Impact factor: 6.725

2.  PROSHIFT: protein chemical shift prediction using artificial neural networks.

Authors:  Jens Meiler
Journal:  J Biomol NMR       Date:  2003-05       Impact factor: 2.835

3.  Fuzzy cluster analysis of simple physicochemical properties of amino acids for recognizing secondary structure in proteins.

Authors:  G Mocz
Journal:  Protein Sci       Date:  1995-06       Impact factor: 6.725

4.  Improving protein secondary structure prediction with aligned homologous sequences.

Authors:  V Di Francesco; J Garnier; P J Munson
Journal:  Protein Sci       Date:  1996-01       Impact factor: 6.725

5.  Predicting protein secondary structure with probabilistic schemata of evolutionarily derived information.

Authors:  M J Thompson; R A Goldstein
Journal:  Protein Sci       Date:  1997-09       Impact factor: 6.725

6.  Rearranging the domains of pepsinogen.

Authors:  X Lin; G Koelsch; J A Loy; J Tang
Journal:  Protein Sci       Date:  1995-02       Impact factor: 6.725

7.  Neural networks for secondary structure and structural class predictions.

Authors:  J M Chandonia; M Karplus
Journal:  Protein Sci       Date:  1995-02       Impact factor: 6.725

8.  A preference-based free-energy parameterization of enzyme-inhibitor binding. Applications to HIV-1-protease inhibitor design.

Authors:  A Wallqvist; R L Jernigan; D G Covell
Journal:  Protein Sci       Date:  1995-09       Impact factor: 6.725

9.  Predicting secondary structures of membrane proteins with neural networks.

Authors:  P Fariselli; M Compiani; R Casadio
Journal:  Eur Biophys J       Date:  1993       Impact factor: 1.733

10.  Development of simple fitness landscapes for peptides by artificial neural filter systems.

Authors:  G Schneider; J Schuchhardt; P Wrede
Journal:  Biol Cybern       Date:  1995-08       Impact factor: 2.086

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.