Literature DB >> 1613789

Hybrid system for protein secondary structure prediction.

X Zhang1, J P Mesirov, D L Waltz.   

Abstract

We have developed a hybrid system to predict the secondary structures (alpha-helix, beta-sheet and coil) of proteins and achieved 66.4% accuracy, with correlation coefficients of C(coil) = 0.429, C alpha = 0.470 and C beta = 0.387. This system contains three subsystems ("experts"): a neural network module, a statistical module and a memory-based reasoning module. First, the three experts independently learn the mapping between amino acid sequences and secondary structures from the known protein structures, then a Combiner learns to combine automatically the outputs of the experts to make final predictions. The hybrid system was tested with 107 protein structures through k-way cross-validation. Its performance was better than each expert and all previously reported methods with greater than 0.99 statistical significance. It was observed that for 20% of the residues, all three experts produced the same but wrong predictions. This may suggest an upper bound on the accuracy of secondary structure predictions based on local information from the currently available protein structures, and indicate places where non-local interactions may play a dominant role in conformation. For 64% of the residues, at least two experts were the same and correct, which shows that the Combiner performed better than majority vote. For 77% of the residues, at least one expert was correct, thus there may still be room for improvement in this hybrid approach. Rigorous evaluation procedures were used in testing the hybrid system, and statistical significance measures were developed in analyzing the differences among different methods. When measured in terms of the number of secondary structures (rather than the number of residues) that were predicted correctly, the prediction produced by the hybrid system was also better than those of individual experts.

Mesh:

Substances:

Year:  1992        PMID: 1613789     DOI: 10.1016/0022-2836(92)90104-r

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  17 in total

1.  Protein energetic conformational analysis from NMR chemical shifts (PECAN) and its use in determining secondary structural elements.

Authors:  Hamid R Eghbalnia; Liya Wang; Arash Bahrami; Amir Assadi; John L Markley
Journal:  J Biomol NMR       Date:  2005-05       Impact factor: 2.835

2.  The effects of the codon usage and translation speed on protein folding of 3D(pol) of foot-and-mouth disease virus.

Authors:  Xiao-Xia Ma; Yu-Ping Feng; Jun-Lin Liu; Bing Ma; Li Chen; Yong-Qing Zhao; Peng-Hui Guo; Jun-Zhen Guo; Zhong-Ren Ma; Jie Zhang
Journal:  Vet Res Commun       Date:  2013-05-29       Impact factor: 2.459

3.  The importance of larger data sets for protein secondary structure prediction with neural networks.

Authors:  J M Chandonia; M Karplus
Journal:  Protein Sci       Date:  1996-04       Impact factor: 6.725

4.  Improving protein secondary structure prediction with aligned homologous sequences.

Authors:  V Di Francesco; J Garnier; P J Munson
Journal:  Protein Sci       Date:  1996-01       Impact factor: 6.725

5.  Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.

Authors:  R D King; M J Sternberg
Journal:  Protein Sci       Date:  1996-11       Impact factor: 6.725

6.  Predicting protein secondary structure with probabilistic schemata of evolutionarily derived information.

Authors:  M J Thompson; R A Goldstein
Journal:  Protein Sci       Date:  1997-09       Impact factor: 6.725

7.  Analysis of the relation between the sequence and secondary and three-dimensional structures of immunoglobulin molecules.

Authors:  I M Gelfand; A E Kister
Journal:  Proc Natl Acad Sci U S A       Date:  1995-11-21       Impact factor: 11.205

8.  A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%.

Authors:  P K Mehta; J Heringa; P Argos
Journal:  Protein Sci       Date:  1995-12       Impact factor: 6.725

9.  A preference-based free-energy parameterization of enzyme-inhibitor binding. Applications to HIV-1-protease inhibitor design.

Authors:  A Wallqvist; R L Jernigan; D G Covell
Journal:  Protein Sci       Date:  1995-09       Impact factor: 6.725

10.  Machine learning integration for predicting the effect of single amino acid substitutions on protein stability.

Authors:  Ayşegül Ozen; Mehmet Gönen; Ethem Alpaydan; Türkan Haliloğlu
Journal:  BMC Struct Biol       Date:  2009-10-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.