Literature DB >> 1480619

Protein secondary structure prediction using logic-based machine learning.

S Muggleton1, R D King, M J Sternberg.   

Abstract

Many attempts have been made to solve the problem of predicting protein secondary structure from the primary sequence but the best performance results are still disappointing. In this paper, the use of a machine learning algorithm which allows relational descriptions is shown to lead to improved performance. The Inductive Logic Programming computer program, Golem, was applied to learning secondary structure prediction rules for alpha/alpha domain type proteins. The input to the program consisted of 12 non-homologous proteins (1612 residues) of known structure, together with a background knowledge describing the chemical and physical properties of the residues. Golem learned a small set of rules that predict which residues are part of the alpha-helices--based on their positional relationships and chemical and physical properties. The rules were tested on four independent non-homologous proteins (416 residues) giving an accuracy of 81% (+/- 2%). This is an improvement, on identical data, over the previously reported result of 73% by King and Sternberg (1990, J. Mol. Biol., 216, 441-457) using the machine learning program PROMIS, and of 72% using the standard Garnier-Osguthorpe-Robson method. The best previously reported result in the literature for the alpha/alpha domain type is 76%, achieved using a neural net approach. Machine learning also has the advantage over neural network and statistical methods in producing more understandable results.

Mesh:

Year:  1992        PMID: 1480619     DOI: 10.1093/protein/5.7.647

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  16 in total

1.  Cascaded multiple classifiers for secondary structure prediction.

Authors:  M Ouali; R D King
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

2.  Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.

Authors:  R D King; M J Sternberg
Journal:  Protein Sci       Date:  1996-11       Impact factor: 6.725

3.  A simple and fast approach to prediction of protein secondary structure from multiply aligned sequences with accuracy above 70%.

Authors:  P K Mehta; J Heringa; P Argos
Journal:  Protein Sci       Date:  1995-12       Impact factor: 6.725

4.  Improved prediction of protein secondary structure by use of sequence profiles and neural networks.

Authors:  B Rost; C Sander
Journal:  Proc Natl Acad Sci U S A       Date:  1993-08-15       Impact factor: 11.205

5.  Learning Qualitative Differential Equation models: a survey of algorithms and applications.

Authors:  Wei Pang; George M Coghill
Journal:  Knowl Eng Rev       Date:  2010-03       Impact factor: 1.115

6.  Homology induction: the use of machine learning to improve sequence similarity searches.

Authors:  Andreas Karwath; Ross D King
Journal:  BMC Bioinformatics       Date:  2002-04-23       Impact factor: 3.169

7.  PredSTP: a highly accurate SVM based model to predict sequential cystine stabilized peptides.

Authors:  S M Ashiqul Islam; Tanvir Sajed; Christopher Michel Kearney; Erich J Baker
Journal:  BMC Bioinformatics       Date:  2015-07-05       Impact factor: 3.169

8.  Learning a Markov Logic network for supervised gene regulatory network inference.

Authors:  Céline Brouard; Christel Vrain; Julie Dubois; David Castel; Marie-Anne Debily; Florence d'Alché-Buc
Journal:  BMC Bioinformatics       Date:  2013-09-12       Impact factor: 3.169

9.  Classifying kinase conformations using a machine learning approach.

Authors:  Daniel Ian McSkimming; Khaled Rasheed; Natarajan Kannan
Journal:  BMC Bioinformatics       Date:  2017-02-02       Impact factor: 3.169

10.  Knowledge discovery in variant databases using inductive logic programming.

Authors:  Hoan Nguyen; Tien-Dao Luu; Olivier Poch; Julie D Thompson
Journal:  Bioinform Biol Insights       Date:  2013-03-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.