Literature DB >> 11327775

A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach.

S Hua1, Z Sun.   

Abstract

We have introduced a new method of protein secondary structure prediction which is based on the theory of support vector machine (SVM). SVM represents a new approach to supervised pattern classification which has been successfully applied to a wide range of pattern recognition problems, including object recognition, speaker identification, gene function prediction with microarray expression profile, etc. In these cases, the performance of SVM either matches or is significantly better than that of traditional machine learning approaches, including neural networks.The first use of the SVM approach to predict protein secondary structure is described here. Unlike the previous studies, we first constructed several binary classifiers, then assembled a tertiary classifier for three secondary structure states (helix, sheet and coil) based on these binary classifiers. The SVM method achieved a good performance of segment overlap accuracy SOV=76.2 % through sevenfold cross validation on a database of 513 non-homologous protein chains with multiple sequence alignments, which out-performs existing methods. Meanwhile three-state overall per-residue accuracy Q(3) achieved 73.5 %, which is at least comparable to existing single prediction methods. Furthermore a useful "reliability index" for the predictions was developed. In addition, SVM has many attractive features, including effective avoidance of overfitting, the ability to handle large feature spaces, information condensing of the given data set, etc. The SVM method is conveniently applied to many other pattern classification tasks in biology. Copyright 2001 Academic Press.

Mesh:

Substances:

Year:  2001        PMID: 11327775     DOI: 10.1006/jmbi.2001.4580

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  78 in total

1.  Support vector machines for predicting membrane protein types by using functional domain composition.

Authors:  Yu-Dong Cai; Guo-Ping Zhou; Kuo-Chen Chou
Journal:  Biophys J       Date:  2003-05       Impact factor: 4.033

2.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Authors:  Lian Yi Han; Cong Zhong Cai; Siew Lin Lo; Maxey C M Chung; Yu Zong Chen
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

4.  HYPROSP: a hybrid protein secondary structure prediction algorithm--a knowledge-based approach.

Authors:  Kuen-Pin Wu; Hsin-Nan Lin; Jia-Ming Chang; Ting-Yi Sung; Wen-Lian Hsu
Journal:  Nucleic Acids Res       Date:  2004-09-24       Impact factor: 16.971

5.  GOR V server for protein secondary structure prediction.

Authors:  Taner Z Sen; Robert L Jernigan; Jean Garnier; Andrzej Kloczkowski
Journal:  Bioinformatics       Date:  2005-03-29       Impact factor: 6.937

6.  The effect of long-range interactions on the secondary structure formation of proteins.

Authors:  Daisuke Kihara
Journal:  Protein Sci       Date:  2005-06-29       Impact factor: 6.725

7.  The implications of higher (or lower) success in secondary structure prediction of chain fragments.

Authors:  Chung-Jung Tsai; Ruth Nussinov
Journal:  Protein Sci       Date:  2005-08       Impact factor: 6.725

8.  Evaluation of features for catalytic residue prediction in novel folds.

Authors:  Eunseog Youn; Brandon Peters; Predrag Radivojac; Sean D Mooney
Journal:  Protein Sci       Date:  2006-12-22       Impact factor: 6.725

9.  A Consensus Data Mining secondary structure prediction by combining GOR V and Fragment Database Mining.

Authors:  Taner Z Sen; Haitao Cheng; Andrzej Kloczkowski; Robert L Jernigan
Journal:  Protein Sci       Date:  2006-09-25       Impact factor: 6.725

10.  Learning biophysically-motivated parameters for alpha helix prediction.

Authors:  Blaise Gassend; Charles W O'Donnell; William Thies; Andrew Lee; Marten van Dijk; Srinivas Devadas
Journal:  BMC Bioinformatics       Date:  2007-05-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.