Literature DB >> 12112679

Prediction of protein solvent accessibility using support vector machines.

Zheng Yuan1, Kevin Burrage, John S Mattick.   

Abstract

A Support Vector Machine learning system has been trained to predict protein solvent accessibility from the primary structure. Different kernel functions and sliding window sizes have been explored to find how they affect the prediction performance. Using a cut-off threshold of 15% that splits the dataset evenly (an equal number of exposed and buried residues), this method was able to achieve a prediction accuracy of 70.1% for single sequence input and 73.9% for multiple alignment sequence input, respectively. The prediction of three and more states of solvent accessibility was also studied and compared with other methods. The prediction accuracies are better than, or comparable to, those obtained by other methods such as neural networks, Bayesian classification, multiple linear regression, and information theory. In addition, our results further suggest that this system may be combined with other prediction methods to achieve more reliable results, and that the Support Vector Machine method is a very useful tool for biological sequence analysis. Copyright 2002 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2002        PMID: 12112679     DOI: 10.1002/prot.10176

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  33 in total

1.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Authors:  Lian Yi Han; Cong Zhong Cai; Siew Lin Lo; Maxey C M Chung; Yu Zong Chen
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

3.  pH Dependence of Charge Multipole Moments in Proteins.

Authors:  Anže Lošdorfer Božič; Rudolf Podgornik
Journal:  Biophys J       Date:  2017-10-03       Impact factor: 4.033

4.  Computational identification of potential molecular interactions in Arabidopsis.

Authors:  Mingzhi Lin; Bin Hu; Lijuan Chen; Peng Sun; Yi Fan; Ping Wu; Xin Chen
Journal:  Plant Physiol       Date:  2009-07-10       Impact factor: 8.340

5.  Identification of protein functions using a machine-learning approach based on sequence-derived properties.

Authors:  Bum Ju Lee; Moon Sun Shin; Young Joon Oh; Hae Seok Oh; Keun Ho Ryu
Journal:  Proteome Sci       Date:  2009-08-09       Impact factor: 2.480

6.  Identification of type 2 diabetes-associated combination of SNPs using support vector machine.

Authors:  Hyo-Jeong Ban; Jee Yeon Heo; Kyung-Soo Oh; Keun-Joon Park
Journal:  BMC Genet       Date:  2010-04-23       Impact factor: 2.797

7.  Real value prediction of protein solvent accessibility using enhanced PSSM features.

Authors:  Darby Tien-Hao Chang; Hsuan-Yu Huang; Yu-Tang Syu; Chih-Peng Wu
Journal:  BMC Bioinformatics       Date:  2008-12-12       Impact factor: 3.169

8.  Analysis of accessible surface of residues in proteins.

Authors:  Laurence Lins; Annick Thomas; Robert Brasseur
Journal:  Protein Sci       Date:  2003-07       Impact factor: 6.725

9.  Beta edge strands in protein structure prediction and aggregation.

Authors:  Jennifer A Siepen; Sheena E Radford; David R Westhead
Journal:  Protein Sci       Date:  2003-10       Impact factor: 6.725

10.  A generic method for assignment of reliability scores applied to solvent accessibility predictions.

Authors:  Bent Petersen; Thomas Nordahl Petersen; Pernille Andersen; Morten Nielsen; Claus Lundegaard
Journal:  BMC Struct Biol       Date:  2009-07-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.