Literature DB >> 11301304

Multi-class protein fold recognition using support vector machines and neural networks.

C H Ding1, I Dubchak.   

Abstract

MOTIVATION: Protein fold recognition is an important approach to structure discovery without relying on sequence similarity. We study this approach with new multi-class classification methods and examined many issues important for a practical recognition system.
RESULTS: Most current discriminative methods for protein fold prediction use the one-against-others method, which has the well-known 'False Positives' problem. We investigated two new methods: the unique one-against-others and the all-against-all methods. Both improve prediction accuracy by 14-110% on a dataset containing 27 SCOP folds. We used the Support Vector Machine (SVM) and the Neural Network (NN) learning methods as base classifiers. SVMs converges fast and leads to high accuracy. When scores of multiple parameter datasets are combined, majority voting reduces noise and increases recognition accuracy. We examined many issues involved with large number of classes, including dependencies of prediction accuracy on the number of folds and on the number of representatives in a fold. Overall, recognition systems achieve 56% fold prediction accuracy on a protein test dataset, where most of the proteins have below 25% sequence identity with the proteins used in training.

Mesh:

Substances:

Year:  2001        PMID: 11301304     DOI: 10.1093/bioinformatics/17.4.349

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  103 in total

1.  Support vector machines for predicting membrane protein types by using functional domain composition.

Authors:  Yu-Dong Cai; Guo-Ping Zhou; Kuo-Chen Chou
Journal:  Biophys J       Date:  2003-05       Impact factor: 4.033

2.  An optimal structure-discriminative amino acid index for protein fold recognition.

Authors:  R H Leary; J B Rosen; P Jambeck
Journal:  Biophys J       Date:  2004-01       Impact factor: 4.033

3.  SVM-Prot: Web-based support vector machine software for functional classification of a protein from its primary sequence.

Authors:  C Z Cai; L Y Han; Z L Ji; X Chen; Y Z Chen
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Authors:  Lian Yi Han; Cong Zhong Cai; Siew Lin Lo; Maxey C M Chung; Yu Zong Chen
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

5.  Broiler chickens can benefit from machine learning: support vector machine analysis of observational epidemiological data.

Authors:  Philip J Hepworth; Alexey V Nefedov; Ilya B Muchnik; Kenton L Morgan
Journal:  J R Soc Interface       Date:  2012-02-08       Impact factor: 4.118

6.  Support Vector Machine on fluorescence landscapes for breast cancer diagnostics.

Authors:  Tatjana Dramićanin; Lea Lenhardt; Ivana Zeković; Miroslav D Dramićanin
Journal:  J Fluoresc       Date:  2012-06-08       Impact factor: 2.217

7.  The prediction of human oral absorption for diffusion rate-limited drugs based on heuristic method and support vector machine.

Authors:  H X Liu; R J Hu; R S Zhang; X J Yao; M C Liu; Z D Hu; B T Fan
Journal:  J Comput Aided Mol Des       Date:  2005-01       Impact factor: 3.686

8.  Structural bioinformatics prediction of membrane-binding proteins.

Authors:  Nitin Bhardwaj; Robert V Stahelin; Robert E Langlois; Wonhwa Cho; Hui Lu
Journal:  J Mol Biol       Date:  2006-03-30       Impact factor: 5.469

9.  A composite score for predicting errors in protein structure models.

Authors:  David Eramian; Min-yi Shen; Damien Devos; Francisco Melo; Andrej Sali; Marc A Marti-Renom
Journal:  Protein Sci       Date:  2006-06-02       Impact factor: 6.725

10.  Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.

Authors:  Nitin Bhardwaj; Hui Lu
Journal:  FEBS Lett       Date:  2007-02-07       Impact factor: 4.124

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.