Literature DB >> 12577258

Fine-grained protein fold assignment by support vector machines using generalized npeptide coding schemes and jury voting from multiple-parameter sets.

Chin-Sheng Yu1, Jung-Ying Wang, Jinn-Moon Yang, Ping-Chiang Lyu, Chih-Jen Lin, Jenn-Kang Hwang.   

Abstract

In the coarse-grained fold assignment of major protein classes, such as all-alpha, all-beta, alpha + beta, alpha/beta proteins, one can easily achieve high prediction accuracy from primary amino acid sequences. However, the fine-grained assignment of folds, such as those defined in the Structural Classification of Proteins (SCOP) database, presents a challenge due to the larger amount of folds available. Recent study yielded reasonable prediction accuracy of 56.0% on an independent set of 27 most populated folds. In this communication, we apply the support vector machine (SVM) method, using a combination of protein descriptors based on the properties derived from the composition of n-peptide and jury voting, to the fine-grained fold prediction, and are able to achieve an overall prediction accuracy of 69.6% on the same independent set-significantly higher than the previous results. On 10-fold cross-validation, we obtained a prediction accuracy of 65.3%. Our results show that SVM coupled with suitable global sequence-coding schemes can significantly improve the fine-grained fold prediction. Our approach should be useful in structure prediction and modeling. Copyright 2003 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2003        PMID: 12577258     DOI: 10.1002/prot.10313

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  8 in total

1.  Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions.

Authors:  Chin-Sheng Yu; Chih-Jen Lin; Jenn-Kang Hwang
Journal:  Protein Sci       Date:  2004-05       Impact factor: 6.725

2.  The structure-based cancer-related single amino acid variation prediction.

Authors:  Jia-Jun Liu; Chin-Sheng Yu; Hsiao-Wei Wu; Yu-Jen Chang; Chih-Peng Lin; Chih-Hao Lu
Journal:  Sci Rep       Date:  2021-06-30       Impact factor: 4.379

3.  Identification of antifreeze proteins and their functional residues by support vector machine and genetic algorithms based on n-peptide compositions.

Authors:  Chin-Sheng Yu; Chih-Hao Lu
Journal:  PLoS One       Date:  2011-05-31       Impact factor: 3.240

4.  Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition.

Authors:  Takeyuki Tamura; Tatsuya Akutsu
Journal:  BMC Bioinformatics       Date:  2007-11-30       Impact factor: 3.169

5.  An SVM-based system for predicting protein subnuclear localizations.

Authors:  Zhengdeng Lei; Yang Dai
Journal:  BMC Bioinformatics       Date:  2005-12-07       Impact factor: 3.169

6.  Identification of Cancerlectins Using Support Vector Machines With Fusion of G-Gap Dipeptide.

Authors:  Lili Qian; Yaping Wen; Guosheng Han
Journal:  Front Genet       Date:  2020-04-03       Impact factor: 4.599

7.  Predicting Anticancer Drug Resistance Mediated by Mutations.

Authors:  Yu-Feng Lin; Jia-Jun Liu; Yu-Jen Chang; Chin-Sheng Yu; Wei Yi; Hsien-Yuan Lane; Chih-Hao Lu
Journal:  Pharmaceuticals (Basel)       Date:  2022-01-24

8.  On the structural context and identification of enzyme catalytic residues.

Authors:  Yu-Tung Chien; Shao-Wei Huang
Journal:  Biomed Res Int       Date:  2013-02-03       Impact factor: 3.411

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.