Literature DB >> 18423492

Predicting protein structural class by SVM with class-wise optimized features and decision probabilities.

Ashish Anand1, Ganesan Pugalenthi, P N Suganthan.   

Abstract

Determination of protein structural class solely from sequence information is a challenging task. Several attempts to solve this problem using various methods can be found in literature. We present support vector machine (SVM) approach where probability-based decision is used along with class-wise optimized feature sets. This approach has two distinguishing characteristics from earlier attempts: (1) it uses class-wise optimized features and (2) decisions of different SVM classifiers are coupled with probability estimates to make the final prediction. The algorithm was tested on three datasets, containing 498 domains, 1092 domains and 5261 domains. Ten-fold external cross-validation was performed to assess the performance of the algorithm. Significantly high accuracy of 92.89% was obtained for the 498-dataset. We achieved 54.67% accuracy for the dataset with 1092 domains, which is better than the previously reported best accuracy of 53.8%. We obtained 59.43% prediction accuracy for the larger and less redundant 5261-dataset. We also investigated the advantage of using class-wise features over union of these features (conventional approach) in one-vs.-all SVM framework. Our results clearly show the advantage of using class-wise optimized features. Brief analysis of the selected class-wise features indicates their biological significance.

Mesh:

Year:  2008        PMID: 18423492     DOI: 10.1016/j.jtbi.2008.02.031

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  9 in total

1.  iAFP-Ense: An Ensemble Classifier for Identifying Antifreeze Protein by Incorporating Grey Model and PSSM into PseAAC.

Authors:  Xuan Xiao; Mengjuan Hui; Zi Liu
Journal:  J Membr Biol       Date:  2016-11-03       Impact factor: 1.843

2.  A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.

Authors:  Mohammad Saleh Refahi; A Mir; Jalal A Nasiri
Journal:  Sci Rep       Date:  2020-09-01       Impact factor: 4.379

3.  Prediction of protein structural classes for low-homology sequences based on predicted secondary structure.

Authors:  Jian-Yi Yang; Zhen-Ling Peng; Xin Chen
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

4.  Prediction of lysine ubiquitylation with ensemble classifier and feature selection.

Authors:  Xiaowei Zhao; Xiangtao Li; Zhiqiang Ma; Minghao Yin
Journal:  Int J Mol Sci       Date:  2011-11-28       Impact factor: 5.923

5.  Customised fragments libraries for protein structure prediction based on structural class annotations.

Authors:  Jad Abbass; Jean-Christophe Nebel
Journal:  BMC Bioinformatics       Date:  2015-04-29       Impact factor: 3.169

6.  Proposing a highly accurate protein structural class predictor using segmentation-based features.

Authors:  Abdollah Dehzangi; Kuldip Paliwal; James Lyons; Alok Sharma; Abdul Sattar
Journal:  BMC Genomics       Date:  2014-01-24       Impact factor: 3.969

7.  Improvement in Protein Domain Identification Is Reached by Breaking Consensus, with the Agreement of Many Profiles and Domain Co-occurrence.

Authors:  Juliana Bernardes; Gerson Zaverucha; Catherine Vaquero; Alessandra Carbone
Journal:  PLoS Comput Biol       Date:  2016-07-29       Impact factor: 4.475

8.  Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

Authors:  Marcin J Mizianty; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2009-12-13       Impact factor: 3.169

9.  Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.

Authors:  Shunfang Wang; Lin Deng; Xinnan Xia; Zicheng Cao; Yu Fei
Journal:  BMC Bioinformatics       Date:  2021-06-23       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.