Literature DB >> 26460680

A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.

Xiaowei Li1, Taigang Liu2, Peiying Tao1, Chunhua Wang3, Lanming Chen1.   

Abstract

Structural class characterizes the overall folding type of a protein or its domain. Many methods have been proposed to improve the prediction accuracy of protein structural class in recent years, but it is still a challenge for the low-similarity sequences. In this study, we introduce a feature extraction technique based on auto cross covariance (ACC) transformation of position-specific score matrix (PSSM) to represent a protein sequence. Then support vector machine-recursive feature elimination (SVM-RFE) is adopted to select top K features according to their importance and these features are input to a support vector machine (SVM) to conduct the prediction. Performance evaluation of the proposed method is performed using the jackknife test on three low-similarity datasets, i.e., D640, 1189 and 25PDB. By means of this method, the overall accuracies of 97.2%, 96.2%, and 93.3% are achieved on these three datasets, which are higher than those of most existing methods. This suggests that the proposed method could serve as a very cost-effective tool for predicting protein structural class especially for low-similarity datasets.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Keywords:  Auto cross covariance; Low-similarity; Position-specific score matrix; Recursive feature elimination; Support vector machine

Mesh:

Substances:

Year:  2015        PMID: 26460680     DOI: 10.1016/j.compbiolchem.2015.08.012

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  5 in total

1.  iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.

Authors:  Yadong Tang; Lu Xie; Lanming Chen
Journal:  Int J Mol Sci       Date:  2018-04-13       Impact factor: 5.923

2.  Decision Variants for the Automatic Determination of Optimal Feature Subset in RF-RFE.

Authors:  Qi Chen; Zhaopeng Meng; Xinyi Liu; Qianguo Jin; Ran Su
Journal:  Genes (Basel)       Date:  2018-06-15       Impact factor: 4.096

3.  ProTstab - predictor for cellular protein stability.

Authors:  Yang Yang; Xuesong Ding; Guanchen Zhu; Abhishek Niroula; Qiang Lv; Mauno Vihinen
Journal:  BMC Genomics       Date:  2019-11-04       Impact factor: 3.969

4.  Fused-Filament Fabrication of Short Carbon Fiber-Reinforced Polyamide: Parameter Optimization for Improved Performance under Uniaxial Tensile Loading.

Authors:  Carlos Belei; Jana Joeressen; Sergio T Amancio-Filho
Journal:  Polymers (Basel)       Date:  2022-03-23       Impact factor: 4.329

5.  HMMPred: Accurate Prediction of DNA-Binding Proteins Based on HMM Profiles and XGBoost Feature Selection.

Authors:  Xiuzhi Sang; Wanyue Xiao; Huiwen Zheng; Yang Yang; Taigang Liu
Journal:  Comput Math Methods Med       Date:  2020-03-28       Impact factor: 2.238

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.