Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.

Literature DB >> 26460680

A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.

Xiaowei Li¹, Taigang Liu², Peiying Tao¹, Chunhua Wang³, Lanming Chen¹.

Abstract

Structural class characterizes the overall folding type of a protein or its domain. Many methods have been proposed to improve the prediction accuracy of protein structural class in recent years, but it is still a challenge for the low-similarity sequences. In this study, we introduce a feature extraction technique based on auto cross covariance (ACC) transformation of position-specific score matrix (PSSM) to represent a protein sequence. Then support vector machine-recursive feature elimination (SVM-RFE) is adopted to select top K features according to their importance and these features are input to a support vector machine (SVM) to conduct the prediction. Performance evaluation of the proposed method is performed using the jackknife test on three low-similarity datasets, i.e., D640, 1189 and 25PDB. By means of this method, the overall accuracies of 97.2%, 96.2%, and 93.3% are achieved on these three datasets, which are higher than those of most existing methods. This suggests that the proposed method could serve as a very cost-effective tool for predicting protein structural class especially for low-similarity datasets.

Keywords: Auto cross covariance; Low-similarity; Position-specific score matrix; Recursive feature elimination; Support vector machine

Mesh：

Substances：
Proteins

Year: 2015 PMID： 26460680 DOI： 10.1016/j.compbiolchem.2015.08.012

Source DB: PubMed Journal: Comput Biol Chem ISSN： 1476-9271 Impact factor: 2.877

Keyword Cloud
Cited

5 in total

1. iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.

Authors: Yadong Tang; Lu Xie; Lanming Chen
Journal: Int J Mol Sci Date: 2018-04-13 Impact factor: 5.923

2. Decision Variants for the Automatic Determination of Optimal Feature Subset in RF-RFE.

Authors: Qi Chen; Zhaopeng Meng; Xinyi Liu; Qianguo Jin; Ran Su
Journal: Genes (Basel) Date: 2018-06-15 Impact factor: 4.096

3. ProTstab - predictor for cellular protein stability.

Authors: Yang Yang; Xuesong Ding; Guanchen Zhu; Abhishek Niroula; Qiang Lv; Mauno Vihinen
Journal: BMC Genomics Date: 2019-11-04 Impact factor: 3.969

4. Fused-Filament Fabrication of Short Carbon Fiber-Reinforced Polyamide: Parameter Optimization for Improved Performance under Uniaxial Tensile Loading.

Authors: Carlos Belei; Jana Joeressen; Sergio T Amancio-Filho
Journal: Polymers (Basel) Date: 2022-03-23 Impact factor: 4.329

5. HMMPred: Accurate Prediction of DNA-Binding Proteins Based on HMM Profiles and XGBoost Feature Selection.

Authors: Xiuzhi Sang; Wanyue Xiao; Huiwen Zheng; Yang Yang; Taigang Liu
Journal: Comput Math Methods Med Date: 2020-03-28 Impact factor: 2.238

5 in total