Literature DB >> 29055184

Predict protein structural class by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition.

Yunyun Liang1, Shengli Zhang2.   

Abstract

Structural class is an important attribute of protein, which plays an important role in both the experiment and theoretical research of protein science. Prediction of protein structural classes has been a challenging task, specifically for low-similarity datasets. In this paper, we develop a feature extraction method PSSS-SOMA-PSSM by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition (PseAAC) based on predicted secondary structure sequence (PSSS) and position-specific scoring matrix (PSSM). We construct a 170-dimensional (170D) feature vector for each protein sequence sample, which contains 10D PSSS features that reflect content, alternating word frequency and novel position information, and contains 160D PSSM features that are calculated by second-order moving average (SOMA) algorithm. The SVM classifier with RBF kernel function and the jackknife test are used to predict and evaluate on 1189, 25PDB and 640 datasets with sequence similarity lower than 40%, 25%, and 25%, respectively. Comparison of our results with other methods shows that the proposed method provides the state-of-the-art performance and a cost-effective alternative to structural classes prediction for low-similarity datasets.
Copyright © 2017 Elsevier Inc. All rights reserved.

Keywords:  Position information; Position-specific scoring matrix; Predicted secondary structure sequence; Protein structural classes; Second-order moving average; Support vector machine

Mesh:

Substances:

Year:  2017        PMID: 29055184     DOI: 10.1016/j.jmgm.2017.10.003

Source DB:  PubMed          Journal:  J Mol Graph Model        ISSN: 1093-3263            Impact factor:   2.518


  5 in total

Review 1.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

2.  Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate.

Authors:  Chun Yan Yu; Xiao Xu Li; Hong Yang; Ying Hong Li; Wei Wei Xue; Yu Zong Chen; Lin Tao; Feng Zhu
Journal:  Int J Mol Sci       Date:  2018-01-08       Impact factor: 5.923

3.  HBPred: a tool to identify growth hormone-binding proteins.

Authors:  Hua Tang; Ya-Wei Zhao; Ping Zou; Chun-Mei Zhang; Rong Chen; Po Huang; Hao Lin
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

4.  PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles.

Authors:  Alireza Mohammadi; Javad Zahiri; Saber Mohammadi; Mohsen Khodarahmi; Seyed Shahriar Arab
Journal:  Biol Methods Protoc       Date:  2022-03-30

5.  VTP-Identifier: Vesicular Transport Proteins Identification Based on PSSM Profiles and XGBoost.

Authors:  Yue Gong; Benzhi Dong; Zixiao Zhang; Yixiao Zhai; Bo Gao; Tianjiao Zhang; Jingyu Zhang
Journal:  Front Genet       Date:  2022-01-03       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.