Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.

Literature DB >> 24316044

Accurate prediction of protein structural classes by incorporating predicted secondary structure information into the general form of Chou's pseudo amino acid composition.

Liang Kong¹, Lichao Zhang², Jinfeng Lv¹.

Abstract

Extracting good representation from protein sequence is fundamental for protein structural classes prediction tasks. In this paper, we propose a novel and powerful method to predict protein structural classes based on the predicted secondary structure information. At the feature extraction stage, a 13-dimensional feature vector is extracted to characterize general contents and spatial arrangements of the secondary structural elements of a given protein sequence. Specially, four segment-level features are designed to elevate discriminative ability for proteins from the α/β and α+β classes. After the features are extracted, a multi-class non-linear support vector machine classifier is used to implement protein structural classes prediction. We report extensive experiments comparing the proposed method to the state-of-the-art in protein structural classes prediction on three widely used low-similarity benchmark datasets: FC699, 1189 and 640. Our method achieves competitive performance on prediction accuracies, especially for the overall prediction accuracies which have exceeded the best reported results on all of the three datasets.

Keywords: Secondary structure; Sequence similarity; Support vector machine

Mesh：

Substances：
Amino Acids

Year: 2013 PMID： 24316044 DOI： 10.1016/j.jtbi.2013.11.021

Source DB: PubMed Journal: J Theor Biol ISSN： 0022-5193 Impact factor: 2.691

Keyword Cloud
Cited

12 in total

1. Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors: Bin Liu; Junjie Chen; Xiaolong Wang
Journal: Mol Genet Genomics Date: 2015-04-21 Impact factor: 3.291

Review 2. Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors: Kuo-Chen Chou
Journal: Mol Genet Genomics Date: 2020-01-01 Impact factor: 3.291

3. iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition.

Authors: Yan Xu; Xin Wen; Li-Shu Wen; Ling-Yun Wu; Nai-Yang Deng; Kuo-Chen Chou
Journal: PLoS One Date: 2014-08-14 Impact factor: 3.240

4. iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.

Authors: Bin Liu; Jinghao Xu; Xun Lan; Ruifeng Xu; Jiyun Zhou; Xiaolong Wang; Kuo-Chen Chou
Journal: PLoS One Date: 2014-09-03 Impact factor: 3.240

5. Identification of real microRNA precursors with a pseudo structure status composition approach.

Authors: Bin Liu; Longyun Fang; Fule Liu; Xiaolong Wang; Junjie Chen; Kuo-Chen Chou
Journal: PLoS One Date: 2015-03-30 Impact factor: 3.240

6. A high performance prediction of HPV genotypes by Chaos game representation and singular value decomposition.

Authors: Watcharaporn Tanchotsrinon; Chidchanok Lursinsap; Yong Poovorawan
Journal: BMC Bioinformatics Date: 2015-03-05 Impact factor: 3.169

7. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

Authors: Yushuang Li; Tian Song; Jiasheng Yang; Yi Zhang; Jialiang Yang
Journal: PLoS One Date: 2016-12-05 Impact factor: 3.240

8. Accurate prediction of subcellular location of apoptosis proteins combining Chou's PseAAC and PsePSSM based on wavelet denoising.

Authors: Bin Yu; Shan Li; Wen-Ying Qiu; Cheng Chen; Rui-Xin Chen; Lei Wang; Ming-Hui Wang; Yan Zhang
Journal: Oncotarget Date: 2017-11-21

9. PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.

Authors: Pufeng Du; Shuwang Gu; Yasen Jiao
Journal: Int J Mol Sci Date: 2014-02-26 Impact factor: 5.923

10. A New Method for Recognizing Cytokines Based on Feature Combination and a Support Vector Machine Classifier.

Authors: Zhe Yang; Juan Wang; Zhida Zheng; Xin Bai
Journal: Molecules Date: 2018-08-11 Impact factor: 4.411