| Literature DB >> 24316044 |
Liang Kong1, Lichao Zhang2, Jinfeng Lv1.
Abstract
Extracting good representation from protein sequence is fundamental for protein structural classes prediction tasks. In this paper, we propose a novel and powerful method to predict protein structural classes based on the predicted secondary structure information. At the feature extraction stage, a 13-dimensional feature vector is extracted to characterize general contents and spatial arrangements of the secondary structural elements of a given protein sequence. Specially, four segment-level features are designed to elevate discriminative ability for proteins from the α/β and α+β classes. After the features are extracted, a multi-class non-linear support vector machine classifier is used to implement protein structural classes prediction. We report extensive experiments comparing the proposed method to the state-of-the-art in protein structural classes prediction on three widely used low-similarity benchmark datasets: FC699, 1189 and 640. Our method achieves competitive performance on prediction accuracies, especially for the overall prediction accuracies which have exceeded the best reported results on all of the three datasets.Keywords: Secondary structure; Sequence similarity; Support vector machine
Mesh:
Substances:
Year: 2013 PMID: 24316044 DOI: 10.1016/j.jtbi.2013.11.021
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691