Literature DB >> 17503398

Protein topology classification using two-stage support vector machines.

Jayavardhana Gubbi1, Alistair Shilton, Michael Parker, Marimuthu Palaniswami.   

Abstract

The determination of the first 3-D model of a protein from its sequence alone is a non-trivial problem. The first 3-D model is the key to the molecular replacement method of solving phase problem in x-ray crystallography. If the sequence identity is more than 30%, homology modelling can be used to determine the correct topology (as defined by CATH) or fold (as defined by SCOP). If the sequence identity is less than 25%, however, the task is very challenging. In this paper we address the topology classification of proteins with sequence identity of less than 25%. The input information to the system is amino acid sequence, the predicted secondary structure and the predicted real value relative solvent accessibility. A two stage support vector machine (SVM) approach is proposed for classifying the sequences to three different structural classes (alpha, beta, alpha+beta) in the first stage and 39 topologies in the second stage. The method is evaluated using a newly curated dataset from CATH with maximum pairwise sequence identity less than 25%. An impressive overall accuracy of 87.44% and 83.15% is reported for class and topology prediction, respectively. In the class prediction stage, a sensitivity of 0.77 and a specificity of 0.91 is obtained. Data file, SVM implementation (SVMHEAVY) and result files can be downloaded from http://www.ee.unimelb.edu.au/ISSNIP/downloads/.

Mesh:

Substances:

Year:  2006        PMID: 17503398

Source DB:  PubMed          Journal:  Genome Inform        ISSN: 0919-9454


  2 in total

1.  On the relation between the predicted secondary structure and the protein size.

Authors:  Lukasz Kurgan
Journal:  Protein J       Date:  2008-06       Impact factor: 2.371

2.  An ensemble method for predicting subnuclear localizations from primary protein structures.

Authors:  Guo Sheng Han; Zu Guo Yu; Vo Anh; Anaththa P D Krishnajith; Yu-Chu Tian
Journal:  PLoS One       Date:  2013-02-27       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.