Literature DB >> 21594694

PSP_MCSVM: brainstorming consensus prediction of protein secondary structures using two-stage multiclass support vector machines.

Piyali Chatterjee1, Subhadip Basu, Mahantapas Kundu, Mita Nasipuri, Dariusz Plewczynski.   

Abstract

Secondary structure prediction is a crucial task for understanding the variety of protein structures and performed biological functions. Prediction of secondary structures for new proteins using their amino acid sequences is of fundamental importance in bioinformatics. We propose a novel technique to predict protein secondary structures based on position-specific scoring matrices (PSSMs) and physico-chemical properties of amino acids. It is a two stage approach involving multiclass support vector machines (SVMs) as classifiers for three different structural conformations, viz., helix, sheet and coil. In the first stage, PSSMs obtained from PSI-BLAST and five specially selected physicochemical properties of amino acids are fed into SVMs as features for sequence-to-structure prediction. Confidence values for forming helix, sheet and coil that are obtained from the first stage SVM are then used in the second stage SVM for performing structure-to-structure prediction. The two-stage cascaded classifiers (PSP_MCSVM) are trained with proteins from RS126 dataset. The classifiers are finally tested on target proteins of critical assessment of protein structure prediction experiment-9 (CASP9). PSP_MCSVM with brainstorming consensus procedure performs better than the prediction servers like Predator, DSC, SIMPA96, for randomly selected proteins from CASP9 targets. The overall performance is found to be comparable with the current state-of-the art. PSP_MCSVM source code, train-test datasets and supplementary files are available freely in public domain at: http://sysbio.icm.edu.pl/secstruct and http://code.google.com/p/cmater-bioinfo/

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21594694      PMCID: PMC3168739          DOI: 10.1007/s00894-011-1102-8

Source DB:  PubMed          Journal:  J Mol Model        ISSN: 0948-5023            Impact factor:   1.810


  19 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  A novel method for protein secondary structure prediction using dual-layer SVM and profiles.

Authors:  Jian Guo; Hu Chen; Zhirong Sun; Yuanlie Lin
Journal:  Proteins       Date:  2004-03-01

3.  HYPROSP: a hybrid protein secondary structure prediction algorithm--a knowledge-based approach.

Authors:  Kuen-Pin Wu; Hsin-Nan Lin; Jia-Ming Chang; Ting-Yi Sung; Wen-Lian Hsu
Journal:  Nucleic Acids Res       Date:  2004-09-24       Impact factor: 16.971

4.  HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.

Authors:  Hsin-Nan Lin; Jia-Ming Chang; Kuen-Pin Wu; Ting-Yi Sung; Wen-Lian Hsu
Journal:  Bioinformatics       Date:  2005-06-02       Impact factor: 6.937

5.  Identification and application of the concepts important for accurate and reliable protein secondary structure prediction.

Authors:  R D King; M J Sternberg
Journal:  Protein Sci       Date:  1996-11       Impact factor: 6.725

6.  Seventy-five percent accuracy in protein secondary structure prediction.

Authors:  D Frishman; P Argos
Journal:  Proteins       Date:  1997-03

7.  Further developments of protein secondary structure prediction using information theory. New parameters and consideration of residue pairs.

Authors:  J F Gibrat; J Garnier; B Robson
Journal:  J Mol Biol       Date:  1987-12-05       Impact factor: 5.469

8.  Prediction of protein secondary structure at better than 70% accuracy.

Authors:  B Rost; C Sander
Journal:  J Mol Biol       Date:  1993-07-20       Impact factor: 5.469

9.  Prediction of protein secondary structure by combining nearest-neighbor algorithms and multiple sequence alignments.

Authors:  A A Salamov; V V Solovyev
Journal:  J Mol Biol       Date:  1995-03-17       Impact factor: 5.469

10.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins.

Authors:  J Garnier; D J Osguthorpe; B Robson
Journal:  J Mol Biol       Date:  1978-03-25       Impact factor: 5.469

View more
  6 in total

1.  Protein-protein interaction site prediction in Homo sapiens and E. coli using an interaction-affinity based membership function in fuzzy SVM.

Authors:  Brijesh Kumar Sriwastava; Subhadip Basu; Ujjwal Maulik
Journal:  J Biosci       Date:  2015-10       Impact factor: 1.826

2.  ccPDB: compilation and creation of data sets from Protein Data Bank.

Authors:  Harinder Singh; Jagat Singh Chauhan; M Michael Gromiha; Gajendra P S Raghava
Journal:  Nucleic Acids Res       Date:  2011-12-01       Impact factor: 16.971

3.  Integrated Strategy Improves the Prediction Accuracy of miRNA in Large Dataset.

Authors:  Bin Xue; David Lipps; Sree Devineni
Journal:  PLoS One       Date:  2016-12-21       Impact factor: 3.240

4.  FunPred 3.0: improved protein function prediction using protein interaction network.

Authors:  Sovan Saha; Piyali Chatterjee; Subhadip Basu; Mita Nasipuri; Dariusz Plewczynski
Journal:  PeerJ       Date:  2019-05-22       Impact factor: 2.984

5.  Prediction of protein secondary structure based on an improved channel attention and multiscale convolution module.

Authors:  Xin Jin; Lin Guo; Qian Jiang; Nan Wu; Shaowen Yao
Journal:  Front Bioeng Biotechnol       Date:  2022-07-22

6.  PPIcons: identification of protein-protein interaction sites in selected organisms.

Authors:  Brijesh K Sriwastava; Subhadip Basu; Ujjwal Maulik; Dariusz Plewczynski
Journal:  J Mol Model       Date:  2013-06-02       Impact factor: 1.810

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.