Literature DB >> 16045309

Predicting the phosphorylation sites using hidden Markov models and machine learning methods.

Pasak Senawongse1, Andrew R Dalby, Zheng Rong Yang.   

Abstract

Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for which how to efficiently extract the most predictive features from amino acid sequences for modeling is still challenging. Although both the distributed encoding method and the bio-basis function method work well, they still have some limits in use. The distributed encoding method is unable to code the biological content in sequences efficiently, whereas the bio-basis function method is a nonparametric method, which is often computationally expensive. As hidden Markov models (HMMs) can be used to generate one model for one cluster of aligned protein sequences, the aim in this study is to use HMMs to extract features from amino acid sequences, where sequence clusters are determined using available biological knowledge. In this novel method, HMMs are first constructed using functional sequences only. Both functional and nonfunctional training sequences are then inputted into the trained HMMs to generate functional and nonfunctional feature vectors. From this, a machine learning algorithm is used to construct a classifier based on these feature vectors. It is found in this work that (1) this method provides much better prediction accuracy than the use of HMMs only for prediction, and (2) the support vector machines (SVMs) algorithm outperforms decision trees and neural network algorithms when they are constructed on the features extracted using the trained HMMs.

Mesh:

Substances:

Year:  2005        PMID: 16045309     DOI: 10.1021/ci050047+

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  8 in total

1.  AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update.

Authors:  Dariusz Plewczynski; Adrian Tkacz; Lucjan S Wyrwicz; Leszek Rychlewski; Krzysztof Ginalski
Journal:  J Mol Model       Date:  2007-11-08       Impact factor: 1.810

2.  iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC.

Authors:  Yaser Daanial Khan; Nouman Rasool; Waqar Hussain; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Mol Biol Rep       Date:  2018-10-11       Impact factor: 2.316

3.  Protein kinase C phosphorylation regulates membrane insertion of GABAA receptor subtypes that mediate tonic inhibition.

Authors:  Armen M Abramian; Eydith Comenencia-Ortiz; Mansi Vithlani; Eva Verena Tretter; Werner Sieghart; Paul A Davies; Stephen J Moss
Journal:  J Biol Chem       Date:  2010-10-12       Impact factor: 5.157

Review 4.  Peptide bioinformatics: peptide classification using peptide machines.

Authors:  Zheng Rong Yang
Journal:  Methods Mol Biol       Date:  2008

5.  Novel molecular imaging platform for monitoring oncological kinases.

Authors:  Shyam Nyati; Brian D Ross; Alnawaz Rehemtulla; Mahaveer S Bhojani
Journal:  Cancer Cell Int       Date:  2010-07-08       Impact factor: 5.722

6.  Charge environments around phosphorylation sites in proteins.

Authors:  James Kitchen; Rebecca E Saunders; Jim Warwicker
Journal:  BMC Struct Biol       Date:  2008-03-25

7.  A biochemical genomics screen for substrates of Ste20p kinase enables the in silico prediction of novel substrates.

Authors:  Robert B Annan; Anna Y Lee; Ian D Reid; Azin Sayad; Malcolm Whiteway; Michael Hallett; David Y Thomas
Journal:  PLoS One       Date:  2009-12-16       Impact factor: 3.240

8.  Prediction of cyclin-dependent kinase phosphorylation substrates.

Authors:  Emmanuel J Chang; Rashida Begum; Brian T Chait; Terry Gaasterland
Journal:  PLoS One       Date:  2007-08-01       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.