Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Predicting the phosphorylation sites using hidden Markov models and machine learning methods.

Literature DB >> 16045309

Predicting the phosphorylation sites using hidden Markov models and machine learning methods.

Pasak Senawongse¹, Andrew R Dalby, Zheng Rong Yang.

Abstract

Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for which how to efficiently extract the most predictive features from amino acid sequences for modeling is still challenging. Although both the distributed encoding method and the bio-basis function method work well, they still have some limits in use. The distributed encoding method is unable to code the biological content in sequences efficiently, whereas the bio-basis function method is a nonparametric method, which is often computationally expensive. As hidden Markov models (HMMs) can be used to generate one model for one cluster of aligned protein sequences, the aim in this study is to use HMMs to extract features from amino acid sequences, where sequence clusters are determined using available biological knowledge. In this novel method, HMMs are first constructed using functional sequences only. Both functional and nonfunctional training sequences are then inputted into the trained HMMs to generate functional and nonfunctional feature vectors. From this, a machine learning algorithm is used to construct a classifier based on these feature vectors. It is found in this work that (1) this method provides much better prediction accuracy than the use of HMMs only for prediction, and (2) the support vector machines (SVMs) algorithm outperforms decision trees and neural network algorithms when they are constructed on the features extracted using the trained HMMs.

Mesh：

Substances：
Proteins

Year: 2005 PMID： 16045309 DOI： 10.1021/ci050047+

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Keyword Cloud
Cited

8 in total

1. AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update.

Authors: Dariusz Plewczynski; Adrian Tkacz; Lucjan S Wyrwicz; Leszek Rychlewski; Krzysztof Ginalski
Journal: J Mol Model Date: 2007-11-08 Impact factor: 1.810

2. iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC.

Authors: Yaser Daanial Khan; Nouman Rasool; Waqar Hussain; Sher Afzal Khan; Kuo-Chen Chou
Journal: Mol Biol Rep Date: 2018-10-11 Impact factor: 2.316

3. Protein kinase C phosphorylation regulates membrane insertion of GABAA receptor subtypes that mediate tonic inhibition.

Authors: Armen M Abramian; Eydith Comenencia-Ortiz; Mansi Vithlani; Eva Verena Tretter; Werner Sieghart; Paul A Davies; Stephen J Moss
Journal: J Biol Chem Date: 2010-10-12 Impact factor: 5.157

Predicting the phosphorylation sites using hidden Markov models and machine learning methods.

1. AutoMotif Server for prediction of phosphorylation sites in proteins using support vector machine: 2007 update.

2. iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC.

3. Protein kinase C phosphorylation regulates membrane insertion of GABAA receptor subtypes that mediate tonic inhibition.

Review 4. Peptide bioinformatics: peptide classification using peptide machines.

5. Novel molecular imaging platform for monitoring oncological kinases.

6. Charge environments around phosphorylation sites in proteins.

7. A biochemical genomics screen for substrates of Ste20p kinase enables the in silico prediction of novel substrates.

8. Prediction of cyclin-dependent kinase phosphorylation substrates.