Literature DB >> 24163645

Probabilistic Prediction of Protein Phosphorylation Sites Using Classification Relevance Units Machines.

Mark Menor1, Kyungim Baek, Guylaine Poisson.   

Abstract

Phosphorylation is an important post-translational modification of proteins that is essential to the regulation of many cellular processes. Although most of the phosphorylation sites discovered in protein sequences have been identified experimentally, the in vivo and in vitro discovery of the sites is an expensive, time-consuming and laborious task. Therefore, the development of computational methods for prediction of protein phosphorylation sites has drawn considerable attention. In this work, we present a kernel-based probabilistic Classification Relevance Units Machine (CRUM) for in silico phosphorylation site prediction. In comparison with the popular Support Vector Machine (SVM) CRUM shows comparable predictive performance and yet provides a more parsimonious model. This is desirable since it leads to a reduction in prediction run-time, which is important in predictions on large-scale data. Furthermore, the CRUM training algorithm has lower run-time and memory complexity and has a simpler parameter selection scheme than the Relevance Vector Machine (RVM) learning algorithm. To further investigate the viability of using CRUM in phosphorylation site prediction, we construct multiple CRUM predictors using different combinations of three phosphorylation site features - BLOSUM encoding, disorder, and amino acid composition. The predictors are evaluated through cross-validation and the results show that CRUM with BLOSUM feature is among the best performing CRUM predictors in both cross-validation and benchmark experiments. A comparative study with existing prediction tools in an independent benchmark experiment suggests possible direction for further improving the predictive performance of CRUM predictors.

Entities:  

Keywords:  Classification; Kernel machine; Phosphorylation

Year:  2012        PMID: 24163645      PMCID: PMC3806113          DOI: 10.1145/2432546.2432547

Source DB:  PubMed          Journal:  ACM SIGAPP Appl Comput Rev        ISSN: 1559-6915


  24 in total

1.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites.

Authors:  N Blom; S Gammeltoft; S Brunak
Journal:  J Mol Biol       Date:  1999-12-17       Impact factor: 5.469

2.  Structural basis and prediction of substrate specificity in protein serine/threonine kinases.

Authors:  Ross I Brinkworth; Robert A Breinl; Bostjan Kobe
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-26       Impact factor: 11.205

3.  Prediction of phosphorylation sites using SVMs.

Authors:  Jong Hun Kim; Juyoung Lee; Bermseok Oh; Kuchan Kimm; Insong Koh
Journal:  Bioinformatics       Date:  2004-07-01       Impact factor: 6.937

4.  Musite, a tool for global prediction of general and kinase-specific phosphorylation sites.

Authors:  Jianjiong Gao; Jay J Thelen; A Keith Dunker; Dong Xu
Journal:  Mol Cell Proteomics       Date:  2010-08-11       Impact factor: 5.911

5.  GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural network.

Authors:  Yu-Rong Tang; Yong-Zi Chen; Carlos A Canchaya; Ziding Zhang
Journal:  Protein Eng Des Sel       Date:  2007-07-24       Impact factor: 1.650

6.  Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information.

Authors:  Ashis Kumer Biswas; Nasimul Noman; Abdur Rahman Sikder
Journal:  BMC Bioinformatics       Date:  2010-05-21       Impact factor: 3.169

7.  NetPhosYeast: prediction of protein phosphorylation sites in yeast.

Authors:  Christian R Ingrell; Martin L Miller; Ole N Jensen; Nikolaj Blom
Journal:  Bioinformatics       Date:  2007-02-05       Impact factor: 6.937

8.  The importance of intrinsic disorder for protein phosphorylation.

Authors:  Lilia M Iakoucheva; Predrag Radivojac; Celeste J Brown; Timothy R O'Connor; Jason G Sikes; Zoran Obradovic; A Keith Dunker
Journal:  Nucleic Acids Res       Date:  2004-02-11       Impact factor: 16.971

9.  KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns.

Authors:  Yung-Hao Wong; Tzong-Yi Lee; Han-Kuen Liang; Chia-Mao Huang; Ting-Yuan Wang; Yi-Huan Yang; Chia-Huei Chu; Hsien-Da Huang; Ming-Tat Ko; Jenn-Kang Hwang
Journal:  Nucleic Acids Res       Date:  2007-05-21       Impact factor: 16.971

10.  Phospho.ELM: a database of phosphorylation sites--update 2008.

Authors:  Francesca Diella; Cathryn M Gould; Claudia Chica; Allegra Via; Toby J Gibson
Journal:  Nucleic Acids Res       Date:  2007-10-25       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.