Literature DB >> 26395771

Positive-unlabeled ensemble learning for kinase substrate prediction from dynamic phosphoproteomics data.

Pengyi Yang1, Sean J Humphrey2, David E James3, Yee Hwa Yang4, Raja Jothi1.   

Abstract

MOTIVATION: Protein phosphorylation is a post-translational modification that underlines various aspects of cellular signaling. A key step to reconstructing signaling networks involves identification of the set of all kinases and their substrates. Experimental characterization of kinase substrates is both expensive and time-consuming. To expedite the discovery of novel substrates, computational approaches based on kinase recognition sequence (motifs) from known substrates, protein structure, interaction and co-localization have been proposed. However, rarely do these methods take into account the dynamic responses of signaling cascades measured from in vivo cellular systems. Given that recent advances in mass spectrometry-based technologies make it possible to quantify phosphorylation on a proteome-wide scale, computational approaches that can integrate static features with dynamic phosphoproteome data would greatly facilitate the prediction of biologically relevant kinase-specific substrates.
RESULTS: Here, we propose a positive-unlabeled ensemble learning approach that integrates dynamic phosphoproteomics data with static kinase recognition motifs to predict novel substrates for kinases of interest. We extended a positive-unlabeled learning technique for an ensemble model, which significantly improves prediction sensitivity on novel substrates of kinases while retaining high specificity. We evaluated the performance of the proposed model using simulation studies and subsequently applied it to predict novel substrates of key kinases relevant to insulin signaling. Our analyses show that static sequence motifs and dynamic phosphoproteomics data are complementary and that the proposed integrated model performs better than methods relying only on static information for accurate prediction of kinase-specific substrates.
AVAILABILITY AND IMPLEMENTATION: Executable GUI tool, source code and documentation are freely available at https://github.com/PengyiYang/KSP-PUEL. CONTACT: pengyi.yang@nih.gov or jothi@mail.nih.gov SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2015. This work is written by US Government employees and is in the public domain in the US.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26395771      PMCID: PMC4739180          DOI: 10.1093/bioinformatics/btv550

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  36 in total

1.  Prediction of phosphorylation sites using SVMs.

Authors:  Jong Hun Kim; Juyoung Lee; Bermseok Oh; Kuchan Kimm; Insong Koh
Journal:  Bioinformatics       Date:  2004-07-01       Impact factor: 6.937

2.  Systematic discovery of in vivo phosphorylation networks.

Authors:  Rune Linding; Lars Juhl Jensen; Gerard J Ostheimer; Marcel A T M van Vugt; Claus Jørgensen; Ioana M Miron; Francesca Diella; Karen Colwill; Lorne Taylor; Kelly Elder; Pavel Metalnikov; Vivian Nguyen; Adrian Pasculescu; Jing Jin; Jin Gyoon Park; Leona D Samson; James R Woodgett; Robert B Russell; Peer Bork; Michael B Yaffe; Tony Pawson
Journal:  Cell       Date:  2007-06-14       Impact factor: 41.582

3.  A curated compendium of phosphorylation motifs.

Authors:  Ramars Amanchy; Balamurugan Periaswamy; Suresh Mathivanan; Raghunath Reddy; Sudhir Gopal Tattikota; Akhilesh Pandey
Journal:  Nat Biotechnol       Date:  2007-03       Impact factor: 54.908

4.  SVMs modeling for highly imbalanced classification.

Authors:  Yuchun Tang; Yan-Qing Zhang; Nitesh V Chawla; Sven Krasser
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2008-12-09

Review 5.  Protein kinases and phosphatases: the yin and yang of protein phosphorylation and signaling.

Authors:  T Hunter
Journal:  Cell       Date:  1995-01-27       Impact factor: 41.582

6.  Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.

Authors:  Jesper V Olsen; Blagoy Blagoev; Florian Gnad; Boris Macek; Chanchal Kumar; Peter Mortensen; Matthias Mann
Journal:  Cell       Date:  2006-11-03       Impact factor: 41.582

7.  Linear motif atlas for phosphorylation-dependent signaling.

Authors:  Martin Lee Miller; Lars Juhl Jensen; Francesca Diella; Claus Jørgensen; Michele Tinti; Lei Li; Marilyn Hsiung; Sirlester A Parker; Jennifer Bordeaux; Thomas Sicheritz-Ponten; Marina Olhovsky; Adrian Pasculescu; Jes Alexander; Stefan Knapp; Nikolaj Blom; Peer Bork; Shawn Li; Gianni Cesareni; Tony Pawson; Benjamin E Turk; Michael B Yaffe; Søren Brunak; Rune Linding
Journal:  Sci Signal       Date:  2008-09-02       Impact factor: 8.192

8.  GPS 2.0, a tool to predict kinase-specific phosphorylation sites in hierarchy.

Authors:  Yu Xue; Jian Ren; Xinjiao Gao; Changjiang Jin; Longping Wen; Xuebiao Yao
Journal:  Mol Cell Proteomics       Date:  2008-05-06       Impact factor: 5.911

9.  KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns.

Authors:  Yung-Hao Wong; Tzong-Yi Lee; Han-Kuen Liang; Chia-Mao Huang; Ting-Yuan Wang; Yi-Huan Yang; Chia-Huei Chu; Hsien-Da Huang; Ming-Tat Ko; Jenn-Kang Hwang
Journal:  Nucleic Acids Res       Date:  2007-05-21       Impact factor: 16.971

10.  Prediction of kinase-specific phosphorylation sites using conditional random fields.

Authors:  Thanh Hai Dang; Koenraad Van Leemput; Alain Verschoren; Kris Laukens
Journal:  Bioinformatics       Date:  2008-10-20       Impact factor: 6.937

View more
  10 in total

1.  Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.

Authors:  Fuyi Li; Yanan Wang; Chen Li; Tatiana T Marquez-Lago; André Leier; Neil D Rawlings; Gholamreza Haffari; Jerico Revote; Tatsuya Akutsu; Kuo-Chen Chou; Anthony W Purcell; Robert N Pike; Geoffrey I Webb; A Ian Smith; Trevor Lithgow; Roger J Daly; James C Whisstock; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

2.  Multi-omic Profiling Reveals Dynamics of the Phased Progression of Pluripotency.

Authors:  Pengyi Yang; Sean J Humphrey; Senthilkumar Cinghu; Rajneesh Pathania; Andrew J Oldfield; Dhirendra Kumar; Dinuka Perera; Jean Y H Yang; David E James; Matthias Mann; Raja Jothi
Journal:  Cell Syst       Date:  2019-05-08       Impact factor: 10.304

3.  Accurate, high-coverage assignment of in vivo protein kinases to phosphosites from in vitro phosphoproteomic specificity data.

Authors:  Brandon M Invergo
Journal:  PLoS Comput Biol       Date:  2022-05-13       Impact factor: 4.779

4.  Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

Authors:  Hui Peng; Chaowang Lan; Yuansheng Liu; Tao Liu; Michael Blumenstein; Jinyan Li
Journal:  Oncotarget       Date:  2017-08-24

5.  Positive-unlabelled learning of glycosylation sites in the human proteome.

Authors:  Fuyi Li; Yang Zhang; Anthony W Purcell; Geoffrey I Webb; Kuo-Chen Chou; Trevor Lithgow; Chen Li; Jiangning Song
Journal:  BMC Bioinformatics       Date:  2019-03-06       Impact factor: 3.169

6.  KinPred: A unified and sustainable approach for harnessing proteome-level human kinase-substrate predictions.

Authors:  Bingjie Xue; Benjamin Jordan; Saqib Rizvi; Kristen M Naegle
Journal:  PLoS Comput Biol       Date:  2021-02-08       Impact factor: 4.475

7.  Predicting HIV-1 Protease Cleavage Sites With Positive-Unlabeled Learning.

Authors:  Zhenfeng Li; Lun Hu; Zehai Tang; Cheng Zhao
Journal:  Front Genet       Date:  2021-03-26       Impact factor: 4.599

8.  Unraveling Kinase Activation Dynamics Using Kinase-Substrate Relationships from Temporal Large-Scale Phosphoproteomics Studies.

Authors:  Westa Domanova; James Krycer; Rima Chaudhuri; Pengyi Yang; Fatemeh Vafaee; Daniel Fazakerley; Sean Humphrey; David James; Zdenka Kuncic
Journal:  PLoS One       Date:  2016-06-23       Impact factor: 3.240

9.  PhosphoPredict: A bioinformatics tool for prediction of human kinase-specific phosphorylation substrates and sites by integrating heterogeneous feature selection.

Authors:  Jiangning Song; Huilin Wang; Jiawei Wang; André Leier; Tatiana Marquez-Lago; Bingjiao Yang; Ziding Zhang; Tatsuya Akutsu; Geoffrey I Webb; Roger J Daly
Journal:  Sci Rep       Date:  2017-07-31       Impact factor: 4.379

10.  Global redox proteome and phosphoproteome analysis reveals redox switch in Akt.

Authors:  Zhiduan Su; James G Burchfield; Pengyi Yang; Sean J Humphrey; Guang Yang; Deanne Francis; Sabina Yasmin; Sung-Young Shin; Dougall M Norris; Alison L Kearney; Miro A Astore; Jonathan Scavuzzo; Kelsey H Fisher-Wellman; Qiao-Ping Wang; Benjamin L Parker; G Gregory Neely; Fatemeh Vafaee; Joyce Chiu; Reichelle Yeo; Philip J Hogg; Daniel J Fazakerley; Lan K Nguyen; Serdar Kuyucak; David E James
Journal:  Nat Commun       Date:  2019-12-02       Impact factor: 14.919

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.