Literature DB >> 35696076

A Pretrained ELECTRA Model for Kinase-Specific Phosphorylation Site Prediction.

Lei Jiang1, Duolin Wang1, Dong Xu2.   

Abstract

Phosphorylation plays a vital role in signal transduction and cell cycle. Identifying and understanding phosphorylation through machine-learning methods has a long history. However, existing methods only learn representations of a protein sequence segment from a labeled dataset itself, which could result in biased or incomplete features, especially for kinase-specific phosphorylation site prediction in which training data are typically sparse. To learn a comprehensive contextual representation of a protein sequence segment for kinase-specific phosphorylation site prediction, we pretrained our model from over 24 million unlabeled sequence fragments using ELECTRA (Efficiently Learning an Encoder that Classifies Token Replacements Accurately). The pretrained model was applied to kinase-specific site prediction of kinases CDK, PKA, CK2, MAPK, and PKC. The pretrained ELECTRA model achieves 9.02% improvement over BERT and 11.10% improvement over MusiteDeep in the area under the precision-recall curve on the benchmark data.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  Deep leaning; ELECTRA; Kinase-specific phosphorylation site prediction; Phosphorylation; Pretraining; Transformer

Mesh:

Substances:

Year:  2022        PMID: 35696076     DOI: 10.1007/978-1-0716-2317-6_4

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  1 in total

1.  Predicting and analyzing protein phosphorylation sites in plants using musite.

Authors:  Qiuming Yao; Jianjiong Gao; Curtis Bollinger; Jay J Thelen; Dong Xu
Journal:  Front Plant Sci       Date:  2012-08-21       Impact factor: 5.753

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.