Literature DB >> 32569263

Assessing predictors for new post translational modification sites: A case study on hydroxylation.

Damiano Piovesan1, Andras Hatos1, Giovanni Minervini1, Federica Quaglia1, Alexander Miguel Monzon1, Silvio C E Tosatto1.   

Abstract

Post-translational modification (PTM) sites have become popular for predictor development. However, with the exception of phosphorylation and a handful of other examples, PTMs suffer from a limited number of available training examples and sparsity in protein sequences. Here, proline hydroxylation is taken as an example to compare different methods and evaluate their performance on new experimentally determined sites. As a guide for effective experimental design, predictors require both high specificity and sensitivity. However, the self-reported performance may often not be indicative of prediction quality and detection of new sites is not guaranteed. We have benchmarked seven published hydroxylation site predictors on two newly constructed independent datasets. The self-reported performance is found to widely overestimate the real accuracy measured on independent datasets. No predictor performs better than random on new examples, indicating the refined models do not sufficiently generalize to detect new sites. The number of false positives is high and precision low, in particular for non-collagen proteins whose motifs are not conserved. As hydroxylation site predictors do not generalize for new data, caution is advised when using PTM predictors in the absence of independent evaluations, in particular for highly specific sites involved in signalling.

Entities:  

Year:  2020        PMID: 32569263     DOI: 10.1371/journal.pcbi.1007967

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  5 in total

1.  DOME: recommendations for supervised machine learning validation in biology.

Authors:  Ian Walsh; Dmytro Fishman; Dario Garcia-Gasulla; Tiina Titma; Gianluca Pollastri; Jennifer Harrow; Fotis E Psomopoulos; Silvio C E Tosatto
Journal:  Nat Methods       Date:  2021-07-27       Impact factor: 28.547

Review 2.  Current status of PTMs structural databases: applications, limitations and prospects.

Authors:  Alexandre G de Brevern; Joseph Rebehmed
Journal:  Amino Acids       Date:  2022-01-12       Impact factor: 3.520

3.  Deep Learning Approaches for Detection of Breast Adenocarcinoma Causing Carcinogenic Mutations.

Authors:  Asghar Ali Shah; Fahad Alturise; Tamim Alkhalifah; Yaser Daanial Khan
Journal:  Int J Mol Sci       Date:  2022-09-29       Impact factor: 6.208

Review 4.  In silico prediction of post-translational modifications in therapeutic antibodies.

Authors:  Shabdita Vatsa
Journal:  MAbs       Date:  2022 Jan-Dec       Impact factor: 5.857

5.  Development of an experiment-split method for benchmarking the generalization of a PTM site predictor: Lysine methylome as an example.

Authors:  Guoyang Zou; Yang Zou; Chenglong Ma; Jiaojiao Zhao; Lei Li
Journal:  PLoS Comput Biol       Date:  2021-12-08       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.