Literature DB >> 30184176

Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.

Fuyi Li1, Yanan Wang1,2, Chen Li1,3, Tatiana T Marquez-Lago4, André Leier4, Neil D Rawlings5, Gholamreza Haffari6, Jerico Revote1, Tatsuya Akutsu7, Kuo-Chen Chou8,9, Anthony W Purcell1, Robert N Pike10,11, Geoffrey I Webb6, A Ian Smith1,11, Trevor Lithgow12, Roger J Daly1, James C Whisstock1,11, Jiangning Song1,6,11.   

Abstract

The roles of proteolytic cleavage have been intensively investigated and discussed during the past two decades. This irreversible chemical process has been frequently reported to influence a number of crucial biological processes (BPs), such as cell cycle, protein regulation and inflammation. A number of advanced studies have been published aiming at deciphering the mechanisms of proteolytic cleavage. Given its significance and the large number of functionally enriched substrates targeted by specific proteases, many computational approaches have been established for accurate prediction of protease-specific substrates and their cleavage sites. Consequently, there is an urgent need to systematically assess the state-of-the-art computational approaches for protease-specific cleavage site prediction to further advance the existing methodologies and to improve the prediction performance. With this goal in mind, in this article, we carefully evaluated a total of 19 computational methods (including 8 scoring function-based methods and 11 machine learning-based methods) in terms of their underlying algorithm, calculated features, performance evaluation and software usability. Then, extensive independent tests were performed to assess the robustness and scalability of the reviewed methods using our carefully prepared independent test data sets with 3641 cleavage sites (specific to 10 proteases). The comparative experimental results demonstrate that PROSPERous is the most accurate generic method for predicting eight protease-specific cleavage sites, while GPS-CCD and LabCaS outperformed other predictors for calpain-specific cleavage sites. Based on our review, we then outlined some potential ways to improve the prediction performance and ease the computational burden by applying ensemble learning, deep learning, positive unlabeled learning and parallel and distributed computing techniques. We anticipate that our study will serve as a practical and useful guide for interested readers to further advance next-generation bioinformatics tools for protease-specific cleavage site prediction.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  bioinformatics; machine learning; prediction model; protease; sequence analysis; substrate cleavage; substrate specificity

Year:  2019        PMID: 30184176      PMCID: PMC6954447          DOI: 10.1093/bib/bby077

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  97 in total

1.  CASVM: web server for SVM-based prediction of caspase substrates cleavage sites.

Authors:  Lawrence J K Wee; Tin Wee Tan; Shoba Ranganathan
Journal:  Bioinformatics       Date:  2007-06-28       Impact factor: 6.937

2.  'Big data', Hadoop and cloud computing in genomics.

Authors:  Aisling O'Driscoll; Jurate Daugelaite; Roy D Sleator
Journal:  J Biomed Inform       Date:  2013-07-18       Impact factor: 6.317

Review 3.  Regulated intramembrane proteolysis: signaling pathways and biological functions.

Authors:  Mark Lal; Michael Caplan
Journal:  Physiology (Bethesda)       Date:  2011-02

4.  Inflammation, metalloproteinases, and increased proteolysis: an emerging pathophysiological paradigm in aortic aneurysm.

Authors:  P K Shah
Journal:  Circulation       Date:  1997-10-07       Impact factor: 29.690

Review 5.  Proteolysis-Targeting Chimeras: Induced Protein Degradation as a Therapeutic Strategy.

Authors:  Philipp Ottis; Craig M Crews
Journal:  ACS Chem Biol       Date:  2017-03-20       Impact factor: 5.100

6.  PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy.

Authors:  Jiangning Song; Fuyi Li; André Leier; Tatiana T Marquez-Lago; Tatsuya Akutsu; Gholamreza Haffari; Kuo-Chen Chou; Geoffrey I Webb; Robert N Pike; John Hancock
Journal:  Bioinformatics       Date:  2018-02-15       Impact factor: 6.937

7.  DeepSol: a deep learning framework for sequence-based protein solubility prediction.

Authors:  Sameer Khurana; Reda Rawi; Khalid Kunji; Gwo-Yu Chuang; Halima Bensmail; Raghvendra Mall
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

8.  Structural determinants of limited proteolysis.

Authors:  Marat D Kazanov; Yoshinobu Igarashi; Alexey M Eroshkin; Piotr Cieplak; Boris Ratnikov; Ying Zhang; Zhanwen Li; Adam Godzik; Andrei L Osterman; Jeffrey W Smith
Journal:  J Proteome Res       Date:  2011-07-08       Impact factor: 4.466

9.  GlycoMinestruct: a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features.

Authors:  Fuyi Li; Chen Li; Jerico Revote; Yang Zhang; Geoffrey I Webb; Jian Li; Jiangning Song; Trevor Lithgow
Journal:  Sci Rep       Date:  2016-10-06       Impact factor: 4.379

Review 10.  iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.

Authors:  Jiangning Song; Yanan Wang; Fuyi Li; Tatsuya Akutsu; Neil D Rawlings; Geoffrey I Webb; Kuo-Chen Chou
Journal:  Brief Bioinform       Date:  2019-03-25       Impact factor: 11.622

View more
  18 in total

1.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

2.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

3.  A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction.

Authors:  Shutao Mei; Fuyi Li; André Leier; Tatiana T Marquez-Lago; Kailin Giam; Nathan P Croft; Tatsuya Akutsu; A Ian Smith; Jian Li; Jamie Rossjohn; Anthony W Purcell; Jiangning Song
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

4.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

5.  Deep Learning-Based Advances In Protein Posttranslational Modification Site and Protein Cleavage Prediction.

Authors:  Subash C Pakhrin; Suresh Pokharel; Hiroto Saigo; Dukka B Kc
Journal:  Methods Mol Biol       Date:  2022

Review 6.  Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.

Authors:  Xiao Liang; Fuyi Li; Jinxiang Chen; Junlong Li; Hao Wu; Shuqin Li; Jiangning Song; Quanzhong Liu
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

7.  Positive-unlabelled learning of glycosylation sites in the human proteome.

Authors:  Fuyi Li; Yang Zhang; Anthony W Purcell; Geoffrey I Webb; Kuo-Chen Chou; Trevor Lithgow; Chen Li; Jiangning Song
Journal:  BMC Bioinformatics       Date:  2019-03-06       Impact factor: 3.169

8.  Software-aided workflow for predicting protease-specific cleavage sites using physicochemical properties of the natural and unnatural amino acids in peptide-based drug discovery.

Authors:  Tatiana Radchenko; Fabien Fontaine; Luca Morettoni; Ismael Zamora
Journal:  PLoS One       Date:  2019-01-08       Impact factor: 3.240

Review 9.  Mouse Models of Human Proprotein Convertase Insufficiency.

Authors:  Manita Shakya; Iris Lindberg
Journal:  Endocr Rev       Date:  2021-05-25       Impact factor: 19.871

Review 10.  Structure, Application, and Biochemistry of Microbial Keratinases.

Authors:  Qingxin Li
Journal:  Front Microbiol       Date:  2021-06-23       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.