Literature DB >> 30285084

Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.

Zhen Chen1, Xuhan Liu2, Fuyi Li3,4, Chen Li3,5, Tatiana Marquez-Lago6,7, André Leier6,7, Tatsuya Akutsu8, Geoffrey I Webb9, Dakang Xu10,11, Alexander Ian Smith3,4, Lei Li1, Kuo-Chen Chou12,13, Jiangning Song3,4,9.   

Abstract

Lysine post-translational modifications (PTMs) play a crucial role in regulating diverse functions and biological processes of proteins. However, because of the large volumes of sequencing data generated from genome-sequencing projects, systematic identification of different types of lysine PTM substrates and PTM sites in the entire proteome remains a major challenge. In recent years, a number of computational methods for lysine PTM identification have been developed. These methods show high diversity in their core algorithms, features extracted and feature selection techniques and evaluation strategies. There is therefore an urgent need to revisit these methods and summarize their methodologies, to improve and further develop computational techniques to identify and characterize lysine PTMs from the large amounts of sequence data. With this goal in mind, we first provide a comprehensive survey on a large collection of 49 state-of-the-art approaches for lysine PTM prediction. We cover a variety of important aspects that are crucial for the development of successful predictors, including operating algorithms, sequence and structural features, feature selection, model performance evaluation and software utility. We further provide our thoughts on potential strategies to improve the model performance. Second, in order to examine the feasibility of using deep learning for lysine PTM prediction, we propose a novel computational framework, termed MUscADEL (Multiple Scalable Accurate Deep Learner for lysine PTMs), using deep, bidirectional, long short-term memory recurrent neural networks for accurate and systematic mapping of eight major types of lysine PTMs in the human and mouse proteomes. Extensive benchmarking tests show that MUscADEL outperforms current methods for lysine PTM characterization, demonstrating the potential and power of deep learning techniques in protein PTM prediction. The web server of MUscADEL, together with all the data sets assembled in this study, is freely available at http://muscadel.erc.monash.edu/. We anticipate this comprehensive review and the application of deep learning will provide practical guide and useful insights into PTM prediction and inspire future bioinformatics studies in the related fields.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; feature engineering; lysine post-translational modification; prediction model; sequence features

Year:  2019        PMID: 30285084      PMCID: PMC6954452          DOI: 10.1093/bib/bby089

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  144 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 3.  Recent progress in protein subcellular location prediction.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  Anal Biochem       Date:  2007-07-12       Impact factor: 3.365

Review 4.  Nonproteolytic functions of ubiquitin in cell signaling.

Authors:  Zhijian J Chen; Lijun J Sun
Journal:  Mol Cell       Date:  2009-02-13       Impact factor: 17.970

Review 5.  Ubiquitin and ubiquitin-like proteins as multifunctional signals.

Authors:  Rebecca L Welchman; Colin Gordon; R John Mayer
Journal:  Nat Rev Mol Cell Biol       Date:  2005-08       Impact factor: 94.444

6.  ZincExplorer: an accurate hybrid method to improve the prediction of zinc-binding sites from protein sequences.

Authors:  Zhen Chen; Yanying Wang; Ya-Feng Zhai; Jiangning Song; Ziding Zhang
Journal:  Mol Biosyst       Date:  2013-09

Review 7.  Acetylation of non-histone proteins modulates cellular signalling at multiple levels.

Authors:  Stephanie Spange; Tobias Wagner; Thorsten Heinzel; Oliver H Krämer
Journal:  Int J Biochem Cell Biol       Date:  2008-09-02       Impact factor: 5.085

8.  Accurate in silico prediction of species-specific methylation sites based on information gain feature optimization.

Authors:  Ping-Ping Wen; Shao-Ping Shi; Hao-Dong Xu; Li-Na Wang; Jian-Ding Qiu
Journal:  Bioinformatics       Date:  2016-06-26       Impact factor: 6.937

9.  Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks.

Authors:  Jack Hanson; Yuedong Yang; Kuldip Paliwal; Yaoqi Zhou
Journal:  Bioinformatics       Date:  2017-03-01       Impact factor: 6.937

10.  PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2007-10-13       Impact factor: 3.365

View more
  27 in total

1.  i6mA-VC: A Multi-Classifier Voting Method for the Computational Identification of DNA N6-methyladenine Sites.

Authors:  Tian Xue; Shengli Zhang; Huijuan Qiao
Journal:  Interdiscip Sci       Date:  2021-04-08       Impact factor: 2.233

2.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

3.  A deep dense inception network for protein beta-turn prediction.

Authors:  Chao Fang; Yi Shang; Dong Xu
Journal:  Proteins       Date:  2019-07-23

4.  DeepBL: a deep learning-based approach for in silico discovery of beta-lactamases.

Authors:  Yanan Wang; Fuyi Li; Manasa Bharathwaj; Natalia C Rosas; André Leier; Tatsuya Akutsu; Geoffrey I Webb; Tatiana T Marquez-Lago; Jian Li; Trevor Lithgow; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

5.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

6.  Deep Learning-Based Advances In Protein Posttranslational Modification Site and Protein Cleavage Prediction.

Authors:  Subash C Pakhrin; Suresh Pokharel; Hiroto Saigo; Dukka B Kc
Journal:  Methods Mol Biol       Date:  2022

7.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

8.  HSM6AP: a high-precision predictor for the Homo sapiens N6-methyladenosine (m^6 A) based on multiple weights and feature stitching.

Authors:  Jing Li; Shida He; Fei Guo; Quan Zou
Journal:  RNA Biol       Date:  2021-02-12       Impact factor: 4.652

9.  IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.

Authors:  Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hiroyuki Kurata
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

Review 10.  Functions and Mechanisms of Lysine Glutarylation in Eukaryotes.

Authors:  Longxiang Xie; Yafei Xiao; Fucheng Meng; Yongqiang Li; Zhenyu Shi; Keli Qian
Journal:  Front Cell Dev Biol       Date:  2021-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.