Literature DB >> 30520972

Capsule network for protein post-translational modification site prediction.

Duolin Wang1,2, Yanchun Liang2,3, Dong Xu1,2.   

Abstract

MOTIVATION: Computational methods for protein post-translational modification (PTM) site prediction provide a useful approach for studying protein functions. The prediction accuracy of the existing methods has significant room for improvement. A recent deep-learning architecture, Capsule Network (CapsNet), which can characterize the internal hierarchical representation of input data, presents a great opportunity to solve this problem, especially using small training data.
RESULTS: We proposed a CapsNet for predicting protein PTM sites, including phosphorylation, N-linked glycosylation, N6-acetyllysine, methyl-arginine, S-palmitoyl-cysteine, pyrrolidone-carboxylic-acid and SUMOylation sites. The CapsNet outperformed the baseline convolutional neural network architecture MusiteDeep and other well-known tools in most cases and provided promising results for practical use, especially in learning from small training data. The capsule length also gives an accurate estimate for the confidence of the PTM prediction. We further demonstrated that the internal capsule features could be trained as a motif detector of phosphorylation sites when no kinase-specific phosphorylation labels were provided. In addition, CapsNet generates robust representations that have strong discriminant power in distinguishing kinase substrates from different kinase families. Our study sheds some light on the recognition mechanism of PTMs and applications of CapsNet on other bioinformatic problems.
AVAILABILITY AND IMPLEMENTATION: The codes are free to download from https://github.com/duolinwang/CapsNet_PTM. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30520972      PMCID: PMC6612812          DOI: 10.1093/bioinformatics/bty977

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  Clustering of highly homologous sequences to reduce the size of large protein databases.

Authors:  W Li; L Jaroszewski; A Godzik
Journal:  Bioinformatics       Date:  2001-03       Impact factor: 6.937

2.  Structural basis and prediction of substrate specificity in protein serine/threonine kinases.

Authors:  Ross I Brinkworth; Robert A Breinl; Bostjan Kobe
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-26       Impact factor: 11.205

3.  Musite, a tool for global prediction of general and kinase-specific phosphorylation sites.

Authors:  Jianjiong Gao; Jay J Thelen; A Keith Dunker; Dong Xu
Journal:  Mol Cell Proteomics       Date:  2010-08-11       Impact factor: 5.911

4.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

5.  PhosPred-RF: A Novel Sequence-Based Predictor for Phosphorylation Sites Using Sequential Information Only.

Authors:  Leyi Wei; Pengwei Xing; Jijun Tang; Quan Zou
Journal:  IEEE Trans Nanobioscience       Date:  2017-01-31       Impact factor: 2.935

6.  MusiteDeep: a deep-learning framework for general and kinase-specific phosphorylation site prediction.

Authors:  Duolin Wang; Shuai Zeng; Chunhui Xu; Wangren Qiu; Yanchun Liang; Trupti Joshi; Dong Xu
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

7.  Proteome-wide post-translational modification statistics: frequency analysis and curation of the swiss-prot database.

Authors:  George A Khoury; Richard C Baliban; Christodoulos A Floudas
Journal:  Sci Rep       Date:  2011-09-13       Impact factor: 4.379

8.  The Universal Protein Resource (UniProt).

Authors:  Amos Bairoch; Rolf Apweiler; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

10.  Database resources of the National Center for Biotechnology Information.

Authors:  David L Wheeler; Tanya Barrett; Dennis A Benson; Stephen H Bryant; Kathi Canese; Vyacheslav Chetvernin; Deanna M Church; Michael Dicuccio; Ron Edgar; Scott Federhen; Michael Feolo; Lewis Y Geer; Wolfgang Helmberg; Yuri Kapustin; Oleg Khovayko; David Landsman; David J Lipman; Thomas L Madden; Donna R Maglott; Vadim Miller; James Ostell; Kim D Pruitt; Gregory D Schuler; Martin Shumway; Edwin Sequeira; Steven T Sherry; Karl Sirotkin; Alexandre Souvorov; Grigory Starchenko; Roman L Tatusov; Tatiana A Tatusova; Lukas Wagner; Eugene Yaschenko
Journal:  Nucleic Acids Res       Date:  2007-11-27       Impact factor: 16.971

View more
  17 in total

1.  MusiteDeep: a deep-learning based webserver for protein post-translational modification site prediction and visualization.

Authors:  Duolin Wang; Dongpeng Liu; Jiakang Yuchi; Fei He; Yuexu Jiang; Siteng Cai; Jingyi Li; Dong Xu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

2.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.

Authors:  Fuyi Li; Jinxiang Chen; André Leier; Tatiana Marquez-Lago; Quanzhong Liu; Yanze Wang; Jerico Revote; A Ian Smith; Tatsuya Akutsu; Geoffrey I Webb; Lukasz Kurgan; Jiangning Song
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

Review 3.  Maximizing Depth of PTM Coverage: Generating Robust MS Datasets for Computational Prediction Modeling.

Authors:  Anthony A Iannetta; Leslie M Hicks
Journal:  Methods Mol Biol       Date:  2022

4.  Computational Prediction of Ubiquitination Proteins Using Evolutionary Profiles and Functional Domain Annotation.

Authors:  Wangren Qiu; Chunhui Xu; Xuan Xiao; Dong Xu
Journal:  Curr Genomics       Date:  2019-08       Impact factor: 2.236

5.  Identifying the sequence specificities of circRNA-binding proteins based on a capsule network architecture.

Authors:  Zhengfeng Wang; Xiujuan Lei
Journal:  BMC Bioinformatics       Date:  2021-01-07       Impact factor: 3.169

6.  Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean.

Authors:  Yang Liu; Duolin Wang; Fei He; Juexin Wang; Trupti Joshi; Dong Xu
Journal:  Front Genet       Date:  2019-11-22       Impact factor: 4.599

7.  CapsNet-SSP: multilane capsule network for predicting human saliva-secretory proteins.

Authors:  Wei Du; Yu Sun; Gaoyang Li; Huansheng Cao; Ran Pang; Ying Li
Journal:  BMC Bioinformatics       Date:  2020-06-09       Impact factor: 3.169

8.  iProDNA-CapsNet: identifying protein-DNA binding residues using capsule neural networks.

Authors:  Binh P Nguyen; Quang H Nguyen; Giang-Nam Doan-Ngoc; Thanh-Hoang Nguyen-Vo; Susanto Rahardja
Journal:  BMC Bioinformatics       Date:  2019-12-27       Impact factor: 3.169

9.  Capsule Networks Showed Excellent Performance in the Classification of hERG Blockers/Nonblockers.

Authors:  Yiwei Wang; Lei Huang; Siwen Jiang; Yifei Wang; Jun Zou; Hongguang Fu; Shengyong Yang
Journal:  Front Pharmacol       Date:  2020-01-28       Impact factor: 5.810

10.  UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites.

Authors:  Arslan Siraj; Dae Yeong Lim; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-05-11       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.