Literature DB >> 33672741

PUP-Fuse: Prediction of Protein Pupylation Sites by Integrating Multiple Sequence Representations.

Firda Nurul Auliah1, Andi Nur Nilamyani1, Watshara Shoombuatong2, Md Ashad Alam3, Md Mehedi Hasan1,4, Hiroyuki Kurata1.   

Abstract

Pupylation is a type of reversible post-translational modification of proteins, which plays a key role in the cellular function of microbial organisms. Several proteomics methods have been developed for the prediction and analysis of pupylated proteins and pupylation sites. However, the traditional experimental methods are laborious and time-consuming. Hence, computational algorithms are highly needed that can predict potential pupylation sites using sequence features. In this research, a new prediction model, PUP-Fuse, has been developed for pupylation site prediction by integrating multiple sequence representations. Meanwhile, we explored the five types of feature encoding approaches and three machine learning (ML) algorithms. In the final model, we integrated the successive ML scores using a linear regression model. The PUP-Fuse achieved a Mathew correlation value of 0.768 by a 10-fold cross-validation test. It also outperformed existing predictors in an independent test. The web server of the PUP-Fuse with curated datasets is freely available.

Entities:  

Keywords:  chi-squared; feature encoding; machine learning; pupylation

Mesh:

Substances:

Year:  2021        PMID: 33672741      PMCID: PMC7924619          DOI: 10.3390/ijms22042120

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   5.923


  62 in total

1.  iBitter-SCM: Identification and characterization of bitter peptides using a scoring card method with propensity scores of dipeptides.

Authors:  Phasit Charoenkwan; Janchai Yana; Nalini Schaduangrat; Chanin Nantasenamat; Md Mehedi Hasan; Watshara Shoombuatong
Journal:  Genomics       Date:  2020-03-28       Impact factor: 5.736

2.  iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features.

Authors:  Dan Zhang; Zhao-Chun Xu; Wei Su; Yu-He Yang; Hao Lv; Hui Yang; Hao Lin
Journal:  Bioinformatics       Date:  2021-04-19       Impact factor: 6.937

3.  Critical evaluation of web-based DNA N6-methyladenine site prediction tools.

Authors:  Md Mehedi Hasan; Watshara Shoombuatong; Hiroyuki Kurata; Balachandran Manavalan
Journal:  Brief Funct Genomics       Date:  2021-07-17       Impact factor: 4.241

Review 4.  Pupylation versus ubiquitylation: tagging for proteasome-dependent degradation.

Authors:  Kristin E Burns; K Heran Darwin
Journal:  Cell Microbiol       Date:  2010-01-26       Impact factor: 3.715

Review 5.  Systematic analysis and prediction of pupylation sites in prokaryotic proteins.

Authors:  Xiang Chen; Jian-Ding Qiu; Shao-Ping Shi; Sheng-Bao Suo; Ru-Ping Liang
Journal:  PLoS One       Date:  2013-09-03       Impact factor: 3.240

6.  GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.

Authors:  Md Mehedi Hasan; Hiroyuki Kurata
Journal:  PLoS One       Date:  2018-10-12       Impact factor: 3.240

7.  Computational identification of microbial phosphorylation sites by the enhanced characteristics of sequence information.

Authors:  Md Mehedi Hasan; Md Mamunur Rashid; Mst Shamima Khatun; Hiroyuki Kurata
Journal:  Sci Rep       Date:  2019-06-04       Impact factor: 4.379

8.  CD-HIT Suite: a web server for clustering and comparing biological sequences.

Authors:  Ying Huang; Beifang Niu; Ying Gao; Limin Fu; Weizhong Li
Journal:  Bioinformatics       Date:  2010-01-06       Impact factor: 6.937

9.  iUmami-SCM: A Novel Sequence-Based Predictor for Prediction and Analysis of Umami Peptides Using a Scoring Card Method with Propensity Scores of Dipeptides.

Authors:  Phasit Charoenkwan; Janchai Yana; Chanin Nantasenamat; Md Mehedi Hasan; Watshara Shoombuatong
Journal:  J Chem Inf Model       Date:  2020-10-23       Impact factor: 4.956

10.  AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Leyi Wei; Gwang Lee
Journal:  Comput Struct Biotechnol J       Date:  2019-07-03       Impact factor: 7.271

View more
  3 in total

1.  Editorial of Special Issue "Deep Learning and Machine Learning in Bioinformatics".

Authors:  Mingon Kang; Jung Hun Oh
Journal:  Int J Mol Sci       Date:  2022-06-14       Impact factor: 6.208

2.  Identifying Pupylation Proteins and Sites by Incorporating Multiple Methods.

Authors:  Wang-Ren Qiu; Meng-Yue Guan; Qian-Kun Wang; Li-Liang Lou; Xuan Xiao
Journal:  Front Endocrinol (Lausanne)       Date:  2022-04-26       Impact factor: 6.055

3.  PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features.

Authors:  Andi Nur Nilamyani; Firda Nurul Auliah; Mohammad Ali Moni; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Int J Mol Sci       Date:  2021-03-08       Impact factor: 5.923

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.