Literature DB >> 27197054

Predicting pupylation sites in prokaryotic proteins using semi-supervised self-training support vector machine algorithm.

Zhe Ju1, Hong Gu2.   

Abstract

As one important post-translational modification of prokaryotic proteins, pupylation plays a key role in regulating various biological processes. The accurate identification of pupylation sites is crucial for understanding the underlying mechanisms of pupylation. Although several computational methods have been developed for the identification of pupylation sites, the prediction accuracy of them is still unsatisfactory. Here, a novel bioinformatics tool named IMP-PUP is proposed to improve the prediction of pupylation sites. IMP-PUP is constructed on the composition of k-spaced amino acid pairs and trained with a modified semi-supervised self-training support vector machine (SVM) algorithm. The proposed algorithm iteratively trains a series of support vector machine classifiers on both annotated and non-annotated pupylated proteins. Computational results show that IMP-PUP achieves the area under receiver operating characteristic curves of 0.91, 0.73, and 0.75 on our training set, Tung's testing set, and our testing set, respectively, which are better than those of the different error costs SVM algorithm and the original self-training SVM algorithm. Independent tests also show that IMP-PUP significantly outperforms three other existing pupylation site predictors: GPS-PUP, iPUP, and pbPUP. Therefore, IMP-PUP can be a useful tool for accurate prediction of pupylation sites. A MATLAB software package for IMP-PUP is available at https://juzhe1120.github.io/.
Copyright © 2016 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Post-translational modification; Pupylation; Semi-supervised learning; Support vector machine; k-spaced amino acid pair

Mesh:

Substances:

Year:  2016        PMID: 27197054     DOI: 10.1016/j.ab.2016.05.005

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  7 in total

1.  pSuc-FFSEA: Predicting Lysine Succinylation Sites in Proteins Based on Feature Fusion and Stacking Ensemble Algorithm.

Authors:  Jianhua Jia; Genqiang Wu; Wangren Qiu
Journal:  Front Cell Dev Biol       Date:  2022-05-24

2.  EPuL: An Enhanced Positive-Unlabeled Learning Algorithm for the Prediction of Pupylation Sites.

Authors:  Xuanguo Nan; Lingling Bao; Xiaosa Zhao; Xiaowei Zhao; Arun Kumar Sangaiah; Gai-Ge Wang; Zhiqiang Ma
Journal:  Molecules       Date:  2017-09-05       Impact factor: 4.411

3.  PhoglyStruct: Prediction of phosphoglycerylated lysine residues using structural properties of amino acids.

Authors:  Abel Chandra; Alok Sharma; Abdollah Dehzangi; Shoba Ranganathan; Anjeela Jokhan; Kuo-Chen Chou; Tatsuhiko Tsunoda
Journal:  Sci Rep       Date:  2018-12-18       Impact factor: 4.379

4.  Bigram-PGK: phosphoglycerylation prediction using the technique of bigram probabilities of position specific scoring matrix.

Authors:  Abel Chandra; Alok Sharma; Abdollah Dehzangi; Daichi Shigemizu; Tatsuhiko Tsunoda
Journal:  BMC Mol Cell Biol       Date:  2019-12-20

5.  RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix.

Authors:  Abel Avitesh Chandra; Alok Sharma; Abdollah Dehzangi; Tatushiko Tsunoda
Journal:  Genes (Basel)       Date:  2020-12-20       Impact factor: 4.096

6.  PupStruct: Prediction of Pupylated Lysine Residues Using Structural Properties of Amino Acids.

Authors:  Vineet Singh; Alok Sharma; Abdollah Dehzangi; Tatushiko Tsunoda
Journal:  Genes (Basel)       Date:  2020-11-28       Impact factor: 4.096

7.  Positive-Unlabeled Learning for Pupylation Sites Prediction.

Authors:  Ming Jiang; Jun-Zhe Cao
Journal:  Biomed Res Int       Date:  2016-08-07       Impact factor: 3.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.