Literature DB >> 25843215

Accurate in silico identification of protein succinylation sites using an iterative semi-supervised learning technique.

Xiaowei Zhao1, Qiao Ning2, Haiting Chai2, Zhiqiang Ma3.   

Abstract

As a widespread type of protein post-translational modifications (PTMs), succinylation plays an important role in regulating protein conformation, function and physicochemical properties. Compared with the labor-intensive and time-consuming experimental approaches, computational predictions of succinylation sites are much desirable due to their convenient and fast speed. Currently, numerous computational models have been developed to identify PTMs sites through various types of two-class machine learning algorithms. These methods require both positive and negative samples for training. However, designation of the negative samples of PTMs was difficult and if it is not properly done can affect the performance of computational models dramatically. So that in this work, we implemented the first application of positive samples only learning (PSoL) algorithm to succinylation sites prediction problem, which was a special class of semi-supervised machine learning that used positive samples and unlabeled samples to train the model. Meanwhile, we proposed a novel succinylation sites computational predictor called SucPred (succinylation site predictor) by using multiple feature encoding schemes. Promising results were obtained by the SucPred predictor with an accuracy of 88.65% using 5-fold cross validation on the training dataset and an accuracy of 84.40% on the independent testing dataset, which demonstrated that the positive samples only learning algorithm presented here was particularly useful for identification of protein succinylation sites. Besides, the positive samples only learning algorithm can be applied to build predictors for other types of PTMs sites with ease. A web server for predicting succinylation sites was developed and was freely accessible at http://59.73.198.144:8088/SucPred/.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Keywords:  Multiple features; Positive samples only learning; Succinylated proteins

Mesh:

Substances:

Year:  2015        PMID: 25843215     DOI: 10.1016/j.jtbi.2015.03.029

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  16 in total

1.  iSulf-Cys: Prediction of S-sulfenylation Sites in Proteins with Physicochemical Properties of Amino Acids.

Authors:  Yan Xu; Jun Ding; Ling-Yun Wu
Journal:  PLoS One       Date:  2016-04-22       Impact factor: 3.240

2.  CIPPN: computational identification of protein pupylation sites by using neural network.

Authors:  Wenzheng Bao; Zhu-Hong You; De-Shuang Huang
Journal:  Oncotarget       Date:  2017-11-06

3.  Success: evolutionary and structural properties of amino acids prove effective for succinylation site prediction.

Authors:  Yosvany López; Alok Sharma; Abdollah Dehzangi; Sunil Pranit Lal; Ghazaleh Taherzadeh; Abdul Sattar; Tatsuhiko Tsunoda
Journal:  BMC Genomics       Date:  2018-01-19       Impact factor: 3.969

4.  Improving succinylation prediction accuracy by incorporating the secondary structure via helix, strand and coil, and evolutionary information from profile bigrams.

Authors:  Abdollah Dehzangi; Yosvany López; Sunil Pranit Lal; Ghazaleh Taherzadeh; Abdul Sattar; Tatsuhiko Tsunoda; Alok Sharma
Journal:  PLoS One       Date:  2018-02-12       Impact factor: 3.240

5.  Detecting Succinylation sites from protein sequences using ensemble support vector machine.

Authors:  Qiao Ning; Xiaosa Zhao; Lingling Bao; Zhiqiang Ma; Xiaowei Zhao
Journal:  BMC Bioinformatics       Date:  2018-06-25       Impact factor: 3.169

6.  GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.

Authors:  Md Mehedi Hasan; Hiroyuki Kurata
Journal:  PLoS One       Date:  2018-10-12       Impact factor: 3.240

Review 7.  Large-Scale Assessment of Bioinformatics Tools for Lysine Succinylation Sites.

Authors:  Md Mehedi Hasan; Mst Shamima Khatun; Hiroyuki Kurata
Journal:  Cells       Date:  2019-01-28       Impact factor: 6.600

8.  Characterization and Identification of Lysine Succinylation Sites based on Deep Learning Method.

Authors:  Kai-Yao Huang; Justin Bo-Kai Hsu; Tzong-Yi Lee
Journal:  Sci Rep       Date:  2019-11-07       Impact factor: 4.379

9.  LSTMCNNsucc: A Bidirectional LSTM and CNN-Based Deep Learning Method for Predicting Lysine Succinylation Sites.

Authors:  Guohua Huang; Qingfeng Shen; Guiyang Zhang; Pan Wang; Zu-Guo Yu
Journal:  Biomed Res Int       Date:  2021-05-28       Impact factor: 3.411

10.  A systematic identification of species-specific protein succinylation sites using joint element features information.

Authors:  Md Mehedi Hasan; Mst Shamima Khatun; Md Nurul Haque Mollah; Cao Yong; Dianjing Guo
Journal:  Int J Nanomedicine       Date:  2017-08-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.