Literature DB >> 28411111

S-SulfPred: A sensitive predictor to capture S-sulfenylation sites based on a resampling one-sided selection undersampling-synthetic minority oversampling technique.

Cangzhi Jia1, Yun Zuo2.   

Abstract

Protein S-sulfenylation is a reversible post-translational modification involving covalent attachment of hydroxide to the thiol group of cysteine residues, which is involved in various biological processes including cell signaling, response to stress and protein functions. Herein we present S-SulfPred, a support vector machine based model to capture potential S-sulfenylation sites and improve the efficiency and relevance of experimental identification of protein S-sulfenylation sites. One-sided selection (OSS) undersampling and synthetic minority oversampling technique (SMOTE) oversampling were combined to establish balanced training datasets. This approach is shown to perform better than using only OSS or SMOTE in an independent test. The best combination of position-specific amino acid propensity and five physicochemical properties of amino acids were selected to optimize the predictor performance. Using S-SulfPred, we achieve an average sensitivity of 74.62%, and an average specificity of 71.62% on independent datasets. Compared with other published tools, S-SulfPred attains both higher sensitivity and specificity. We not only propose a highly accurate method to predict protein S-sulfenylation sites, but also provide insights that could improve the efficiency of other bioinformatics tools.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  OSS; PSAAP; Prediction; S-sulfenylation; SMOTE; SVM

Mesh:

Substances:

Year:  2017        PMID: 28411111     DOI: 10.1016/j.jtbi.2017.03.031

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  4 in total

1.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

2.  Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.

Authors:  Wakil Ahmad; Easin Arafat; Ghazaleh Taherzadeh; Alok Sharma; Shubhashis Roy Dipta; Abdollah Dehzangi; Swakkhar Shatabda
Journal:  IEEE Access       Date:  2020-04-22       Impact factor: 3.367

3.  2lpiRNApred: a two-layered integrated algorithm for identifying piRNAs and their functions based on LFE-GM feature selection.

Authors:  Yun Zuo; Quan Zou; Jianyuan Lin; Min Jiang; Xiangrong Liu
Journal:  RNA Biol       Date:  2020-03-05       Impact factor: 4.652

4.  Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features.

Authors:  Md Easin Arafat; Md Wakil Ahmad; S M Shovan; Abdollah Dehzangi; Shubhashis Roy Dipta; Md Al Mehedi Hasan; Ghazaleh Taherzadeh; Swakkhar Shatabda; Alok Sharma
Journal:  Genes (Basel)       Date:  2020-08-31       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.