| Literature DB >> 27104833 |
Yan Xu1, Jun Ding1, Ling-Yun Wu2.
Abstract
Cysteine S-sulfenylation is an important post-translational modification (PTM) in proteins, and provides redox regulation of protein functions. Bioinformatics and structural analyses indicated that S-sulfenylation could impact many biological and functional categories and had distinct structural features. However, major limitations for identifying cysteine S-sulfenylation were expensive and low-throughout. In view of this situation, the establishment of a useful computational method and the development of an efficient predictor are highly desired. In this study, a predictor iSulf-Cys which incorporated 14 kinds of physicochemical properties of amino acids was proposed. With the 10-fold cross-validation, the value of area under the curve (AUC) was 0.7155 ± 0.0085, MCC 0.3122 ± 0.0144 on the training dataset for 20 times. iSulf-Cys also showed satisfying performance in the independent testing dataset with AUC 0.7343 and MCC 0.3315. Features which were constructed from physicochemical properties and position were carefully analyzed. Meanwhile, a user-friendly web-server for iSulf-Cys is accessible at http://app.aporc.org/iSulf-Cys/.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27104833 PMCID: PMC4841585 DOI: 10.1371/journal.pone.0154237
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A diagram flow to illustrate the predicting procedure.
The number of positive and negative peptides in training and independent test dataset.
| S_tr | 900 | 6856 |
| S_te | 145 | 268 |
The number of dimensions of three feature constructions.
| 280 | 420 | 20 |
The 10-fold cross-validation results of three different feature constructions on the balanced training dataset.
The results have been run 20 times for every feature construction by SVM algorithm with g = 0.005 and cutoff = 0.5. The values are mean ± standard variance. The results of MDD-SOH were obtained in 5-fold cross-validation.
| PSAAP | 0.6233 ± 0.0054 | 31.34 ± 1.52 | 81.74 ± 0.75 | 56.54 ± 0.55 | 0.1515 ± 0.0114 |
| Binary | 0.7040 ± 0.0083 | 68.56 ± 0.47 | 63.11 ± 0.87 | 65.83 ± 0.67 | 0.3172 ± 0.0135 |
| AAIndex | 67.31 ± 0.73 | 63.89 ± 1.05 | 65.59 ± 0.72 | 0.3122 ± 0.0144 | |
| MDD-SOH | -- | 68 | 70 | 70 | 0.27 |
Fig 2(a)The 10-fold ROC curves of the three feature constructions on the balanced training dataset. (b) The 10-fold ROC curve of AAIndex feature construction on the independent test.
The 10-fold cross-validation results of independent test by SVM algorithm with g = 0.005 and cutoff = 0.5.
| 0.7343 | 68.97 | 65.67 | 66.83 | 0.3315 |
Fig 3(a) The amino acid composition Logo of S-sulfenylated peptides. (b) The amino acid composition Logo of non-S-sulfenylated peptides.
Fig 4The TwoSampleLogo between sulfenylation and non-sulfenylation peptides (p<0.01).
Fig 5The predictive IBS results of the online webserver.