Yan Xu1, Li Li2, Jun Ding3, Ling-Yun Wu4, Guoqin Mai5, Fengfeng Zhou6. 1. Department of Information and Computer Science, University of Science and Technology Beijing, Beijing 100083, China. Electronic address: xuyan@ustb.edu.cn. 2. Department of Information and Computer Science, University of Science and Technology Beijing, Beijing 100083, China. Electronic address: s20157158@ustb.edu.cn. 3. Department of Information and Computer Science, University of Science and Technology Beijing, Beijing 100083, China. Electronic address: artin@ustb.edu.cn. 4. Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China. Electronic address: wulingyun@gmail.com. 5. Center for Synthetic Biology Engineering Research, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China. Electronic address: gq.mai@siat.ac.cn. 6. College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin 130012, China.. Electronic address: ffzhou@jlu.edu.cn.
Abstract
BACKGROUND: Similar to the regular enzymatic glycosylation, glycation also attaches a sugar molecule to a peptide, but does not need the help of an enzyme. Glycation may occur both inside and outside the host body, and will compete with the glycosylation procedure for functional regulation of mature protein products. The glycated residues do not show significant patterns, which make both in silico sequence-level predictors and wet-lab validations a major challenge. This study hypothesizes that a better feature set formulated from the glycated flanking peptides may lead to a good glycation prediction program. RESULTS: We explored the application of sequence order information and position specific amino acid propensity (PSAAP) in the glycation residue prediction problem. The PSAAP demonstrated its ability to discriminate the glycated residues from the background control peptides. A Support Vector Machine (SVM) model was constructed from the training dataset and achieved 68.91% in the overall accuracy. The model also achieves 0.7258 and 0.3198 in the Area under the ROC and Matthew's Correlation Coefficient, respectively. The user-friendly online version of the proposed algorithm may be found on the web server Gly-PseAAC at http://app.aporc.org/Gly-PseAAC/. CONCLUSION: The feature set PSAAP was calculated and led to a useful classification of glycation residues.
BACKGROUND: Similar to the regular enzymatic glycosylation, glycation also attaches a sugar molecule to a peptide, but does not need the help of an enzyme. Glycation may occur both inside and outside the host body, and will compete with the glycosylation procedure for functional regulation of mature protein products. The glycated residues do not show significant patterns, which make both in silico sequence-level predictors and wet-lab validations a major challenge. This study hypothesizes that a better feature set formulated from the glycated flanking peptides may lead to a good glycation prediction program. RESULTS: We explored the application of sequence order information and position specific amino acid propensity (PSAAP) in the glycation residue prediction problem. The PSAAP demonstrated its ability to discriminate the glycated residues from the background control peptides. A Support Vector Machine (SVM) model was constructed from the training dataset and achieved 68.91% in the overall accuracy. The model also achieves 0.7258 and 0.3198 in the Area under the ROC and Matthew's Correlation Coefficient, respectively. The user-friendly online version of the proposed algorithm may be found on the web server Gly-PseAAC at http://app.aporc.org/Gly-PseAAC/. CONCLUSION: The feature set PSAAP was calculated and led to a useful classification of glycation residues.
Authors: Viraj R Sanghvi; Josef Leibold; Marco Mina; Prathibha Mohan; Marjan Berishaj; Zhuoning Li; Matthew M Miele; Nathalie Lailler; Chunying Zhao; Elisa de Stanchina; Agnes Viale; Leila Akkari; Scott W Lowe; Giovanni Ciriello; Ronald C Hendrickson; Hans-Guido Wendel Journal: Cell Date: 2019-08-08 Impact factor: 66.850
Authors: Jared A Delmar; Jihong Wang; Seo Woo Choi; Jason A Martins; John P Mikhail Journal: Mol Ther Methods Clin Dev Date: 2019-10-01 Impact factor: 6.698