Literature DB >> 30590442

PredGly: predicting lysine glycation sites for Homo sapiens based on XGboost feature optimization.

Jialin Yu1, Shaoping Shi1, Fang Zhang1, Guodong Chen1, Man Cao1.   

Abstract

MOTIVATION: Protein glycation is a familiar post-translational modification (PTM) which is a two-step non-enzymatic reaction. Glycation not only impairs the function but also changes the characteristics of the proteins so that it is related to many human diseases. It is still much more difficult to systematically detect glycation sites due to the glycated residues without crucial patterns. Computational approaches, which can filter supposed sites prior to experimental verification, can extremely increase the efficiency of experiment work. However, the previous lysine glycation prediction method uses a small number of training datasets. Hence, the model is not generalized or pervasive.
RESULTS: By searching from a new database, we collected a large dataset in Homo sapiens. PredGly, a novel software, can predict lysine glycation sites for H.sapiens, which was developed by combining multiple features. In addition, XGboost was adopted to optimize feature vectors and to improve the model performance. Through comparing various classifiers, support vector machine achieved an optimal performance. On the basis of a new independent test set, PredGly outperformed other glycation tools. It suggests that PredGly can provide more instructive guidance for further experimental research of lysine glycation.
AVAILABILITY AND IMPLEMENTATION: https://github.com/yujialinncu/PredGly. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30590442     DOI: 10.1093/bioinformatics/bty1043

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

Authors:  Fuyi Li; Jinxiang Chen; Zongyuan Ge; Ya Wen; Yanwei Yue; Morihiro Hayashida; Abdelkader Baggag; Halima Bensmail; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  A systematic review of recent trends in research on therapeutically significant L-asparaginase and acute lymphoblastic leukemia.

Authors:  Susan Aishwarya Suresh; Selvarajan Ethiraj; K N Rajnish
Journal:  Mol Biol Rep       Date:  2022-07-10       Impact factor: 2.742

3.  iProtGly-SS: A Tool to Accurately Predict Protein Glycation Site Using Structural-Based Features.

Authors:  Iman Dehzangi; Alok Sharma; Swakkhar Shatabda
Journal:  Methods Mol Biol       Date:  2022

4.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

5.  predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance.

Authors:  Sabit Ahmed; Afrida Rahman; Md Al Mehedi Hasan; Md Khaled Ben Islam; Julia Rahman; Shamim Ahmad
Journal:  PLoS One       Date:  2021-04-01       Impact factor: 3.240

6.  On the Prediction of In Vitro Arginine Glycation of Short Peptides Using Artificial Neural Networks.

Authors:  Ulices Que-Salinas; Dulce Martinez-Peon; Angel D Reyes-Figueroa; Ivonne Ibarra; Christian Quintus Scheckhuber
Journal:  Sensors (Basel)       Date:  2022-07-13       Impact factor: 3.847

7.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

8.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

9.  Machine Learning Enables Accurate Prediction of Asparagine Deamidation Probability and Rate.

Authors:  Jared A Delmar; Jihong Wang; Seo Woo Choi; Jason A Martins; John P Mikhail
Journal:  Mol Ther Methods Clin Dev       Date:  2019-10-01       Impact factor: 6.698

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.