Literature DB >> 33090794

iGlu_AdaBoost: Identification of Lysine Glutarylation Using the AdaBoost Classifier.

Lijun Dou1,2, Xiaoling Li3, Lichao Zhang4, Huaikun Xiang1, Lei Xu5.   

Abstract

Lysine glutarylation is a newly reported post-translational modification (PTM) that plays significant roles in regulating metabolic and mitochondrial processes. Accurate identification of protein glutarylation is the primary task to better investigate molecular functions and various applications. Due to the common disadvantages of the time-consuming and expensive nature of traditional biological sequencing techniques as well as the explosive growth of protein data, building precise computational models to rapidly diagnose glutarylation is a popular and feasible solution. In this work, we proposed a novel AdaBoost-based predictor called iGlu_AdaBoost to distinguish glutarylation and non-glutarylation sequences. Here, the top 37 features were chosen from a total of 1768 combined features using Chi2 following incremental feature selection (IFS) to build the model, including 188D, the composition of k-spaced amino acid pairs (CKSAAP), and enhanced amino acid composition (EAAC). With the help of the hybrid-sampling method SMOTE-Tomek, the AdaBoost algorithm was performed with satisfactory recall, specificity, and AUC values of 87.48%, 72.49%, and 0.89 over 10-fold cross validation as well as 72.73%, 71.92%, and 0.63 over independent test, respectively. Further feature analysis inferred that positively charged amino acids RK play critical roles in glutarylation recognition. Our model presented the well generalization ability and consistency of the prediction results of positive and negative samples, which is comparable to four published tools. The proposed predictor is an efficient tool to find potential glutarylation sites and provides helpful suggestions for further research on glutarylation mechanisms and concerned disease treatments.

Entities:  

Keywords:  188D features; Chi2 analysis; SMOTE-Tomek; glutarylation; unbalanced data

Mesh:

Substances:

Year:  2020        PMID: 33090794     DOI: 10.1021/acs.jproteome.0c00314

Source DB:  PubMed          Journal:  J Proteome Res        ISSN: 1535-3893            Impact factor:   4.466


  6 in total

Review 1.  Insights into the post-translational modification and its emerging role in shaping the tumor microenvironment.

Authors:  Wen Li; Feifei Li; Xia Zhang; Hui-Kuan Lin; Chuan Xu
Journal:  Signal Transduct Target Ther       Date:  2021-12-20

2.  ProtTrans-Glutar: Incorporating Features From Pre-trained Transformer-Based Models for Predicting Glutarylation Sites.

Authors:  Fatma Indriani; Kunti Robiatul Mahmudah; Bedy Purnama; Kenji Satou
Journal:  Front Genet       Date:  2022-05-31       Impact factor: 4.772

3.  EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models.

Authors:  Xiangju Liu; Yu Zhang; Chunli Fu; Ruochi Zhang; Fengfeng Zhou
Journal:  Front Genet       Date:  2021-04-27       Impact factor: 4.599

Review 4.  A chemical field guide to histone nonenzymatic modifications.

Authors:  Sarah Faulkner; Igor Maksimovic; Yael David
Journal:  Curr Opin Chem Biol       Date:  2021-06-20       Impact factor: 8.972

5.  DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.

Authors:  Feifei Cui; Shuang Li; Zilong Zhang; Miaomiao Sui; Chen Cao; Abd El-Latif Hesham; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-04-26       Impact factor: 6.155

Review 6.  Functions and Mechanisms of Lysine Glutarylation in Eukaryotes.

Authors:  Longxiang Xie; Yafei Xiao; Fucheng Meng; Yongqiang Li; Zhenyu Shi; Keli Qian
Journal:  Front Cell Dev Biol       Date:  2021-06-24
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.