Literature DB >> 31025681

RF-GlutarySite: a random forest based predictor for glutarylation sites.

Hussam J Al-Barakati1, Hiroto Saigo2, Robert H Newman3, Dukka B Kc1.   

Abstract

Glutarylation, which is a newly identified posttranslational modification that occurs on lysine residues, has recently emerged as an important regulator of several metabolic and mitochondrial processes. However, the specific sites of modification on individual proteins, as well as the extent of glutarylation throughout the proteome, remain largely uncharacterized. Though informative, proteomic approaches based on mass spectrometry can be expensive, technically challenging and time-consuming. Therefore, the ability to predict glutarylation sites from protein primary sequences can complement proteomics analyses and help researchers study the characteristics and functional consequences of glutarylation. To this end, we used Random Forest (RF) machine learning strategies to identify the physiochemical and sequence-based features that correlated most substantially with glutarylation. We then used these features to develop a novel method to predict glutarylation sites from primary amino acid sequences using RF. Based on 10-fold cross-validation, the resulting algorithm, termed 'RF-GlutarySite', achieved efficiency scores of 75%, 81%, 68% and 0.50 with respect to accuracy (ACC), sensitivity (SN), specificity (SP) and Matthew's correlation coefficient (MCC), respectively. Likewise, using an independent test set, RF-GlutarySite exhibited ACC, SN, SP and MCC scores of 72%, 73%, 70% and 0.43, respectively. Results using both 10-fold cross validation and an independent test set were on par with or better than those achieved by existing glutarylation site predictors. Notably, RF-GlutarySite achieved the highest SN score among available glutarylation site prediction tools. Consequently, our method has the potential to uncover new glutarylation sites and to facilitate the discovery of relationships between glutarylation and well-known lysine modifications, such as acetylation, methylation and SUMOylation, as well as a number of recently identified lysine modifications, such as malonylation and succinylation.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31025681     DOI: 10.1039/c9mo00028c

Source DB:  PubMed          Journal:  Mol Omics        ISSN: 2515-4184


  11 in total

1.  Computational Identification of Lysine Glutarylation Sites Using Positive-Unlabeled Learning.

Authors:  Zhe Ju; Shi-Yun Wang
Journal:  Curr Genomics       Date:  2020-04       Impact factor: 2.236

Review 2.  Insights into the post-translational modification and its emerging role in shaping the tumor microenvironment.

Authors:  Wen Li; Feifei Li; Xia Zhang; Hui-Kuan Lin; Chuan Xu
Journal:  Signal Transduct Target Ther       Date:  2021-12-20

3.  Bioinformatic Analyses of Peroxiredoxins and RF-Prx: A Random Forest-Based Predictor and Classifier for Prxs.

Authors:  Hussam Al-Barakati; Robert H Newman; Dukka B Kc; Leslie B Poole
Journal:  Methods Mol Biol       Date:  2022

4.  ProtTrans-Glutar: Incorporating Features From Pre-trained Transformer-Based Models for Predicting Glutarylation Sites.

Authors:  Fatma Indriani; Kunti Robiatul Mahmudah; Bedy Purnama; Kenji Satou
Journal:  Front Genet       Date:  2022-05-31       Impact factor: 4.772

Review 5.  A chemical field guide to histone nonenzymatic modifications.

Authors:  Sarah Faulkner; Igor Maksimovic; Yael David
Journal:  Curr Opin Chem Biol       Date:  2021-06-20       Impact factor: 8.972

6.  RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites.

Authors:  Hussam Al-Barakati; Niraj Thapa; Saigo Hiroto; Kaushik Roy; Robert H Newman; Dukka Kc
Journal:  Comput Struct Biotechnol J       Date:  2020-03-04       Impact factor: 7.271

7.  predPhogly-Site: Predicting phosphoglycerylation sites by incorporating probabilistic sequence-coupling information into PseAAC and addressing data imbalance.

Authors:  Sabit Ahmed; Afrida Rahman; Md Al Mehedi Hasan; Md Khaled Ben Islam; Julia Rahman; Shamim Ahmad
Journal:  PLoS One       Date:  2021-04-01       Impact factor: 3.240

8.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

Review 9.  Functions and Mechanisms of Lysine Glutarylation in Eukaryotes.

Authors:  Longxiang Xie; Yafei Xiao; Fucheng Meng; Yongqiang Li; Zhenyu Shi; Keli Qian
Journal:  Front Cell Dev Biol       Date:  2021-06-24

10.  Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features.

Authors:  Md Easin Arafat; Md Wakil Ahmad; S M Shovan; Abdollah Dehzangi; Shubhashis Roy Dipta; Md Al Mehedi Hasan; Ghazaleh Taherzadeh; Swakkhar Shatabda; Alok Sharma
Journal:  Genes (Basel)       Date:  2020-08-31       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.