Literature DB >> 32114285

Computational prediction of protein ubiquitination sites mapping on Arabidopsis thaliana.

Md Parvez Mosharaf1, Md Mehedi Hassan2, Fee Faysal Ahmed3, Mst Shamima Khatun2, Mohammad Ali Moni4, Md Nurul Haque Mollah5.   

Abstract

Among the protein post-translational modifications (PTMs), ubiquitination is considered as one of the most significant processes which can regulate the cellular functions and various diseases. Identification of ubiquitination sites becomes important for understanding the mechanisms of ubiquitination-related biological processes. Both experimental and computational approaches are available for identifying ubiquitination sites based on protein sequences of different species. The experimental approaches are time-consuming, laborious and costly. In silico prediction is an alternative time saving, easier and cost-effective approach for identifying ubiquitination sites. Moreover, the sequence patterns in the different species around the ubiquitination sites are not similar which demands species-specific predictors. Therefore, in this study, we have proposed a novel computational method for identifying ubiquitination sites based on protein sequences of A. thaliana species which will be robust against outlying observations also. Through the comparative study of two encoding schemes and three classifiers, the random forest (RF) based predictor was selected as the best predictor under the CKSAAP encoding scheme with 1:1 ratio of positive and negative samples (i.e. ubiquitinated and non-ubiquitinated) in training dataset. The proposed predictor produced the area under the ROC curve (AUC score) as 0.91 and 0.86 for 5-fold cross-validation test with the training dataset and the independent test dataset of A. thaliana respectively. The proposed RF based predictor also performed much better than the other existing ubiquitination sites predictors for A. thaliana.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Arabidopsis thaliana species; CKSAAP encoding; Protein sequences; Random forest; Ubiquitination sites

Mesh:

Substances:

Year:  2020        PMID: 32114285     DOI: 10.1016/j.compbiolchem.2020.107238

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  9 in total

1.  PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.

Authors:  Wei Wang; Yu Zhang; Dong Liu; HongJun Zhang; XianFang Wang; Yun Zhou
Journal:  Plant Mol Biol       Date:  2022-07-01       Impact factor: 4.335

2.  nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.

Authors:  Yong-Zi Chen; Zhuo-Zhi Wang; Yanan Wang; Guoguang Ying; Zhen Chen; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

3.  Prediction of serine phosphorylation sites mapping on Schizosaccharomyces Pombe by fusing three encoding schemes with the random forest classifier.

Authors:  Samme Amena Tasmia; Md Kaderi Kibria; Khanis Farhana Tuly; Md Ariful Islam; Mst Shamima Khatun; Md Mehedi Hasan; Md Nurul Haque Mollah
Journal:  Sci Rep       Date:  2022-02-16       Impact factor: 4.379

4.  Network-Based Data Analysis Reveals Ion Channel-Related Gene Features in COVID-19: A Bioinformatic Approach.

Authors:  Hao Zhang; Ting Feng
Journal:  Biochem Genet       Date:  2022-09-14       Impact factor: 2.220

5.  IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.

Authors:  Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hiroyuki Kurata
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

Review 6.  Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction.

Authors:  Mst Shamima Khatun; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Curr Genomics       Date:  2020-09       Impact factor: 2.236

7.  UbiComb: A Hybrid Deep Learning Model for Predicting Plant-Specific Protein Ubiquitylation Sites.

Authors:  Arslan Siraj; Dae Yeong Lim; Hilal Tayara; Kil To Chong
Journal:  Genes (Basel)       Date:  2021-05-11       Impact factor: 4.096

8.  An Improved Computational Prediction Model for Lysine Succinylation Sites Mapping on Homo sapiens by Fusing Three Sequence Encoding Schemes with the Random Forest Classifier.

Authors:  Samme Amena Tasmia; Fee Faysal Ahmed; Parvez Mosharaf; Mehedi Hasan; Nurul Haque Mollah
Journal:  Curr Genomics       Date:  2021-02       Impact factor: 2.236

9.  AoP-LSE: Antioxidant Proteins Classification Using Deep Latent Space Encoding of Sequence Features.

Authors:  Muhammad Usman; Shujaat Khan; Seongyong Park; Jeong-A Lee
Journal:  Curr Issues Mol Biol       Date:  2021-10-09       Impact factor: 2.976

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.