Literature DB >> 35773617

PseAraUbi: predicting arabidopsis ubiquitination sites by incorporating the physico-chemical and structural features.

Wei Wang1,2, Yu Zhang3, Dong Liu3, HongJun Zhang4, XianFang Wang5, Yun Zhou6.   

Abstract

KEY MESSAGE: We makes three kinds of important features from Arabidopsis thaliana: protein secondary structure based on the Chou-Fasman parameter, amino acids hydrophobicity and polarity information, and analyze their properties. Ubiquitination modification is an important post-translational modification of proteins, which participates in the regulation of many important life activities in cells. At present, ubiquitination proteomics research is mostly concentrated in animals and yeasts, while relatively few studies have been carried out in plants. It can be said that the calculation and prediction of Arabidopsis thaliana ubiquitination sites is still in its infancy. Based on this, we describe a calculation method, PseAraUbi (Prediction of Arabidopsis thaliana ubiquitination sites using pseudo amino acid composition), that can effectively detect ubiquitination sites on Arabidopsis thaliana using support vector machine learning classifiers. Based on protein sequence information, extract features from the Chou-Fasman parameter, amino acids hydrophobicity features, polarity information and selected for classification with the Boruta algorithm. PseAraUbi achieves promising performances with an AUC score of 0.953 with fivefold cross-validation on the training dataset, which are significantly better than that of the pioneer Arabidopsis thaliana ubiquitination sites method. We also proved the ability of our proposed method on independent test sets, thus gaining a competitive advantage. In addition, we also in-depth analyzed the physicochemical properties of amino acids in the region adjacent to the ubiquitination site. To facilitate the community, the source code, optimal feature subset, ubiquitination sites dataset in the Arbidopsis proteome are available at GitHub ( https://github.com/HNUBioinformatics/PseAraUbi.git ) for interest users.
© 2022. The Author(s), under exclusive licence to Springer Nature B.V.

Entities:  

Keywords:  Arabidopsis thaliana; Sequence information; Support vector machine; Ubiquitination sites

Mesh:

Substances:

Year:  2022        PMID: 35773617     DOI: 10.1007/s11103-022-01288-3

Source DB:  PubMed          Journal:  Plant Mol Biol        ISSN: 0167-4412            Impact factor:   4.335


  31 in total

1.  Prediction of protein cellular attributes using pseudo-amino acid composition.

Authors:  K C Chou
Journal:  Proteins       Date:  2001-05-15

2.  The Positively Charged Active Site of the Bacterial Toxin RelE Causes a Large Shift in the General Base pKa.

Authors:  David A Hiller; Brian F Dunican; Sunitha Nallur; Nan-Sheng Li; Joseph A Piccirilli; Scott A Strobel
Journal:  Biochemistry       Date:  2020-04-24       Impact factor: 3.162

Review 3.  Ubiquitin and ubiquitin-like proteins in protein regulation.

Authors:  Joerg Herrmann; Lilach O Lerman; Amir Lerman
Journal:  Circ Res       Date:  2007-05-11       Impact factor: 17.367

4.  Predicting gene phenotype by multi-label multi-class model based on essential functional features.

Authors:  Lei Chen; Zhandong Li; Tao Zeng; Yu-Hang Zhang; Hao Li; Tao Huang; Yu-Dong Cai
Journal:  Mol Genet Genomics       Date:  2021-04-29       Impact factor: 3.291

Review 5.  Research progress in protein posttranslational modification site prediction.

Authors:  Wenying He; Leyi Wei; Quan Zou
Journal:  Brief Funct Genomics       Date:  2018-07-22       Impact factor: 4.241

Review 6.  Towards more accurate prediction of ubiquitination sites: a comprehensive review of current methods, tools and features.

Authors:  Zhen Chen; Yuan Zhou; Ziding Zhang; Jiangning Song
Journal:  Brief Bioinform       Date:  2014-09-10       Impact factor: 11.622

Review 7.  Prediction of the secondary structure of proteins from their amino acid sequence.

Authors:  P Y Chou; G D Fasman
Journal:  Adv Enzymol Relat Areas Mol Biol       Date:  1978

8.  Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition.

Authors:  Zhe Ju; Shi-Yun Wang
Journal:  Gene       Date:  2018-04-23       Impact factor: 3.688

Review 9.  The Role of Atypical Ubiquitin Chains in the Regulation of the Antiviral Innate Immune Response.

Authors:  Mariska van Huizen; Marjolein Kikkert
Journal:  Front Cell Dev Biol       Date:  2020-01-22

10.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.