Literature DB >> 28488814

iPhos-PseEvo: Identifying Human Phosphorylated Proteins by Incorporating Evolutionary Information into General PseAAC via Grey System Theory.

Wang-Ren Qiu1,2, Bi-Qian Sun1, Xuan Xiao1,3, Dong Xu2, Kuo-Chen Chou3,4.   

Abstract

Protein phosphorylation plays a critical role in human body by altering the structural conformation of a protein, causing it to become activated/deactivated, or functional modification. Given an uncharacterized protein sequence, can we predict whether it may be phosphorylated or may not? This is no doubt a very meaningful problem for both basic research and drug development. Unfortunately, to our best knowledge, so far no high throughput bioinformatics tool whatsoever has been developed to address such a very basic but important problem due to its extremely complexity and lacking sufficient training data. Here we proposed a predictor called iPhos-PseEvo by (1) incorporating the protein sequence evolutionary information into the general pseudo amino acid composition (PseAAC) via the grey system theory, (2) balancing out the skewed training datasets by the asymmetric bootstrap approach, and (3) constructing an ensemble predictor by fusing an array of individual random forest classifiers thru a voting system. Rigorous jackknife tests have indicated that very promising success rates have been achieved by iPhos-PseEvo even for such a difficult problem. A user-friendly web-server for iPhos-PseEvo has been established at http://www.jci-bioinfo.cn/iPhos-PseEvo, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the formulation and approach presented here can be used to analyze many other problems in protein science as well.
© 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Chou's general PseAAC; Disease-related phosphorylation; Evolutionary information; Fusion ensemble classifier; Grey system model; Random forest classifiers

Mesh:

Substances:

Year:  2016        PMID: 28488814     DOI: 10.1002/minf.201600010

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  28 in total

1.  Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  Mol Biol Rep       Date:  2018-09-20       Impact factor: 2.316

Review 2.  Structural Variability in the RLR-MAVS Pathway and Sensitive Detection of Viral RNAs.

Authors:  Qiu-Xing Jiang
Journal:  Med Chem       Date:  2019       Impact factor: 2.745

3.  Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.

Authors:  Wakil Ahmad; Easin Arafat; Ghazaleh Taherzadeh; Alok Sharma; Shubhashis Roy Dipta; Abdollah Dehzangi; Swakkhar Shatabda
Journal:  IEEE Access       Date:  2020-04-22       Impact factor: 3.367

4.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.

Authors:  Weizhong Lin; Dong Xu
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

5.  iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC.

Authors:  Wang-Ren Qiu; Bi-Qian Sun; Xuan Xiao; Zhao-Chun Xu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-07-12

6.  iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.

Authors:  Wang-Ren Qiu; Xuan Xiao; Zhao-Chun Xu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-08-09

7.  Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate.

Authors:  Chun Yan Yu; Xiao Xu Li; Hong Yang; Ying Hong Li; Wei Wei Xue; Yu Zong Chen; Lin Tao; Feng Zhu
Journal:  Int J Mol Sci       Date:  2018-01-08       Impact factor: 5.923

8.  iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition.

Authors:  Wang-Ren Qiu; Shi-Yu Jiang; Zhao-Chun Xu; Xuan Xiao; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2017-06-20

9.  Prediction of HIV-1 and HIV-2 proteins by using Chou's pseudo amino acid compositions and different classifiers.

Authors:  Juan Mei; Ji Zhao
Journal:  Sci Rep       Date:  2018-02-05       Impact factor: 4.379

10.  2L-piRNA: A Two-Layer Ensemble Classifier for Identifying Piwi-Interacting RNAs and Their Function.

Authors:  Bin Liu; Fan Yang; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2017-04-13
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.