Literature DB >> 23756733

iHSP-PseRAAAC: Identifying the heat shock protein families using pseudo reduced amino acid alphabet composition.

Peng-Mian Feng1, Wei Chen, Hao Lin, Kuo-Chen Chou.   

Abstract

Heat shock proteins (HSPs) are a type of functionally related proteins present in all living organisms, both prokaryotes and eukaryotes. They play essential roles in protein-protein interactions such as folding and assisting in the establishment of proper protein conformation and prevention of unwanted protein aggregation. Their dysfunction may cause various life-threatening disorders, such as Parkinson's, Alzheimer's, and cardiovascular diseases. Based on their functions, HSPs are usually classified into six families: (i) HSP20 or sHSP, (ii) HSP40 or J-class proteins, (iii) HSP60 or GroEL/ES, (iv) HSP70, (v) HSP90, and (vi) HSP100. Although considerable progress has been achieved in discriminating HSPs from other proteins, it is still a big challenge to identify HSPs among their six different functional types according to their sequence information alone. With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop a high-throughput computational tool in this regard. To take up such a challenge, a predictor called iHSP-PseRAAAC has been developed by incorporating the reduced amino acid alphabet information into the general form of pseudo amino acid composition. One of the remarkable advantages of introducing the reduced amino acid alphabet is being able to avoid the notorious dimension disaster or overfitting problem in statistical prediction. It was observed that the overall success rate achieved by iHSP-PseRAAAC in identifying the functional types of HSPs among the aforementioned six types was more than 87%, which was derived by the jackknife test on a stringent benchmark dataset in which none of HSPs included has ≥40% pairwise sequence identity to any other in the same subset. It has not escaped our notice that the reduced amino acid alphabet approach can also be used to investigate other protein classification problems. As a user-friendly web server, iHSP-PseRAAAC is accessible to the public at http://lin.uestc.edu.cn/server/iHSP-PseRAAAC.
Copyright © 2013 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Heat shock protein; PseAAC; Reduced amino acid alphabet; SVM; Web server; n-Peptide composition

Mesh:

Substances:

Year:  2013        PMID: 23756733     DOI: 10.1016/j.ab.2013.05.024

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  80 in total

1.  iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.

Authors:  Hao Lin; En-Ze Deng; Hui Ding; Wei Chen; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2014-10-31       Impact factor: 16.971

2.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

3.  XG-PseU: an eXtreme Gradient Boosting based method for identifying pseudouridine sites.

Authors:  Kewei Liu; Wei Chen; Hao Lin
Journal:  Mol Genet Genomics       Date:  2019-08-07       Impact factor: 3.291

4.  iPhosY-PseAAC: identify phosphotyrosine sites by incorporating sequence statistical moments into PseAAC.

Authors:  Yaser Daanial Khan; Nouman Rasool; Waqar Hussain; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Mol Biol Rep       Date:  2018-10-11       Impact factor: 2.316

5.  Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC.

Authors:  Ahmad Hassan Butt; Nouman Rasool; Yaser Daanial Khan
Journal:  Mol Biol Rep       Date:  2018-09-20       Impact factor: 2.316

Review 6.  Structural Variability in the RLR-MAVS Pathway and Sensitive Detection of Viral RNAs.

Authors:  Qiu-Xing Jiang
Journal:  Med Chem       Date:  2019       Impact factor: 2.745

7.  Sequence-based predictive modeling to identify cancerlectins.

Authors:  Hong-Yan Lai; Xin-Xin Chen; Wei Chen; Hua Tang; Hao Lin
Journal:  Oncotarget       Date:  2017-04-25

8.  Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.

Authors:  Zhen Chen; Xuhan Liu; Fuyi Li; Chen Li; Tatiana Marquez-Lago; André Leier; Tatsuya Akutsu; Geoffrey I Webb; Dakang Xu; Alexander Ian Smith; Lei Li; Kuo-Chen Chou; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

9.  Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.

Authors:  Khurshid Ahmad; Muhammad Waris; Maqsood Hayat
Journal:  J Membr Biol       Date:  2016-01-08       Impact factor: 1.843

10.  The effect of three novel feature extraction methods on the prediction of the subcellular localization of multi-site virus proteins.

Authors:  Lei Wang; Yaou Zhao; Yuehui Chen; Dong Wang
Journal:  Bioengineered       Date:  2017-11-22       Impact factor: 3.269

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.