Literature DB >> 16213466

Predicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition.

Hong-Bin Shen1, Kuo-Chen Chou.   

Abstract

The nucleus is the brain of eukaryotic cells that guides the life processes of the cell by issuing key instructions. For in-depth understanding of the biochemical process of the nucleus, the knowledge of localization of nuclear proteins is very important. With the avalanche of protein sequences generated in the post-genomic era, it is highly desired to develop an automated method for fast annotating the subnuclear locations for numerous newly found nuclear protein sequences so as to be able to timely utilize them for basic research and drug discovery. In view of this, a novel approach is developed for predicting the protein subnuclear location. It is featured by introducing a powerful classifier, the optimized evidence-theoretic K-nearest classifier, and using the pseudo amino acid composition [K.C. Chou, PROTEINS: Structure, Function, and Genetics, 43 (2001) 246], which can incorporate a considerable amount of sequence-order effects, to represent protein samples. As a demonstration, identifications were performed for 370 nuclear proteins among the following 9 subnuclear locations: (1) Cajal body, (2) chromatin, (3) heterochromatin, (4) nuclear diffuse, (5) nuclear pore, (6) nuclear speckle, (7) nucleolus, (8) PcG body, and (9) PML body. The overall success rates thus obtained by both the re-substitution test and jackknife cross-validation test are significantly higher than those by existing classifiers on the same working dataset. It is anticipated that the powerful approach may also become a useful high throughput vehicle to bridge the huge gap occurring in the post-genomic era between the number of gene sequences in databases and the number of gene products that have been functionally characterized. The OET-KNN classifier will be available at www.pami.sjtu.edu.cn/people/hbshen.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16213466     DOI: 10.1016/j.bbrc.2005.09.117

Source DB:  PubMed          Journal:  Biochem Biophys Res Commun        ISSN: 0006-291X            Impact factor:   3.575


  11 in total

1.  A multi-label classifier for prediction membrane protein functional types in animal.

Authors:  Hong-Liang Zou
Journal:  J Membr Biol       Date:  2014-08-09       Impact factor: 1.843

2.  Prediction of lysine ubiquitylation with ensemble classifier and feature selection.

Authors:  Xiaowei Zhao; Xiangtao Li; Zhiqiang Ma; Minghao Yin
Journal:  Int J Mol Sci       Date:  2011-11-28       Impact factor: 5.923

3.  Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence.

Authors:  Pufeng Du; Yanda Li
Journal:  BMC Bioinformatics       Date:  2006-11-30       Impact factor: 3.169

4.  CIPPN: computational identification of protein pupylation sites by using neural network.

Authors:  Wenzheng Bao; Zhu-Hong You; De-Shuang Huang
Journal:  Oncotarget       Date:  2017-11-06

5.  Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks.

Authors:  Adele Sadat Haghighat Hoseini; Mitra Mirzarezaee
Journal:  Iran J Biotechnol       Date:  2018-08-11       Impact factor: 1.671

6.  An ensemble method for predicting subnuclear localizations from primary protein structures.

Authors:  Guo Sheng Han; Zu Guo Yu; Vo Anh; Anaththa P D Krishnajith; Yu-Chu Tian
Journal:  PLoS One       Date:  2013-02-27       Impact factor: 3.240

7.  ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.

Authors:  Wen-Lin Huang; Chun-Wei Tung; Shih-Wen Ho; Shiow-Fen Hwang; Shinn-Ying Ho
Journal:  BMC Bioinformatics       Date:  2008-02-01       Impact factor: 3.169

8.  Protein Sub-Nuclear Localization Based on Effective Fusion Representations and Dimension Reduction Algorithm LDA.

Authors:  Shunfang Wang; Shuhui Liu
Journal:  Int J Mol Sci       Date:  2015-12-19       Impact factor: 5.923

9.  Position-specific analysis and prediction of protein pupylation sites based on multiple features.

Authors:  Xiaowei Zhao; Jiangyan Dai; Qiao Ning; Zhiqiang Ma; Minghao Yin; Pingping Sun
Journal:  Biomed Res Int       Date:  2013-08-26       Impact factor: 3.411

10.  Protein sub-nuclear localization prediction using SVM and Pfam domain information.

Authors:  Ravindra Kumar; Sohni Jain; Bandana Kumari; Manish Kumar
Journal:  PLoS One       Date:  2014-06-04       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.