Literature DB >> 22185507

Prediction of protein subcellular multi-localization based on the general form of Chou's pseudo amino acid composition.

Li-Qi Li1, Yuan Zhang, Ling-Yun Zou, Yue Zhou, Xiao-Qi Zheng.   

Abstract

Many proteins bear multi-locational characteristics, and this phenomenon is closely related to biological function. However, most of the existing methods can only deal with single-location proteins. Therefore, an automatic and reliable ensemble classifier for protein subcellular multi-localization is needed. We propose a new ensemble classifier combining the KNN (K-nearest neighbour) and SVM (support vector machine) algorithms to predict the subcellular localization of eukaryotic, Gram-negative bacterial and viral proteins based on the general form of Chou's pseudo amino acid composition, i.e., GO (gene ontology) annotations, dipeptide composition and AmPseAAC (Amphiphilic pseudo amino acid composition). This ensemble classifier was developed by fusing many basic individual classifiers through a voting system. The overall prediction accuracies obtained by the KNN-SVM ensemble classifier are 95.22, 93.47 and 80.72% for the eukaryotic, Gram-negative bacterial and viral proteins, respectively. Our prediction accuracies are significantly higher than those by previous methods and reveal that our strategy better predicts subcellular locations of multi-location proteins.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22185507     DOI: 10.2174/092986612799789369

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  9 in total

1.  EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou's PseAAC.

Authors:  Tzu-Hao Chang; Li-Ching Wu; Tzong-Yi Lee; Shu-Pin Chen; Hsien-Da Huang; Jorng-Tzong Horng
Journal:  J Comput Aided Mol Des       Date:  2013-01-03       Impact factor: 3.686

Review 2.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

3.  Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

Authors:  Samad Jahandideh; Vinodh Srinivasasainagendra; Degui Zhi
Journal:  J Theor Biol       Date:  2012-08-03       Impact factor: 2.691

4.  Sequence-based identification of recombination spots using pseudo nucleic acid representation and recursive feature extraction by linear kernel SVM.

Authors:  Liqi Li; Sanjiu Yu; Weidong Xiao; Yongsheng Li; Lan Huang; Xiaoqi Zheng; Shiwen Zhou; Hua Yang
Journal:  BMC Bioinformatics       Date:  2014-11-20       Impact factor: 3.169

5.  Protein (multi-)location prediction: utilizing interdependencies via a generative model.

Authors:  Ramanuja Simha; Sebastian Briesemeister; Oliver Kohlbacher; Hagit Shatkay
Journal:  Bioinformatics       Date:  2015-06-15       Impact factor: 6.937

6.  mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines.

Authors:  Shibiao Wan; Man-Wai Mak; Sun-Yuan Kung
Journal:  BMC Bioinformatics       Date:  2012-11-06       Impact factor: 3.169

7.  Protein (multi-)location prediction: using location inter-dependencies in a probabilistic framework.

Authors:  Ramanuja Simha; Hagit Shatkay
Journal:  Algorithms Mol Biol       Date:  2014-03-19       Impact factor: 1.405

8.  HybridGO-Loc: mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins.

Authors:  Shibiao Wan; Man-Wai Mak; Sun-Yuan Kung
Journal:  PLoS One       Date:  2014-03-19       Impact factor: 3.240

9.  PseAAC-General: fast building various modes of general form of Chou's pseudo-amino acid composition for large-scale protein datasets.

Authors:  Pufeng Du; Shuwang Gu; Yasen Jiao
Journal:  Int J Mol Sci       Date:  2014-02-26       Impact factor: 5.923

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.