Literature DB >> 16639720

Predicting protein subcellular location by fusing multiple classifiers.

Kuo-Chen Chou1, Hong-Bin Shen.   

Abstract

One of the fundamental goals in cell biology and proteomics is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. Knowledge of subcellular locations of proteins can provide key hints for revealing their functions and understanding how they interact with each other in cellular networking. Unfortunately, it is both time-consuming and expensive to determine the localization of an uncharacterized protein in a living cell purely based on experiments. With the avalanche of newly found protein sequences emerging in the post genomic era, we are facing a critical challenge, that is, how to develop an automated method to fast and reliably identify their subcellular locations so as to be able to timely use them for basic research and drug discovery. In view of this, an ensemble classifier was developed by the approach of fusing many basic individual classifiers through a voting system. Each of these basic classifiers was trained in a different dimension of the amphiphilic pseudo amino acid composition (Chou [2005] Bioinformatics 21: 10-19). As a demonstration, predictions were performed with the fusion classifier for proteins among the following 14 localizations: (1) cell wall, (2) centriole, (3) chloroplast, (4) cytoplasm, (5) cytoskeleton, (6) endoplasmic reticulum, (7) extracellular, (8) Golgi apparatus, (9) lysosome, (10) mitochondria, (11) nucleus, (12) peroxisome, (13) plasma membrane, and (14) vacuole. The overall success rates thus obtained via the resubstitution test, jackknife test, and independent dataset test were all significantly higher than those by the existing classifiers. It is anticipated that the novel ensemble classifier may also become a very useful vehicle in classifying other attributes of proteins according to their sequences, such as membrane protein type, enzyme family/sub-family, G-protein coupled receptor (GPCR) type, and structural class, among many others. The fusion ensemble classifier will be available at www.pami.sjtu.edu.cn/people/hbshen. Copyright 2006 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16639720     DOI: 10.1002/jcb.20879

Source DB:  PubMed          Journal:  J Cell Biochem        ISSN: 0730-2312            Impact factor:   4.429


  10 in total

1.  Multi label learning for prediction of human protein subcellular localizations.

Authors:  Lin Zhu; Jie Yang; Hong-Bin Shen
Journal:  Protein J       Date:  2009-12       Impact factor: 2.371

2.  A multilabel model based on Chou's pseudo-amino acid composition for identifying membrane proteins with both single and multiple functional types.

Authors:  Chao Huang; Jing-Qi Yuan
Journal:  J Membr Biol       Date:  2013-04-02       Impact factor: 1.843

3.  Automated classification of fMRI data employing trial-based imagery tasks.

Authors:  Jong-Hwan Lee; Matthew Marzelli; Ferenc A Jolesz; Seung-Schik Yoo
Journal:  Med Image Anal       Date:  2009-01-16       Impact factor: 8.545

4.  Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence.

Authors:  Pufeng Du; Yanda Li
Journal:  BMC Bioinformatics       Date:  2006-11-30       Impact factor: 3.169

5.  acACS: improving the prediction accuracy of protein subcellular locations and protein classification by incorporating the average chemical shifts composition.

Authors:  Guo-Liang Fan; Yan-Ling Liu; Yong-Chun Zuo; Han-Xue Mei; Yi Rang; Bao-Yan Hou; Yan Zhao
Journal:  ScientificWorldJournal       Date:  2014-07-02

6.  iATC-mHyb: a hybrid multi-label classifier for predicting the classification of anatomical therapeutic chemicals.

Authors:  Xiang Cheng; Shu-Guang Zhao; Xuan Xiao; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2017-04-11

7.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

8.  'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.

Authors:  Yao Qing Shen; Gertraud Burger
Journal:  BMC Bioinformatics       Date:  2007-10-29       Impact factor: 3.169

9.  ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier.

Authors:  Daozheng Chen; Xiaoyu Tian; Bo Zhou; Jun Gao
Journal:  Biomed Res Int       Date:  2016-08-28       Impact factor: 3.411

10.  iROS-gPseKNC: Predicting replication origin sites in DNA by incorporating dinucleotide position-specific propensity into general pseudo nucleotide composition.

Authors:  Xuan Xiao; Han-Xiao Ye; Zi Liu; Jian-Hua Jia; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-06-07
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.