Literature DB >> 19662505

Predicting subcellular location of proteins using integrated-algorithm method.

Yu-Dong Cai1, Lin Lu, Lei Chen, Jian-Feng He.   

Abstract

Protein's subcellular location, which indicates where a protein resides in a cell, is an important characteristic of protein. Correctly assigning proteins to their subcellular locations would be of great help to the prediction of proteins' function, genome annotation, and drug design. Yet, in spite of great technical advance in the past decades, it is still time-consuming and laborious to experimentally determine protein subcellular locations on a high throughput scale. Hence, four integrated-algorithm methods were developed to fulfill such high throughput prediction in this article. Two data sets taken from the literature (Chou and Elrod, Protein Eng 12:107-118, 1999) were used as training set and test set, which consisted of 2,391 and 2,598 proteins, respectively. Amino acid composition was applied to represent the protein sequences. The jackknife cross-validation was used to test the training set. The final best integrated-algorithm predictor was constructed by integrating 10 algorithms in Weka (a software tool for tackling data mining tasks, http://www.cs.waikato.ac.nz/ml/weka/ ) based on an mRMR (Minimum Redundancy Maximum Relevance, http://research.janelia.org/peng/proj/mRMR/ ) method. It can achieve correct rate of 77.83 and 80.56% for the training set and test set, respectively, which is better than all of the 60 algorithms collected in Weka. This predicting software is available upon request.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19662505     DOI: 10.1007/s11030-009-9182-4

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  12 in total

1.  Protein subcellular location prediction.

Authors:  K C Chou; D W Elrod
Journal:  Protein Eng       Date:  1999-02

2.  Support vector machine approach for protein subcellular localization prediction.

Authors:  S Hua; Z Sun
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

3.  Prediction of protein subcellular locations using Markov chain models.

Authors:  Z Yuan
Journal:  FEBS Lett       Date:  1999-05-14       Impact factor: 4.124

4.  Predicting membrane protein type by functional domain composition and pseudo-amino acid composition.

Authors:  Yu-Dong Cai; Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2005-07-25       Impact factor: 2.691

5.  Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy.

Authors:  Hanchuan Peng; Fuhui Long; Chris Ding
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2005-08       Impact factor: 6.226

6.  BioWeka--extending the Weka framework for bioinformatics.

Authors:  Jan E Gewehr; Martin Szugat; Ralf Zimmer
Journal:  Bioinformatics       Date:  2007-01-19       Impact factor: 6.937

7.  EnsemPro: an ensemble approach to predicting transcription start sites in human genomic DNA sequences.

Authors:  Hong-Hee Won; Min-Ji Kim; Seonwoo Kim; Jong-Won Kim
Journal:  Genomics       Date:  2008-03       Impact factor: 5.736

8.  Enzymes/non-enzymes classification model complexity based on composition, sequence, 3D and topological indices.

Authors:  Cristian Robert Munteanu; Humberto González-Díaz; Alexandre L Magalhães
Journal:  J Theor Biol       Date:  2008-06-14       Impact factor: 2.691

Review 9.  Wanted: subcellular localization of proteins based on sequence.

Authors:  F Eisenhaber; P Bork
Journal:  Trends Cell Biol       Date:  1998-04       Impact factor: 20.808

10.  Relation between amino acid composition and cellular location of proteins.

Authors:  J Cedano; P Aloy; J A Pérez-Pons; E Querol
Journal:  J Mol Biol       Date:  1997-02-28       Impact factor: 5.469

View more
  7 in total

1.  On the pH-optimum of activity and stability of proteins.

Authors:  Kemper Talley; Emil Alexov
Journal:  Proteins       Date:  2010-09

2.  Inference of global HIV-1 sequence patterns and preliminary feature analysis.

Authors:  Yan Wang; Reda Rawi; Daniel Hoffmann; Binlian Sun; Rongge Yang
Journal:  Virol Sin       Date:  2013-07-27       Impact factor: 4.327

3.  Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property.

Authors:  Tao Huang; Lei Chen; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-09-28       Impact factor: 3.240

4.  Predicting Anatomical Therapeutic Chemical (ATC) classification of drugs by integrating chemical-chemical interactions and similarities.

Authors:  Lei Chen; Wei-Ming Zeng; Yu-Dong Cai; Kai-Yan Feng; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-04-13       Impact factor: 3.240

5.  Prediction and analysis of retinoblastoma related genes through gene ontology and KEGG.

Authors:  Zhen Li; Bi-Qing Li; Min Jiang; Lei Chen; Jian Zhang; Lin Liu; Tao Huang
Journal:  Biomed Res Int       Date:  2013-08-13       Impact factor: 3.411

6.  Prediction of cancer proteins by integrating protein interaction, domain frequency, and domain interaction data using machine learning algorithms.

Authors:  Chien-Hung Huang; Huai-Shun Peng; Ka-Lok Ng
Journal:  Biomed Res Int       Date:  2015-03-17       Impact factor: 3.411

7.  Predicting chemical toxicity effects based on chemical-chemical interactions.

Authors:  Lei Chen; Jing Lu; Jian Zhang; Kai-Rui Feng; Ming-Yue Zheng; Yu-Dong Cai
Journal:  PLoS One       Date:  2013-02-15       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.