Using functional domain composition and support vector machines for prediction of protein subcellular location.

Kuo-Chen Chou, Yu-Dong Cai.

Abstract

Proteins are generally classified into the following 12 subcellular locations: 1) chloroplast, 2) cytoplasm, 3) cytoskeleton, 4) endoplasmic reticulum, 5) extracellular, 6) Golgi apparatus, 7) lysosome, 8) mitochondria, 9) nucleus, 10) peroxisome, 11) plasma membrane, and 12) vacuole. Because the function of a protein is closely correlated with its subcellular location, and because new protein sequences are entering databanks at a rapidly increasing rate, it is vitally important for both basic research and the pharmaceutical industry to establish a high-throughput tool for predicting protein subcellular location. In this paper, a new concept, the so-called "functional domain composition", is introduced. Based on this concept, a protein can be represented as a vector in a high-dimensional space, where each of the clustered functional domains derived from the protein universe serves as a vector base. With this representation, the support vector machine (SVM) algorithm is introduced for predicting protein subcellular location. High success rates are obtained in the self-consistency test, the jackknife test, and the independent dataset test. The current approach can not only play an important complementary role to the powerful covariant discriminant algorithm based on the pseudo amino acid composition representation (Chou, K. C. (2001) Proteins Struct. Funct. Genet. 43, 246-255; Correction (2001) Proteins Struct. Funct. Genet. 44, 60), but may also greatly stimulate the development of this area.
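The representation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a 0/1 encoding (a component is 1 when the corresponding domain is found in the protein), which the abstract does not specify. The domain names, the toy proteins, and all function names are hypothetical. To stay dependency-free, a 1-nearest-neighbour rule on Hamming distance stands in for the SVM, so only the vector encoding and the jackknife (leave-one-out) test are shown.

```python
from typing import List, Sequence, Set

# Hypothetical domain vocabulary: each entry stands for one clustered
# functional domain and serves as one base (dimension) of the vector space.
DOMAIN_VOCAB: List[str] = ["dom_A", "dom_B", "dom_C", "dom_D"]


def domain_composition(hits: Set[str],
                       vocab: Sequence[str] = DOMAIN_VOCAB) -> List[int]:
    """Encode a protein as a 0/1 vector: 1 where the domain was found in it.

    The binary encoding is an assumption made for this sketch.
    """
    return [1 if domain in hits else 0 for domain in vocab]


def hamming(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of positions at which two equal-length vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def nearest_label(query: Sequence[int],
                  train_vecs: Sequence[Sequence[int]],
                  train_labels: Sequence[str]) -> str:
    """Stand-in classifier (1-nearest neighbour), NOT the SVM from the paper."""
    best = min(range(len(train_vecs)),
               key=lambda i: hamming(query, train_vecs[i]))
    return train_labels[best]


def jackknife_accuracy(vecs: List[List[int]], labels: List[str]) -> float:
    """Leave-one-out test: predict each protein from all the others."""
    correct = 0
    for i in range(len(vecs)):
        rest_vecs = vecs[:i] + vecs[i + 1:]
        rest_labels = labels[:i] + labels[i + 1:]
        if nearest_label(vecs[i], rest_vecs, rest_labels) == labels[i]:
            correct += 1
    return correct / len(vecs)


# Toy, made-up data: the domains found in four hypothetical proteins.
toy_hits = [{"dom_A"}, {"dom_A", "dom_B"}, {"dom_C"}, {"dom_C", "dom_D"}]
toy_labels = ["nucleus", "nucleus", "cytoplasm", "cytoplasm"]
toy_vecs = [domain_composition(h) for h in toy_hits]
```

Calling `jackknife_accuracy(toy_vecs, toy_labels)` returns the fraction of toy proteins whose location is recovered when each one is held out in turn. Replacing `nearest_label` with a trained SVM would bring the sketch closer to the approach the abstract describes.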

Year: 2002    PMID: 12186861    DOI: 10.1074/jbc.M204161200

Source DB: PubMed    Journal: J Biol Chem    ISSN: 0021-9258    Impact factor: 5.157


  98 in total

1.  iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.

Authors:  Hao Lin; En-Ze Deng; Hui Ding; Wei Chen; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2014-10-31       Impact factor: 16.971

2.  Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions.

Authors:  Chin-Sheng Yu; Chih-Jen Lin; Jenn-Kang Hwang
Journal:  Protein Sci       Date:  2004-05       Impact factor: 6.725

3.  Predicting subcellular localization via protein motif co-occurrence.

Authors:  Michelle S Scott; David Y Thomas; Michael T Hallett
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

4.  On the pH-optimum of activity and stability of proteins.

Authors:  Kemper Talley; Emil Alexov
Journal:  Proteins       Date:  2010-09

5.  A novel representation of protein sequences for prediction of subcellular location using support vector machines.

Authors:  Setsuro Matsuda; Jean-Philippe Vert; Hiroto Saigo; Nobuhisa Ueda; Hiroyuki Toh; Tatsuya Akutsu
Journal:  Protein Sci       Date:  2005-11       Impact factor: 6.725

6.  Using fourier spectrum analysis and pseudo amino acid composition for prediction of membrane protein types.

Authors:  Hui Liu; Jie Yang; Meng Wang; Li Xue; Kuo-Chen Chou
Journal:  Protein J       Date:  2005-08       Impact factor: 2.371

7.  Prediction of mitochondrial proteins using discrete wavelet transform.

Authors:  Lin Jiang; Menglong Li; Zhining Wen; Kelong Wang; Yuanbo Diao
Journal:  Protein J       Date:  2006-06       Impact factor: 2.371

8.  Prediction of protein function improving sequence remote alignment search by a fuzzy logic algorithm.

Authors:  Antonio Gómez; Juan Cedano; Jordi Espadaler; Antonio Hermoso; Jaume Piñol; Enrique Querol
Journal:  Protein J       Date:  2008-02       Impact factor: 2.371

9.  Using the nonlinear dimensionality reduction method for the prediction of subcellular localization of Gram-negative bacterial proteins.

Authors:  Tong Wang; Jie Yang
Journal:  Mol Divers       Date:  2009-03-28       Impact factor: 2.943

10.  Using AdaBoost for the prediction of subcellular location of prokaryotic and eukaryotic proteins.

Authors:  Bing Niu; Yu-Huan Jin; Kai-Yan Feng; Wen-Cong Lu; Yu-Dong Cai; Guo-Zheng Li
Journal:  Mol Divers       Date:  2008-05-28       Impact factor: 2.943
