Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs.

Literature DB >> 12967962

Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs.

Abstract

MOTIVATION: The subcellular location of a protein is closely correlated to its function. Thus, computational prediction of subcellular locations from the amino acid sequence information would help annotation and functional prediction of protein coding genes in complete genomes. We have developed a method based on support vector machines (SVMs).
RESULTS: We considered 12 subcellular locations in eukaryotic cells: chloroplast, cytoplasm, cytoskeleton, endoplasmic reticulum, extracellular medium, Golgi apparatus, lysosome, mitochondrion, nucleus, peroxisome, plasma membrane, and vacuole. We constructed a data set of proteins with known locations from the SWISS-PROT database. A set of SVMs was trained to predict the subcellular location of a given protein based on its amino acid, amino acid pair, and gapped amino acid pair compositions. The predictors based on these different compositions were then combined using a voting scheme. Results obtained through 5-fold cross-validation tests showed an improvement in prediction accuracy over the algorithm based on the amino acid composition only. This prediction method is available via the Internet.

Mesh：

Substances：
Amino Acids
Proteins

Year: 2003 PMID： 12967962 DOI： 10.1093/bioinformatics/btg222

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

82 in total

1. C2 domain is responsible for targeting rice phosphoinositide specific phospholipase C.

Authors: Sunny D Rupwate; Ram Rajasekharan
Journal: Plant Mol Biol Date: 2011-11-29 Impact factor: 4.076

2. Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis.

Authors: Rakesh Kaundal; Reena Saini; Patrick X Zhao
Journal: Plant Physiol Date: 2010-07-20 Impact factor: 8.340

3. A novel representation of protein sequences for prediction of subcellular location using support vector machines.

Authors: Setsuro Matsuda; Jean-Philippe Vert; Hiroto Saigo; Nobuhisa Ueda; Hiroyuki Toh; Tatsuya Akutsu
Journal: Protein Sci Date: 2005-11 Impact factor: 6.725

4. Large-scale automated analysis of location patterns in randomly tagged 3T3 cells.

Authors: Elvira García Osuna; Juchang Hua; Nicholas W Bateman; Ting Zhao; Peter B Berget; Robert F Murphy
Journal: Ann Biomed Eng Date: 2007-02-07 Impact factor: 3.934

Review 5. Penalized feature selection and classification in bioinformatics.

Authors: Shuangge Ma; Jian Huang
Journal: Brief Bioinform Date: 2008-06-18 Impact factor: 11.622

6. Interleukin-4-inducing principle from Schistosoma mansoni eggs contains a functional C-terminal nuclear localization signal necessary for nuclear translocation in mammalian cells but not for its uptake.

Authors: Ishwinder Kaur; Gabriele Schramm; Bart Everts; Thomas Scholzen; Karin B Kindle; Christian Beetz; Cristina Montiel-Duarte; Silke Blindow; Arwyn T Jones; Helmut Haas; Snjezana Stolnik; David M Heery; Franco H Falcone
Journal: Infect Immun Date: 2011-01-10 Impact factor: 3.441

Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs.

1. C2 domain is responsible for targeting rice phosphoinositide specific phospholipase C.

2. Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis.

3. A novel representation of protein sequences for prediction of subcellular location using support vector machines.

4. Large-scale automated analysis of location patterns in randomly tagged 3T3 cells.

Review 5. Penalized feature selection and classification in bioinformatics.

6. Interleukin-4-inducing principle from Schistosoma mansoni eggs contains a functional C-terminal nuclear localization signal necessary for nuclear translocation in mammalian cells but not for its uptake.

Review 7. Machine learning for in silico virtual screening and chemical genomics: new strategies.

8. Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.

9. ESLpred2: improved method for predicting subcellular localization of eukaryotic proteins.

10. Semi-supervised protein subcellular localization.