Literature DB >> 11524373

Support vector machine approach for protein subcellular localization prediction.

S Hua1, Z Sun.   

Abstract

MOTIVATION: Subcellular localization is a key functional characteristic of proteins. A fully automatic and reliable prediction system for protein subcellular localization is needed, especially for the analysis of large-scale genome sequences.
RESULTS: In this paper, Support Vector Machine has been introduced to predict the subcellular localization of proteins from their amino acid compositions. The total prediction accuracies reach 91.4% for three subcellular locations in prokaryotic organisms and 79.4% for four locations in eukaryotic organisms. Predictions by our approach are robust to errors in the protein N-terminal sequences. This new approach provides superior prediction performance compared with existing algorithms based on amino acid composition and can be a complementary method to other existing methods based on sorting signals. AVAILABILITY: A web server implementing the prediction method is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/. SUPPLEMENTARY INFORMATION: Supplementary material is available at http://www.bioinfo.tsinghua.edu.cn/SubLoc/.

Mesh:

Substances:

Year:  2001        PMID: 11524373     DOI: 10.1093/bioinformatics/17.8.721

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  188 in total

1.  Nuclear cysteine cathepsin variants in thyroid carcinoma cells.

Authors:  Sofia Tedelind; Kseniia Poliakova; Amanda Valeta; Ruth Hunegnaw; Eyoel Lemma Yemanaberhan; Nils-Erik Heldin; Junichi Kurebayashi; Ekkehard Weber; Nataša Kopitar-Jerala; Boris Turk; Matthew Bogyo; Klaudia Brix
Journal:  Biol Chem       Date:  2010-08       Impact factor: 3.915

2.  Experimental analysis of the Arabidopsis mitochondrial proteome highlights signaling and regulatory components, provides assessment of targeting prediction programs, and indicates plant-specific mitochondrial proteins.

Authors:  Joshua L Heazlewood; Julian S Tonti-Filippini; Alexander M Gout; David A Day; James Whelan; A Harvey Millar
Journal:  Plant Cell       Date:  2003-12-11       Impact factor: 11.277

3.  DBSubLoc: database of protein subcellular localization.

Authors:  Tao Guo; Sujun Hua; Xinglai Ji; Zhirong Sun
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

5.  Predicting protein cellular localization using a domain projection method.

Authors:  Richard Mott; Jörg Schultz; Peer Bork; Chris P Ponting
Journal:  Genome Res       Date:  2002-08       Impact factor: 9.043

6.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

7.  ESLpred: SVM-based method for subcellular localization of eukaryotic proteins using dipeptide composition and PSI-BLAST.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

8.  LOCnet and LOCtarget: sub-cellular localization for structural genomics targets.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

9.  Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions.

Authors:  Chin-Sheng Yu; Chih-Jen Lin; Jenn-Kang Hwang
Journal:  Protein Sci       Date:  2004-05       Impact factor: 6.725

10.  DSDBASE: a consortium of native and modelled disulphide bonds in proteins.

Authors:  A Vinayagam; G Pugalenthi; R Rajesh; R Sowdhamini
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.