Literature DB >> 18163182

AAIndexLoc: predicting subcellular localization of proteins based on a new representation of sequences using amino acid indices.

E Tantoso1, Kuo-Bin Li.   

Abstract

Identifying a protein's subcellular localization is an important step to understand its function. However, the involved experimental work is usually laborious, time consuming and costly. Computational prediction hence becomes valuable to reduce the inefficiency. Here we provide a method to predict protein subcellular localization by using amino acid composition and physicochemical properties. The method concatenates the information extracted from a protein's N-terminal, middle and full sequence. Each part is represented by amino acid composition, weighted amino acid composition, five-level grouping composition and five-level dipeptide composition. We divided our dataset into training and testing set. The training set is used to determine the best performing amino acid index by using five-fold cross validation, whereas the testing set acts as the independent dataset to evaluate the performance of our model. With the novel representation method, we achieve an accuracy of approximately 75% on independent dataset. We conclude that this new representation indeed performs well and is able to extract the protein sequence information. We have developed a web server for predicting protein subcellular localization. The web server is available at http://aaindexloc.bii.a-star.edu.sg .

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 18163182     DOI: 10.1007/s00726-007-0616-y

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  12 in total

1.  Machine learning based prediction for peptide drift times in ion mobility spectrometry.

Authors:  Anuj R Shah; Khushbu Agarwal; Erin S Baker; Mudita Singhal; Anoop M Mayampurath; Yehia M Ibrahim; Lars J Kangas; Matthew E Monroe; Rui Zhao; Mikhail E Belov; Gordon A Anderson; Richard D Smith
Journal:  Bioinformatics       Date:  2010-05-21       Impact factor: 6.937

2.  Fuzzy clustering of physicochemical and biochemical properties of amino acids.

Authors:  Indrajit Saha; Ujjwal Maulik; Sanghamitra Bandyopadhyay; Dariusz Plewczynski
Journal:  Amino Acids       Date:  2011-10-13       Impact factor: 3.520

3.  An Rh1-GFP fusion protein is in the cytoplasmic membrane of a white mutant strain of Chlamydomonas reinhardtii.

Authors:  Corinne Yoshihara; Kentaro Inoue; Denise Schichnes; Steven Ruzin; William Inwood; Sydney Kustu
Journal:  Mol Plant       Date:  2008-11-14       Impact factor: 13.164

4.  NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins.

Authors:  Daniel Restrepo-Montoya; Camilo Pino; Luis F Nino; Manuel E Patarroyo; Manuel A Patarroyo
Journal:  BMC Bioinformatics       Date:  2011-01-14       Impact factor: 3.169

5.  iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins.

Authors:  Kuo-Chen Chou; Zhi-Cheng Wu; Xuan Xiao
Journal:  PLoS One       Date:  2011-03-30       Impact factor: 3.240

6.  Bagging with CTD--a novel signature for the hierarchical prediction of secreted protein trafficking in eukaryotes.

Authors:  Geetha Govindan; Achuthsankar S Nair
Journal:  Genomics Proteomics Bioinformatics       Date:  2013-12-06       Impact factor: 7.691

7.  PR2ALIGN: a stand-alone software program and a web-server for protein sequence alignment using weighted biochemical properties of amino acids.

Authors:  Igor B Kuznetsov; Michael McDuffie
Journal:  BMC Res Notes       Date:  2015-05-07

8.  ESLpred2: improved method for predicting subcellular localization of eukaryotic proteins.

Authors:  Aarti Garg; Gajendra P S Raghava
Journal:  BMC Bioinformatics       Date:  2008-11-28       Impact factor: 3.169

9.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

10.  A Survey of Chloroplast Protein Kinases and Phosphatases in Arabidopsis thaliana.

Authors:  I Schliebner; M Pribil; J Zühlke; A Dietzmann; D Leister
Journal:  Curr Genomics       Date:  2008-05       Impact factor: 2.236

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.