Literature DB >> 11241220

Prediction of the subcellular location of prokaryotic proteins based on a new representation of the amino acid composition.

Z P Feng1.   

Abstract

A new representation of protein sequence is devoted in this paper, in which each protein can be represented by a 20-dimensional (20D) vector of unit length. Inspired by the principle of superposition of state in quantum mechanics, the squares of the 20 components of the vector correspond to the amino acid composition. Using the new representation of the primary sequence and Bayes Discriminant Algorithm, the subcellular location of prokaryotic proteins was predicted. The overall predictive accuracy in the jackknife test can be 3% higher than the result of using amino acid composition directly for the database of sequence identity is less than 90%, but 5% higher when sequence identity is less than 80%. The higher predictive accuracy indicates that the current measure of extracting the information from the primary sequence is efficient. Since the subcellular location restricting a protein's possible function, the present method should also be a useful measure for the systematic analysis of genome data. The program used in this paper is available on request.

Mesh:

Substances:

Year:  2001        PMID: 11241220     DOI: 10.1002/1097-0282(20010415)58:5<491::AID-BIP1024>3.0.CO;2-I

Source DB:  PubMed          Journal:  Biopolymers        ISSN: 0006-3525            Impact factor:   2.505


  9 in total

1.  DBSubLoc: database of protein subcellular localization.

Authors:  Tao Guo; Sujun Hua; Xinglai Ji; Zhirong Sun
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  A hybrid approach for predicting promiscuous MHC class I restricted T cell epitopes.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  J Biosci       Date:  2007-01       Impact factor: 1.826

3.  Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.

Authors:  Lei Du; Qingfang Meng; Yuehui Chen; Peng Wu
Journal:  BMC Bioinformatics       Date:  2020-05-24       Impact factor: 3.169

4.  Molecular biocoding of insulin.

Authors:  Lutvo Kurić
Journal:  Adv Appl Bioinform Chem       Date:  2010-07-28

5.  Protein subcellular localization prediction for Gram-negative bacteria using amino acid subalphabets and a combination of multiple support vector machines.

Authors:  Jiren Wang; Wing-Kin Sung; Arun Krishnan; Kuo-Bin Li
Journal:  BMC Bioinformatics       Date:  2005-07-13       Impact factor: 3.169

6.  GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

7.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

8.  Domain organization of long signal peptides of single-pass integral membrane proteins reveals multiple functional capacity.

Authors:  Jan A Hiss; Eduard Resch; Alexander Schreiner; Michael Meissner; Anna Starzinski-Powitz; Gisbert Schneider
Journal:  PLoS One       Date:  2008-07-23       Impact factor: 3.240

9.  PredPSD: A Gradient Tree Boosting Approach for Single-Stranded and Double-Stranded DNA Binding Protein Prediction.

Authors:  Changgeng Tan; Tong Wang; Wenyi Yang; Lei Deng
Journal:  Molecules       Date:  2019-12-26       Impact factor: 4.411

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.