A graphic representation of protein sequence and predicting the subcellular locations of prokaryotic proteins.

Zhi-Ping Feng, Chun-Ting Zhang.

Abstract

The Zp curve, a three-dimensional space-curve representation of a protein's primary sequence based on the hydrophobicity and charge properties of the amino acid residues along the sequence, is proposed. Using the Zp parameters extracted from the three components of the Zp curve together with the Bayes discriminant algorithm, the subcellular locations of prokaryotic proteins were predicted. An accuracy of 81.5% was achieved in the cross-validation test using 13 parameters extracted from the curve for a database of 997 prokaryotic proteins. This result is slightly better than that of the neural network method (80.9%) based on amino acid composition for the same database. By combining the amino acid composition with the Zp parameters, an overall predictive accuracy of 89.6% can be achieved, about 3% higher than that of the Bayes discriminant algorithm based on amino acid composition alone for the same database. The prediction was also performed on a larger dataset derived from version 39 of the SWISS-PROT databank and on two datasets with different levels of sequence similarity. Even for the dataset with no sequence similarity, the improvement reaches 4.4% in the cross-validation test. The results indicate that the Zp parameters are effective in representing the information within a protein primary sequence. The method of extracting information from the primary structure may be useful in other areas of protein studies.
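
The abstract does not give the Zp curve's defining equations, so the following is only a minimal sketch of the general idea: mapping a sequence to a three-component cumulative curve driven by hydrophobicity and charge. The residue classes, the step values, and the function name `curve_sketch` are all assumptions made for illustration and should not be read as the authors' definition.

```python
from itertools import accumulate

# Assumed residue classes, chosen for illustration only (not from the paper).
HYDROPHOBIC = set("AVLIMFWC")
POSITIVE = set("KRH")
NEGATIVE = set("DE")

def curve_sketch(sequence):
    """Return one (x, y, z) point per residue as cumulative sums.

    x: +1 for an (assumed) hydrophobic residue, -1 otherwise
    y: +1 for an (assumed) positively charged residue, 0 otherwise
    z: +1 for an (assumed) negatively charged residue, 0 otherwise
    """
    seq = sequence.upper()
    x_steps = [1 if r in HYDROPHOBIC else -1 for r in seq]
    y_steps = [1 if r in POSITIVE else 0 for r in seq]
    z_steps = [1 if r in NEGATIVE else 0 for r in seq]
    return list(zip(accumulate(x_steps), accumulate(y_steps), accumulate(z_steps)))

points = curve_sketch("MKDEL")
```

Summary statistics of such a curve's components could then serve as features for a classifier; the 13 parameters used in the study are not specified in the abstract and are not reproduced here.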

Year:  2002        PMID: 11849997     DOI: 10.1016/s1357-2725(01)00121-2

Source DB:  PubMed          Journal:  Int J Biochem Cell Biol        ISSN: 1357-2725            Impact factor:   5.085


  4 in total

1.  3D representations of amino acids-applications to protein sequence comparison and classification.

Authors:  Jie Li; Patrice Koehl
Journal:  Comput Struct Biotechnol J       Date:  2014-09-06       Impact factor: 7.271

2.  ADLD: a novel graphical representation of protein sequences and its application.

Authors:  Lei Wang; Hui Peng; Jinhua Zheng
Journal:  Comput Math Methods Med       Date:  2014-10-30       Impact factor: 2.238

3.  Esub8: a novel tool to predict protein subcellular localizations in eukaryotic organisms.

Authors:  Qinghua Cui; Tianzi Jiang; Bing Liu; Songde Ma
Journal:  BMC Bioinformatics       Date:  2004-05-27       Impact factor: 3.169

4.  Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.

Authors:  Shunfang Wang; Yaoting Yue
Journal:  PLoS One       Date:  2018-04-12       Impact factor: 3.240
