Literature DB >> 15949806

Prediction of protein subcellular location using a combined feature of sequence.

Qing-Bin Gao1, Zheng-Zhi Wang, Chun Yan, Yao-Hua Du.   

Abstract

To understand the structure and function of a protein, an important task is to know where it occurs in the cell. Thus, a computational method for properly predicting the subcellular location of proteins would be significant in interpreting the original data produced by the large-scale genome sequencing projects. The present work tries to explore an effective method for extracting features from protein primary sequence and find a novel measurement of similarity among proteins for classifying a protein to its proper subcellular location. We considered four locations in eukaryotic cells and three locations in prokaryotic cells, which have been investigated by several groups in the past. A combined feature of primary sequence defined as a 430D (dimensional) vector was utilized to represent a protein, including 20 amino acid compositions, 400 dipeptide compositions and 10 physicochemical properties. To evaluate the prediction performance of this encoding scheme, a jackknife test based on nearest neighbor algorithm was employed. The prediction accuracies for cytoplasmic, extracellular, mitochondrial, and nuclear proteins in the former dataset were 86.3%, 89.2%, 73.5% and 89.4%, respectively, and the total prediction accuracy reached 86.3%. As for the prediction accuracies of cytoplasmic, extracellular, and periplasmic proteins in the latter dataset, the prediction accuracies were 97.4%, 86.0%, and 79.7, respectively, and the total prediction accuracy of 92.5% was achieved. The results indicate that this method outperforms some existing approaches based on amino acid composition or amino acid composition and dipeptide composition.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15949806     DOI: 10.1016/j.febslet.2005.05.021

Source DB:  PubMed          Journal:  FEBS Lett        ISSN: 0014-5793            Impact factor:   4.124


  9 in total

1.  Economical evolution: microbes reduce the synthetic cost of extracellular proteins.

Authors:  Daniel R Smith; Matthew R Chapman
Journal:  MBio       Date:  2010-08-24       Impact factor: 7.867

2.  NClassG+: A classifier for non-classically secreted Gram-positive bacterial proteins.

Authors:  Daniel Restrepo-Montoya; Camilo Pino; Luis F Nino; Manuel E Patarroyo; Manuel A Patarroyo
Journal:  BMC Bioinformatics       Date:  2011-01-14       Impact factor: 3.169

3.  Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence.

Authors:  Pufeng Du; Yanda Li
Journal:  BMC Bioinformatics       Date:  2006-11-30       Impact factor: 3.169

Review 4.  A survey of computational intelligence techniques in protein function prediction.

Authors:  Arvind Kumar Tiwari; Rajeev Srivastava
Journal:  Int J Proteomics       Date:  2014-12-11

5.  Virtual screening of Indonesian herbal compounds as COVID-19 supportive therapy: machine learning and pharmacophore modeling approaches.

Authors:  Linda Erlina; Rafika Indah Paramita; Wisnu Ananta Kusuma; Fadilah Fadilah; Aryo Tedjo; Irandi Putra Pratomo; Nabila Sekar Ramadhanti; Ahmad Kamal Nasution; Fadhlal Khaliq Surado; Aries Fitriawan; Khaerunissa Anbar Istiadi; Arry Yanuar
Journal:  BMC Complement Med Ther       Date:  2022-08-03

6.  Efficacy of different protein descriptors in predicting protein functional families.

Authors:  Serene A K Ong; Hong Huang Lin; Yu Zong Chen; Ze Rong Li; Zhiwei Cao
Journal:  BMC Bioinformatics       Date:  2007-08-17       Impact factor: 3.169

7.  'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.

Authors:  Yao Qing Shen; Gertraud Burger
Journal:  BMC Bioinformatics       Date:  2007-10-29       Impact factor: 3.169

8.  Prediction of functional class of proteins and peptides irrespective of sequence homology by support vector machines.

Authors:  Zhi Qun Tang; Hong Huang Lin; Hai Lei Zhang; Lian Yi Han; Xin Chen; Yu Zong Chen
Journal:  Bioinform Biol Insights       Date:  2009-11-24

9.  Protein Sub-Nuclear Localization Based on Effective Fusion Representations and Dimension Reduction Algorithm LDA.

Authors:  Shunfang Wang; Shuhui Liu
Journal:  Int J Mol Sci       Date:  2015-12-19       Impact factor: 5.923

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.