Literature DB >> 24929100

Prediction of bacterial protein subcellular localization by incorporating various features into Chou's PseAAC and a backward feature selection approach.

Liqi Li1, Sanjiu Yu2, Weidong Xiao1, Yongsheng Li3, Maolin Li1, Lan Huang2, Xiaoqi Zheng4, Shiwen Zhou5, Hua Yang6.   

Abstract

Information on the subcellular localization of bacterial proteins is essential for protein function prediction, genome annotation and drug design. Here we proposed a novel approach to predict the subcellular localization of bacterial proteins by fusing features from position-specific score matrix (PSSM), Gene Ontology (GO) and PROFEAT. A backward feature selection approach by linear kennel of SVM was then used to rank the integrated feature vectors and extract optimal features. Finally, SVM was applied for predicting protein subcellular locations based on these optimal features. To validate the performance of our method, we employed jackknife cross-validation tests on three low similarity datasets, i.e., M638, Gneg1456 and Gpos523. The overall accuracies of 94.98%, 93.21%, and 94.57% were achieved for these three datasets, which are higher (from 1.8% to 10.9%) than those by state-of-the-art tools. Comparison results suggest that our method could serve as a very useful vehicle for expediting the prediction of bacterial protein subcellular localization.
Copyright © 2014 Elsevier Masson SAS. All rights reserved.

Keywords:  Backward feature selection; Gene ontology; PROFEAT; Position-specific score matrix; Protein subcellular localization

Mesh:

Substances:

Year:  2014        PMID: 24929100     DOI: 10.1016/j.biochi.2014.06.001

Source DB:  PubMed          Journal:  Biochimie        ISSN: 0300-9084            Impact factor:   4.079


  10 in total

1.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

Review 2.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

3.  Functional Brain Imaging Reliably Predicts Bimanual Motor Skill Performance in a Standardized Surgical Task.

Authors:  Yuanyuan Gao; Pingkun Yan; Uwe Kruger; Lora Cavuoto; Steven Schwaitzberg; Suvranu De; Xavier Intes
Journal:  IEEE Trans Biomed Eng       Date:  2021-06-18       Impact factor: 4.756

4.  Sequence-based identification of recombination spots using pseudo nucleic acid representation and recursive feature extraction by linear kernel SVM.

Authors:  Liqi Li; Sanjiu Yu; Weidong Xiao; Yongsheng Li; Lan Huang; Xiaoqi Zheng; Shiwen Zhou; Hua Yang
Journal:  BMC Bioinformatics       Date:  2014-11-20       Impact factor: 3.169

5.  Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.

Authors:  Bin Yu; Shan Li; Wenying Qiu; Minghui Wang; Junwei Du; Yusen Zhang; Xing Chen
Journal:  BMC Genomics       Date:  2018-06-19       Impact factor: 3.969

6.  Protein Sub-Nuclear Localization Based on Effective Fusion Representations and Dimension Reduction Algorithm LDA.

Authors:  Shunfang Wang; Shuhui Liu
Journal:  Int J Mol Sci       Date:  2015-12-19       Impact factor: 5.923

7.  Identifying the subfamilies of voltage-gated potassium channels using feature selection technique.

Authors:  Wei-Xin Liu; En-Ze Deng; Wei Chen; Hao Lin
Journal:  Int J Mol Sci       Date:  2014-07-22       Impact factor: 5.923

8.  Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.

Authors:  Shunfang Wang; Yaoting Yue
Journal:  PLoS One       Date:  2018-04-12       Impact factor: 3.240

9.  Protein sequence information extraction and subcellular localization prediction with gapped k-Mer method.

Authors:  Yu-Hua Yao; Ya-Ping Lv; Ling Li; Hui-Min Xu; Bin-Bin Ji; Jing Chen; Chun Li; Bo Liao; Xu-Ying Nan
Journal:  BMC Bioinformatics       Date:  2019-12-30       Impact factor: 3.169

10.  Identifying Heat Shock Protein Families from Imbalanced Data by Using Combined Features.

Authors:  Xiao-Yang Jing; Feng-Min Li
Journal:  Comput Math Methods Med       Date:  2020-09-23       Impact factor: 2.238

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.