Literature DB >> 20666727

Prediction of apoptosis protein locations with genetic algorithms and support vector machines through a new mode of pseudo amino acid composition.

Krishna Kumar Kandaswamy1, Ganesan Pugalenthi, Steffen Möller, Enno Hartmann, Kai-Uwe Kalies, P N Suganthan, Thomas Martinetz.   

Abstract

Apoptosis is an essential process for controlling tissue homeostasis by regulating a physiological balance between cell proliferation and cell death. The subcellular locations of proteins performing the cell death are determined by mostly independent cellular mechanisms. The regular bioinformatics tools to predict the subcellular locations of such apoptotic proteins do often fail. This work proposes a model for the sorting of proteins that are involved in apoptosis, allowing us to both the prediction of their subcellular locations as well as the molecular properties that contributed to it. We report a novel hybrid Genetic Algorithm (GA)/Support Vector Machine (SVM) approach to predict apoptotic protein sequences using 119 sequence derived properties like frequency of amino acid groups, secondary structure, and physicochemical properties. GA is used for selecting a near-optimal subset of informative features that is most relevant for the classification. Jackknife cross-validation is applied to test the predictive capability of the proposed method on 317 apoptosis proteins. Our method achieved 85.80% accuracy using all 119 features and 89.91% accuracy for 25 features selected by GA. Our models were examined by a test dataset of 98 apoptosis proteins and obtained an overall accuracy of 90.34%. The results show that the proposed approach is promising; it is able to select small subsets of features and still improves the classification accuracy. Our model can contribute to the understanding of programmed cell death and drug discovery. The software and dataset are available at http://www.inb.uni-luebeck.de/tools-demos/apoptosis/GASVM.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20666727     DOI: 10.2174/0929866511009011473

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  15 in total

1.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

2.  Prediction of antimicrobial peptides based on sequence alignment and feature selection methods.

Authors:  Ping Wang; Lele Hu; Guiyou Liu; Nan Jiang; Xiaoyun Chen; Jianyong Xu; Wen Zheng; Li Li; Ming Tan; Zugen Chen; Hui Song; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-04-13       Impact factor: 3.240

3.  iLoc-Euk: a multi-label classifier for predicting the subcellular localization of singleplex and multiplex eukaryotic proteins.

Authors:  Kuo-Chen Chou; Zhi-Cheng Wu; Xuan Xiao
Journal:  PLoS One       Date:  2011-03-30       Impact factor: 3.240

4.  NR-2L: a two-level predictor for identifying nuclear receptor subfamilies based on sequence-derived features.

Authors:  Pu Wang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-08-15       Impact factor: 3.240

5.  iNR-PhysChem: a sequence-based predictor for identifying nuclear receptors and their subfamilies via physical-chemical property matrix.

Authors:  Xuan Xiao; Pu Wang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-02-21       Impact factor: 3.240

6.  BS-KNN: An Effective Algorithm for Predicting Protein Subchloroplast Localization.

Authors:  Jing Hu; Xianghe Yan
Journal:  Evol Bioinform Online       Date:  2012-01-05       Impact factor: 1.625

7.  A multi-label predictor for identifying the subcellular locations of singleplex and multiplex eukaryotic proteins.

Authors:  Xiao Wang; Guo-Zheng Li
Journal:  PLoS One       Date:  2012-05-22       Impact factor: 3.240

8.  iGPCR-drug: a web server for predicting interaction between GPCRs and drugs in cellular networking.

Authors:  Xuan Xiao; Jian-Liang Min; Pu Wang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2013-08-27       Impact factor: 3.240

9.  Naïve Bayes classifier with feature selection to identify phage virion proteins.

Authors:  Peng-Mian Feng; Hui Ding; Wei Chen; Hao Lin
Journal:  Comput Math Methods Med       Date:  2013-05-15       Impact factor: 2.238

10.  iRSpot-TNCPseAAC: identify recombination spots with trinucleotide composition and pseudo amino acid components.

Authors:  Wang-Ren Qiu; Xuan Xiao; Kuo-Chen Chou
Journal:  Int J Mol Sci       Date:  2014-01-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.