Literature DB >> 17291684

ProLoc: prediction of protein subnuclear localization using SVM with automatic selection from physicochemical composition features.

Wen-Lin Huang1, Chun-Wei Tung, Hui-Ling Huang, Shiow-Fen Hwang, Shinn-Ying Ho.   

Abstract

Accurate prediction methods of protein subnuclear localizations rely on the cooperation between informative features and classifier design. Support vector machine (SVM) based learning methods are shown effective for predictions of protein subcellular and subnuclear localizations. This study proposes an evolutionary support vector machine (ESVM) based classifier with automatic selection from a large set of physicochemical composition (PCC) features to design an accurate system for predicting protein subnuclear localization, named ProLoc. ESVM using an inheritable genetic algorithm combined with SVM can automatically determine the best number m of PCC features and identify m out of 526 PCC features simultaneously. To evaluate ESVM, this study uses two datasets SNL6 and SNL9, which have 504 proteins localized in 6 subnuclear compartments and 370 proteins localized in 9 subnuclear compartments. Using a leave-one-out cross-validation, ProLoc utilizing the selected m=33 and 28 PCC features has accuracies of 56.37% for SNL6 and 72.82% for SNL9, which are better than 51.4% for the SVM-based system using k-peptide composition features applied on SNL6, and 64.32% for an optimized evidence-theoretic k-nearest neighbor classifier utilizing pseudo amino acid composition applied on SNL9, respectively.

Mesh:

Substances:

Year:  2007        PMID: 17291684     DOI: 10.1016/j.biosystems.2007.01.001

Source DB:  PubMed          Journal:  Biosystems        ISSN: 0303-2647            Impact factor:   1.973


  21 in total

Review 1.  Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM).

Authors:  Michael Fernandez; Julio Caballero; Leyden Fernandez; Akinori Sarai
Journal:  Mol Divers       Date:  2010-03-20       Impact factor: 2.943

2.  An in silico strategy identified the target gene candidates regulated by dehydration responsive element binding proteins (DREBs) in Arabidopsis genome.

Authors:  Shichen Wang; Shuo Yang; Yuejia Yin; Xiaosen Guo; Shan Wang; Dongyun Hao
Journal:  Plant Mol Biol       Date:  2008-10-18       Impact factor: 4.076

3.  Fuzzy clustering of physicochemical and biochemical properties of amino acids.

Authors:  Indrajit Saha; Ujjwal Maulik; Sanghamitra Bandyopadhyay; Dariusz Plewczynski
Journal:  Amino Acids       Date:  2011-10-13       Impact factor: 3.520

Review 4.  Understanding molecular mechanisms of disease through spatial proteomics.

Authors:  Sandra Pankow; Salvador Martínez-Bartolomé; Casimir Bamberger; John R Yates
Journal:  Curr Opin Chem Biol       Date:  2018-10-09       Impact factor: 8.822

5.  PNAC: a protein nucleolar association classifier.

Authors:  Michelle S Scott; François-Michel Boisvert; Angus I Lamond; Geoffrey J Barton
Journal:  BMC Genomics       Date:  2011-01-27       Impact factor: 3.969

6.  Exploiting heterogeneous features to improve in silico prediction of peptide status - amyloidogenic or non-amyloidogenic.

Authors:  Smitha Sunil Kumaran Nair; N V Subba Reddy; K S Hareesha
Journal:  BMC Bioinformatics       Date:  2011-11-30       Impact factor: 3.169

7.  POPISK: T-cell reactivity prediction using support vector machines and string kernels.

Authors:  Chun-Wei Tung; Matthias Ziehm; Andreas Kämper; Oliver Kohlbacher; Shinn-Ying Ho
Journal:  BMC Bioinformatics       Date:  2011-11-15       Impact factor: 3.169

8.  Prediction and analysis of antibody amyloidogenesis from sequences.

Authors:  Chyn Liaw; Chun-Wei Tung; Shinn-Ying Ho
Journal:  PLoS One       Date:  2013-01-07       Impact factor: 3.240

9.  An ensemble method for predicting subnuclear localizations from primary protein structures.

Authors:  Guo Sheng Han; Zu Guo Yu; Vo Anh; Anaththa P D Krishnajith; Yu-Chu Tian
Journal:  PLoS One       Date:  2013-02-27       Impact factor: 3.240

10.  Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.

Authors:  Emily Chia-Yu Su; Jia-Ming Chang; Cheng-Wei Cheng; Ting-Yi Sung; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2012-12-13       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.