Literature DB >> 19908156

Prediction of subcellular location of mycobacterial protein using feature selection techniques.

Hao Lin1, Hui Ding, Feng-Biao Guo, Jian Huang.   

Abstract

Mycobacterium tuberculosis is the primary pathogen causing tuberculosis, which is one of the most prevalent infectious diseases. The subcellular location of mycobacterial proteins can provide essential clues for proteins function research and drug discovery. Therefore, it is highly desirable to develop a computational method for fast and reliable prediction of subcellular location of mycobacterial proteins. In this study, we developed a support vector machine (SVM) based method to predict subcellular location of mycobacterial proteins. A total of 444 non-redundant mycobacterial proteins were used to train and test proposed model by using jackknife cross validation. By selecting traditional pseudo amino acid composition (PseAAC) as parameters, the overall accuracy of 83.3% was achieved. Moreover, a feature selection technique was developed to find out an optimal amount of PseAAC for improving predictive performance. The optimal amount of PseAAC improved overall accuracy from 83.3 to 87.2%. In addition, the reduced amino acids in N-terminus and non N-terminus of proteins were combined in models for further improving predictive successful rate. As a result, the maximum overall accuracy of 91.2% was achieved with average accuracy of 79.7%. The proposed model provides highly useful information for further experimental research. The prediction model can be accessed free of charge at http://cobi.uestc.edu.cn/cobi/people/hlin/webserver.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19908156     DOI: 10.1007/s11030-009-9205-1

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  30 in total

1.  Artificial neural networks for prediction of mycobacterial promoter sequences.

Authors:  Rupali N Kalate; Sanjeev S Tambe; Bhaskar D Kulkarni
Journal:  Comput Biol Chem       Date:  2003-12       Impact factor: 2.877

2.  Virus-PLoc: a fusion classifier for predicting the subcellular localization of viral proteins within host and virus-infected cells.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Biopolymers       Date:  2007-02-15       Impact factor: 2.505

3.  Prediction of protein subcellular localization.

Authors:  Chin-Sheng Yu; Yu-Ching Chen; Chih-Hao Lu; Jenn-Kang Hwang
Journal:  Proteins       Date:  2006-08-15

4.  Gpos-PLoc: an ensemble classifier for predicting subcellular localization of Gram-positive bacterial proteins.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Protein Eng Des Sel       Date:  2007-01-23       Impact factor: 1.650

5.  Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.

Authors:  H-B Shen; J Yang; K-C Chou
Journal:  Amino Acids       Date:  2007-01-19       Impact factor: 3.520

6.  Prediction of membrane proteins in Mycobacterium tuberculosis using a support vector machine algorithm.

Authors:  Joanne I Yeh; Lisong Mao
Journal:  J Comput Biol       Date:  2006 Jan-Feb       Impact factor: 1.479

Review 7.  Proteomics, networks and connectivity indices.

Authors:  Humberto González-Díaz; Yenny González-Díaz; Lourdes Santana; Florencio M Ubeira; Eugenio Uriarte
Journal:  Proteomics       Date:  2008-02       Impact factor: 3.984

8.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence.

Authors:  O Emanuelsson; H Nielsen; S Brunak; G von Heijne
Journal:  J Mol Biol       Date:  2000-07-21       Impact factor: 5.469

9.  Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition.

Authors:  Ying-Li Chen; Qian-Zhong Li
Journal:  J Theor Biol       Date:  2007-05-18       Impact factor: 2.691

10.  Predicting subcellular localization of mycobacterial proteins by using Chou's pseudo amino acid composition.

Authors:  Hao Lin; Hui Ding; Feng-Biao Guo; An-Ying Zhang; Jian Huang
Journal:  Protein Pept Lett       Date:  2008       Impact factor: 1.890

View more
  14 in total

1.  Metabolomic Profiling of Plasma from Patients with Tuberculosis by Use of Untargeted Mass Spectrometry Reveals Novel Biomarkers for Diagnosis.

Authors:  Susanna K P Lau; Kim-Chung Lee; Shirly O T Curreem; Wang-Ngai Chow; Kelvin K W To; Ivan F N Hung; Deborah T Y Ho; Siddharth Sridhar; Iris W S Li; Vanessa S Y Ding; Eleanor W F Koo; Chi-Fong Wong; Sidney Tam; Ching-Wan Lam; Kwok-Yung Yuen; Patrick C Y Woo
Journal:  J Clin Microbiol       Date:  2015-09-16       Impact factor: 5.948

2.  Protein binding site prediction by combining hidden Markov support vector machine and profile-based propensities.

Authors:  Bin Liu; Bingquan Liu; Fule Liu; Xiaolong Wang
Journal:  ScientificWorldJournal       Date:  2014-07-14

3.  Identification of specific metabolites in culture supernatant of Mycobacterium tuberculosis using metabolomics: exploration of potential biomarkers.

Authors:  Susanna K P Lau; Ching-Wan Lam; Shirly O T Curreem; Kim-Chung Lee; Candy C Y Lau; Wang-Ngai Chow; Antonio H Y Ngan; Kelvin K W To; Jasper F W Chan; Ivan F N Hung; Wing-Cheong Yam; Kwok-Yung Yuen; Patrick C Y Woo
Journal:  Emerg Microbes Infect       Date:  2015-01-28       Impact factor: 7.163

4.  Prediction of aptamer-protein interacting pairs using an ensemble classifier in combination with various protein sequence attributes.

Authors:  Lina Zhang; Chengjin Zhang; Rui Gao; Runtao Yang; Qing Song
Journal:  BMC Bioinformatics       Date:  2016-05-31       Impact factor: 3.169

5.  Predicting the Types of Ion Channel-Targeted Conotoxins Based on AVC-SVM Model.

Authors:  Wang Xianfang; Wang Junmei; Wang Xiaolei; Zhang Yue
Journal:  Biomed Res Int       Date:  2017-04-09       Impact factor: 3.411

6.  iRSpot-Pse6NC: Identifying recombination spots in Saccharomyces cerevisiae by incorporating hexamer composition into general PseKNC.

Authors:  Hui Yang; Wang-Ren Qiu; Guoqing Liu; Feng-Biao Guo; Wei Chen; Kuo-Chen Chou; Hao Lin
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

7.  Comparative genomics analysis of Mycobacterium ulcerans for the identification of putative essential genes and therapeutic candidates.

Authors:  Azeem Mehmood Butt; Izza Nasrullah; Shifa Tahir; Yigang Tong
Journal:  PLoS One       Date:  2012-08-13       Impact factor: 3.240

Review 8.  Survey of Natural Language Processing Techniques in Bioinformatics.

Authors:  Zhiqiang Zeng; Hua Shi; Yun Wu; Zhiling Hong
Journal:  Comput Math Methods Med       Date:  2015-10-07       Impact factor: 2.238

9.  iCTX-type: a sequence-based predictor for identifying the types of conotoxins in targeting ion channels.

Authors:  Hui Ding; En-Ze Deng; Lu-Feng Yuan; Li Liu; Hao Lin; Wei Chen; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2014-06-01       Impact factor: 3.411

10.  Identifying the subfamilies of voltage-gated potassium channels using feature selection technique.

Authors:  Wei-Xin Liu; En-Ze Deng; Wei Chen; Hao Lin
Journal:  Int J Mol Sci       Date:  2014-07-22       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.