Literature DB >> 17235454

Prediction of protein subcellular localization by support vector machines using multi-scale energy and pseudo amino acid composition.

J-Y Shi1, S-W Zhang, Q Pan, Y-M Cheng, J Xie.   

Abstract

As more and more genomes have been discovered in recent years, there is an urgent need to develop a reliable method to predict the subcellular localization for the explosion of newly found proteins. However, many well-known prediction methods based on amino acid composition have problems utilizing the sequence-order information. Here, based on the concept of Chou's pseudo amino acid composition (PseAA), a new feature extraction method, the multi-scale energy (MSE) approach, is introduced to incorporate the sequence-order information. First, a protein sequence was mapped to a digital signal using the amino acid index. Then, by wavelet transform, the mapped signal was broken down into several scales in which the energy factors were calculated and further formed into an MSE feature vector. Following this, combining this MSE feature vector with amino acid composition (AA), we constructed a series of MSEPseAA feature vectors to represent the protein subcellular localization sequences. Finally, according to a new kind of normalization approach, the MSEPseAA feature vectors were normalized to form the improved MSEPseAA vectors, named as IEPseAA. Using the technique of IEPseAA, C-support vector machine (C-SVM) and three multi-class SVMs strategies, quite promising results were obtained, indicating that MSE is quite effective in reflecting the sequence-order effects and might become a useful tool for predicting the other attributes of proteins as well.

Mesh:

Substances:

Year:  2007        PMID: 17235454     DOI: 10.1007/s00726-006-0475-y

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  13 in total

1.  A comprehensive proteogenomic study of the human Brucella vaccine strain 104 M.

Authors:  Xiaodong Zai; Qiaoling Yang; Kun Liu; Ruihua Li; Mengying Qian; Taoran Zhao; Yaohui Li; Ying Yin; Dayong Dong; Ling Fu; Shanhu Li; Junjie Xu; Wei Chen
Journal:  BMC Genomics       Date:  2017-05-23       Impact factor: 3.969

2.  A predicted physicochemically distinct sub-proteome associated with the intracellular organelle of the anammox bacterium Kuenenia stuttgartiensis.

Authors:  Marnix H Medema; Miaomiao Zhou; Sacha A F T van Hijum; Jolein Gloerich; Hans J C T Wessels; Roland J Siezen; Marc Strous
Journal:  BMC Genomics       Date:  2010-05-12       Impact factor: 3.969

3.  PlantLoc: an accurate web server for predicting plant protein subcellular localization by substantiality motif.

Authors:  Shengnan Tang; Tonghua Li; Peisheng Cong; Wenwei Xiong; Zhiheng Wang; Jiangming Sun
Journal:  Nucleic Acids Res       Date:  2013-05-31       Impact factor: 16.971

4.  iNR-Drug: predicting the interaction of drugs with nuclear receptors in cellular networking.

Authors:  Yue-Nong Fan; Xuan Xiao; Jian-Liang Min; Kuo-Chen Chou
Journal:  Int J Mol Sci       Date:  2014-03-19       Impact factor: 5.923

5.  acACS: improving the prediction accuracy of protein subcellular locations and protein classification by incorporating the average chemical shifts composition.

Authors:  Guo-Liang Fan; Yan-Ling Liu; Yong-Chun Zuo; Han-Xue Mei; Yi Rang; Bao-Yan Hou; Yan Zhao
Journal:  ScientificWorldJournal       Date:  2014-07-02

Review 6.  A survey of computational intelligence techniques in protein function prediction.

Authors:  Arvind Kumar Tiwari; Rajeev Srivastava
Journal:  Int J Proteomics       Date:  2014-12-11

7.  Prediction of candidate primary immunodeficiency disease genes using a support vector machine learning approach.

Authors:  Shivakumar Keerthikumar; Sahely Bhadra; Kumaran Kandasamy; Rajesh Raju; Y L Ramachandra; Chiranjib Bhattacharyya; Kohsuke Imai; Osamu Ohara; Sujatha Mohan; Akhilesh Pandey
Journal:  DNA Res       Date:  2009-10-03       Impact factor: 4.458

8.  'Unite and conquer': enhanced prediction of protein subcellular localization by integrating multiple specialized tools.

Authors:  Yao Qing Shen; Gertraud Burger
Journal:  BMC Bioinformatics       Date:  2007-10-29       Impact factor: 3.169

9.  A method for probabilistic mapping between protein structure and function taxonomies through cross training.

Authors:  Kshitiz Gupta; Vivek Sehgal; Andre Levchenko
Journal:  BMC Struct Biol       Date:  2008-10-03

10.  iEzy-drug: a web server for identifying the interaction between enzymes and drugs in cellular networking.

Authors:  Jian-Liang Min; Xuan Xiao; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2013-11-26       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.