Literature DB >> 19806439

Multi label learning for prediction of human protein subcellular localizations.

Lin Zhu1, Jie Yang, Hong-Bin Shen.   

Abstract

Predicting protein subcellular locations has attracted much attention in the past decade. However, one of the most challenging problems is that many proteins were found simultaneously existing in, or moving between, two or more different cell components in a eukaryotic cell. Seldom previous predictors were able to deal with such multiplex proteins although they have extremely important implications in future drug discovery in terms of their specific subcellular targeting. Approximately 20% of the human proteome consists of such multiplex proteins with multiple sample labels. In order to efficiently handle such multiplex human proteins, we have developed a novel multi-label (ML) learning and prediction framework called ML-PLoc, which decomposes the multi-label prediction problem into multiple independent binary classification problems. ML-PLoc is constructed based on support vector machine (SVM) and sequential evolution information. Experimental results show that ML-PLoc can achieve an overall accuracy 64.6% and recall ratio 67.2% on a benchmark dataset consisting of 14 human subcellular locations, and is very powerful for dealing with multiplex proteins. The current approach represents a new strategy to deal with the multi-label biological problems. ML-PLoc software is freely available for academic use at: http://www.csbio.sjtu.edu.cn/bioinf/ML-PLoc .

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19806439     DOI: 10.1007/s10930-009-9205-0

Source DB:  PubMed          Journal:  Protein J        ISSN: 1572-3887            Impact factor:   2.371


  25 in total

Review 1.  Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors:  A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  2001-07-15       Impact factor: 16.971

2.  Predicting subcellular localization of proteins using machine-learned classifiers.

Authors:  Z Lu; D Szafron; R Greiner; P Lu; D S Wishart; B Poulin; J Anvik; C Macdonell; R Eisner
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

Review 3.  Searching for hypothetical proteins: theory and practice based upon original data and literature.

Authors:  Gert Lubec; Leila Afjehi-Sadat; Jae-Won Yang; Julius Paul Pradeep John
Journal:  Prog Neurobiol       Date:  2005-11-04       Impact factor: 11.685

4.  Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order, and similarity search.

Authors:  Aarti Garg; Manoj Bhasin; Gajendra P S Raghava
Journal:  J Biol Chem       Date:  2005-01-12       Impact factor: 5.157

5.  Prediction of mitochondrial proteins using discrete wavelet transform.

Authors:  Lin Jiang; Menglong Li; Zhining Wen; Kelong Wang; Yuanbo Diao
Journal:  Protein J       Date:  2006-06       Impact factor: 2.371

6.  Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  Nat Protoc       Date:  2008       Impact factor: 13.491

7.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence.

Authors:  O Emanuelsson; H Nielsen; S Brunak; G von Heijne
Journal:  J Mol Biol       Date:  2000-07-21       Impact factor: 5.469

8.  Genetic programming for creating Chou's pseudo amino acid based features for submitochondria localization.

Authors:  Loris Nanni; Alessandra Lumini
Journal:  Amino Acids       Date:  2008-01-04       Impact factor: 3.520

9.  sRNAMap: genomic maps for small non-coding RNAs, their regulators and their targets in microbial genomes.

Authors:  Hsi-Yuan Huang; Heng-Yi Chang; Chih-Hung Chou; Ching-Ping Tseng; Shinn-Ying Ho; Chi-Dung Yang; Yih-Wei Ju; Hsien-Da Huang
Journal:  Nucleic Acids Res       Date:  2008-11-16       Impact factor: 16.971

10.  A comprehensive resource for integrating and displaying protein post-translational modifications.

Authors:  Tzong-Yi Lee; Justin Bo-Kai Hsu; Wen-Chi Chang; Ting-Yuan Wang; Po-Chiang Hsu; Hsien-Da Huang
Journal:  BMC Res Notes       Date:  2009-06-23
View more
  5 in total

1.  Prediction of ketoacyl synthase family using reduced amino acid alphabets.

Authors:  Wei Chen; Pengmian Feng; Hao Lin
Journal:  J Ind Microbiol Biotechnol       Date:  2011-10-26       Impact factor: 3.346

2.  An image-based multi-label human protein subcellular localization predictor (iLocator) reveals protein mislocalizations in cancer tissues.

Authors:  Ying-Ying Xu; Fan Yang; Yang Zhang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2013-06-04       Impact factor: 6.937

3.  Multi-label multi-kernel transfer learning for human protein subcellular localization.

Authors:  Suyu Mei
Journal:  PLoS One       Date:  2012-06-13       Impact factor: 3.240

4.  iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition.

Authors:  Wei Chen; Peng-Mian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2013-01-08       Impact factor: 16.971

5.  Many local pattern texture features: which is better for image-based multilabel human protein subcellular localization classification?

Authors:  Fan Yang; Ying-Ying Xu; Hong-Bin Shen
Journal:  ScientificWorldJournal       Date:  2014-06-24
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.