Discriminative motif finding for predicting protein subcellular localization.

Tien-ho Lin, Robert F Murphy, Ziv Bar-Joseph.

Abstract

Many methods have been described to predict the subcellular location of proteins from sequence information. However, most of these methods either rely on global sequence properties or use a set of known protein targeting motifs to predict protein localization. Here, we develop and test a novel method that identifies potential targeting motifs using a discriminative approach based on hidden Markov models (discriminative HMMs). These models search for motifs that are present in one compartment but absent from other, nearby compartments by utilizing a hierarchical structure that mimics the protein sorting mechanism. We show that both discriminative motif finding and the hierarchical structure improve localization prediction on a benchmark data set of yeast proteins. The motifs identified can be mapped to known targeting motifs, and they are more conserved than the average protein sequence. Using our motif-based predictions, we can identify potential annotation errors in public databases for the location of some of the proteins. A software implementation and the data set described in this paper are available from http://murphylab.web.cmu.edu/software/2009_TCBB_motif/.
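
To make the idea of a discriminative motif concrete, here is a minimal Python sketch. It is not the authors' discriminative HMM and it ignores the hierarchical compartment structure; it swaps in a much simpler stand-in that scores each fixed-length k-mer by how much more often it occurs in a target compartment than in a background compartment. The function names, the choice of k = 3, and the toy sequences are all assumptions made for illustration and do not come from the paper or its data set.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of distinct k-mers occurring in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def discriminative_scores(target, background, k=3):
    """Score each k-mer seen in the target set by
    (fraction of target sequences containing it)
    minus (fraction of background sequences containing it)."""
    pos = Counter(m for s in target for m in kmers(s, k))
    neg = Counter(m for s in background for m in kmers(s, k))
    return {m: pos[m] / len(target) - neg[m] / len(background) for m in pos}


# Hypothetical toy sequences (assumption, not taken from the paper).
target = ["MAGTSKL", "MKVLSKL", "MPQRSKL"]
background = ["MAGTPQR", "MKVLDEA", "MSKLGTA"]

scores = discriminative_scores(target, background)
best = max(scores, key=scores.get)
print(best, round(scores[best], 3))
```

A k-mer that is common in both sets scores near zero, which is the property the discriminative approach relies on: only patterns that separate one compartment from another are rewarded. A real implementation would replace exact k-mer matching with a probabilistic motif model.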

Year: 2011; PMID: 21233524; PMCID: PMC3050600; DOI: 10.1109/TCBB.2009.82

Source DB: PubMed; Journal: IEEE/ACM Trans Comput Biol Bioinform; ISSN: 1545-5963; Impact factor: 3.710
