Literature DB >> 12176924

Predicting protein cellular localization using a domain projection method.

Richard Mott1, Jörg Schultz, Peer Bork, Chris P Ponting.   

Abstract

We investigate the co-occurrence of domain families in eukaryotic proteins to predict protein cellular localization. Approximately half (300) of SMART domains form a "small-world network", linked by no more than seven degrees of separation. Projection of the domains onto two-dimensional space reveals three clusters that correspond to cellular compartments containing secreted, cytoplasmic, and nuclear proteins. The projection method takes into account the existence of "bridging" domains, that is, instances where two domains might not occur with each other but frequently co-occur with a third domain; in such circumstances the domains are neighbors in the projection. While the majority of domains are specific to a compartment ("locale"), and hence may be used to localize any protein that contains such a domain, a small subset of domains either are present in multiple locales or occur in transmembrane proteins. Comparison with previously annotated proteins shows that SMART domain data used with this approach can predict, with 92% accuracy, the localizations of 23% of eukaryotic proteins. The coverage and accuracy will increase with improvements in domain database coverage. This method is complementary to approaches that use amino-acid composition or identify sorting sequences; these methods may be combined to further enhance prediction accuracy.

Mesh:

Substances:

Year:  2002        PMID: 12176924      PMCID: PMC186639          DOI: 10.1101/gr.96802

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  28 in total

1.  SMART: a web-based tool for the study of genetically mobile domains.

Authors:  J Schultz; R R Copley; T Doerks; C P Ponting; P Bork
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 3.  Evolution of domain families.

Authors:  C P Ponting; J Schultz; R R Copley; M A Andrade; P Bork
Journal:  Adv Protein Chem       Date:  2000

Review 4.  Protein sorting signals and prediction of subcellular localization.

Authors:  K Nakai
Journal:  Adv Protein Chem       Date:  2000

5.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

6.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 7.  Wanted: subcellular localization of proteins based on sequence.

Authors:  F Eisenhaber; P Bork
Journal:  Trends Cell Biol       Date:  1998-04       Impact factor: 20.808

8.  Using neural networks for prediction of the subcellular location of proteins.

Authors:  A Reinhardt; T Hubbard
Journal:  Nucleic Acids Res       Date:  1998-05-01       Impact factor: 16.971

9.  The apoptosis inhibitor gene API2 and a novel 18q gene, MLT, are recurrently rearranged in the t(11;18)(q21;q21) associated with mucosa-associated lymphoid tissue lymphomas.

Authors:  J Dierlamm; M Baens; I Wlodarska; M Stefanova-Ouzounova; J M Hernandez; D K Hossfeld; C De Wolf-Peeters; A Hagemeijer; H Van den Berghe; P Marynen
Journal:  Blood       Date:  1999-06-01       Impact factor: 22.113

10.  Evaluation of human-readable annotation in biomolecular sequence databases with biological rule libraries.

Authors:  F Eisenhaber; P Bork
Journal:  Bioinformatics       Date:  1999 Jul-Aug       Impact factor: 6.937

View more
  30 in total

1.  Enhanced protein domain discovery by using language modeling techniques from speech recognition.

Authors:  Lachlan Coin; Alex Bateman; Richard Durbin
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-31       Impact factor: 11.205

2.  Comparative analysis of protein domain organization.

Authors:  Yuzhen Ye; Adam Godzik
Journal:  Genome Res       Date:  2004-03       Impact factor: 9.043

3.  Predicting subcellular localization via protein motif co-occurrence.

Authors:  Michelle S Scott; David Y Thomas; Michael T Hallett
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

4.  Combining machine learning and homology-based approaches to accurately predict subcellular localization in Arabidopsis.

Authors:  Rakesh Kaundal; Reena Saini; Patrick X Zhao
Journal:  Plant Physiol       Date:  2010-07-20       Impact factor: 8.340

5.  Improved membrane protein topology prediction by domain assignments.

Authors:  Andreas Bernsel; Gunnar Von Heijne
Journal:  Protein Sci       Date:  2005-07       Impact factor: 6.725

6.  Extracellular proteome of human hepatoma cell, HepG2 analyzed using two-dimensional liquid chromatography coupled with tandem mass spectrometry.

Authors:  Ryo Yamashita; Yuko Fujiwara; Kohei Ikari; Keiko Hamada; Asuka Otomo; Kazuki Yasuda; Mitsuhiko Noda; Yasushi Kaburagi
Journal:  Mol Cell Biochem       Date:  2006-11-16       Impact factor: 3.396

7.  Computational prediction of human proteins that can be secreted into the bloodstream.

Authors:  Juan Cui; Qi Liu; David Puett; Ying Xu
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

8.  Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.

Authors:  Hsin-Nan Lin; Ching-Tai Chen; Ting-Yi Sung; Shinn-Ying Ho; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2009-12-03       Impact factor: 3.169

9.  Predicting the subcellular localization of human proteins using machine learning and exploratory data analysis.

Authors:  George K Acquaah-Mensah; Sonia M Leach; Chittibabu Guda
Journal:  Genomics Proteomics Bioinformatics       Date:  2006-05       Impact factor: 7.691

10.  d-Omix: a mixer of generic protein domain analysis tools.

Authors:  Duangdao Wichadakul; Somrak Numnark; Supawadee Ingsriswang
Journal:  Nucleic Acids Res       Date:  2009-05-21       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.