Literature DB >> 12441382

Sequence conserved for subcellular localization.

Rajesh Nair1, Burkhard Rost.   

Abstract

The more proteins diverged in sequence, the more difficult it becomes for bioinformatics to infer similarities of protein function and structure from sequence. The precise thresholds used in automated genome annotations depend on the particular aspect of protein function transferred by homology. Here, we presented the first large-scale analysis of the relation between sequence similarity and identity in subcellular localization. Three results stood out: (1) The subcellular compartment is generally more conserved than what might have been expected given that short sequence motifs like nuclear localization signals can alter the native compartment; (2) the sequence conservation of localization is similar between different compartments; and (3) it is similar to the conservation of structure and enzymatic activity. In particular, we found the transition between the regions of conserved and nonconserved localization to be very sharp, although the thresholds for conservation were less well defined than for structure and enzymatic activity. We found that a simple measure for sequence similarity accounting for pairwise sequence identity and alignment length, the HSSP distance, distinguished accurately between protein pairs of identical and different localizations. In fact, BLAST expectation values outperformed the HSSP distance only for alignments in the subtwilight zone. We succeeded in slightly improving the accuracy of inferring localization through homology by fine tuning the thresholds. Finally, we applied our results to the entire SWISS-PROT database and five entirely sequenced eukaryotes.

Mesh:

Substances:

Year:  2002        PMID: 12441382      PMCID: PMC2373743          DOI: 10.1110/ps.0207402

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  69 in total

Review 1.  Protein folds, functions and evolution.

Authors:  J M Thornton; C A Orengo; A E Todd; F M Pearl
Journal:  J Mol Biol       Date:  1999-10-22       Impact factor: 5.469

2.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.

Authors:  C A Wilson; J Kreychman; M Gerstein
Journal:  J Mol Biol       Date:  2000-03-17       Impact factor: 5.469

Review 3.  Localization and processing of CLN3, the protein associated to Batten disease: where is it and what does it do?

Authors:  D A Pearce
Journal:  J Neurosci Res       Date:  2000-01-01       Impact factor: 4.164

Review 4.  Annotating eukaryote genomes.

Authors:  S Lewis; M Ashburner; M G Reese
Journal:  Curr Opin Struct Biol       Date:  2000-06       Impact factor: 6.809

5.  Bridging the gap between sequence and function.

Authors:  E V Koonin
Journal:  Trends Genet       Date:  2000-01       Impact factor: 11.639

6.  Practical limits of function prediction.

Authors:  D Devos; A Valencia
Journal:  Proteins       Date:  2000-10-01

7.  Sensitive sequence comparison as protein function predictor.

Authors:  K Pawłowski; L Jaroszewski; L Rychlewski; A Godzik
Journal:  Pac Symp Biocomput       Date:  2000

Review 8.  Regulated nuclear localization of stress-responsive factors: how the nuclear trafficking of protein kinases and transcription factors contributes to cell survival.

Authors:  P Ferrigno; P A Silver
Journal:  Oncogene       Date:  1999-11-01       Impact factor: 9.867

9.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

10.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence.

Authors:  O Emanuelsson; H Nielsen; S Brunak; G von Heijne
Journal:  J Mol Biol       Date:  2000-07-21       Impact factor: 5.469

View more
  57 in total

1.  UniqueProt: Creating representative protein sequence sets.

Authors:  Sven Mika; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  The PredictProtein server.

Authors:  Burkhard Rost; Jinfeng Liu
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  LOC3D: annotate sub-cellular localization for protein structures.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

4.  The PredictProtein server.

Authors:  Burkhard Rost; Guy Yachdav; Jinfeng Liu
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

5.  LOCnet and LOCtarget: sub-cellular localization for structural genomics targets.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

6.  Prediction of subcellular location of mycobacterial protein using feature selection techniques.

Authors:  Hao Lin; Hui Ding; Feng-Biao Guo; Jian Huang
Journal:  Mol Divers       Date:  2009-11-12       Impact factor: 2.943

7.  The plasma membrane proteome of Medicago truncatula roots as modified by arbuscular mycorrhizal symbiosis.

Authors:  Achref Aloui; Ghislaine Recorbet; Christelle Lemaître-Guillier; Arnaud Mounier; Thierry Balliau; Michel Zivy; Daniel Wipf; Eliane Dumas-Gaudot
Journal:  Mycorrhiza       Date:  2017-07-19       Impact factor: 3.387

8.  Sequence-based feature prediction and annotation of proteins.

Authors:  Agnieszka S Juncker; Lars J Jensen; Andrea Pierleoni; Andreas Bernsel; Michael L Tress; Peer Bork; Gunnar von Heijne; Alfonso Valencia; Christos A Ouzounis; Rita Casadio; Søren Brunak
Journal:  Genome Biol       Date:  2009-02-02       Impact factor: 13.583

Review 9.  Computational and experimental approaches to chart the Escherichia coli cell-envelope-associated proteome and interactome.

Authors:  Juan Javier Díaz-Mejía; Mohan Babu; Andrew Emili
Journal:  FEMS Microbiol Rev       Date:  2008-11-27       Impact factor: 16.408

10.  MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction.

Authors:  Torsten Blum; Sebastian Briesemeister; Oliver Kohlbacher
Journal:  BMC Bioinformatics       Date:  2009-09-01       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.