Literature DB >> 11567088

Comparing function and structure between entire proteomes.

J Liu1, B Rost.   

Abstract

More than 30 organisms have been sequenced entirely. Here, we applied a variety of simple bioinformatics tools to analyze 29 proteomes for representatives from all three kingdoms: eukaryotes, prokaryotes, and archaebacteria. We confirmed that eukaryotes have relatively more long proteins than prokaryotes and archaes, and that the overall amino acid composition is similar among the three. We predicted that approximately 15%-30% of all proteins contained transmembrane helices. We could not find a correlation between the content of membrane proteins and the complexity of the organism. In particular, we did not find significantly higher percentages of helical membrane proteins in eukaryotes than in prokaryotes or archae. However, we found more proteins with seven transmembrane helices in eukaryotes and more with six and 12 transmembrane helices in prokaryotes. We found twice as many coiled-coil proteins in eukaryotes (10%) as in prokaryotes and archaes (4%-5%), and we predicted approximately 15%-25% of all proteins to be secreted by most eukaryotes and prokaryotes. Every tenth protein had no known homolog in current databases, and 30%-40% of the proteins fell into structural families with >100 members. A classification by cellular function verified that eukaryotes have a higher proportion of proteins for communication with the environment. Finally, we found at least one homolog of experimentally known structure for approximately 20%-45% of all proteins; the regions with structural homology covered 20%-30% of all residues. These numbers may or may not suggest that there are 1200-2600 folds in the universe of protein structures. All predictions are available at http://cubic.bioc.columbia.edu/genomes.

Mesh:

Substances:

Year:  2001        PMID: 11567088      PMCID: PMC2374214          DOI: 10.1110/ps.10101

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  70 in total

1.  Protein folds and families: sequence and structure alignments.

Authors:  L Holm; C Sander
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

Review 2.  100,000 protein structures for the biologist.

Authors:  A Sali
Journal:  Nat Struct Biol       Date:  1998-12

3.  Topology prediction for helical transmembrane proteins at 86% accuracy.

Authors:  B Rost; P Fariselli; R Casadio
Journal:  Protein Sci       Date:  1996-08       Impact factor: 6.725

4.  EUCLID: automatic classification of proteins in functional classes by their database annotations.

Authors:  J Tamames; C Ouzounis; G Casari; C Sander; A Valencia
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

5.  Complete sequence and gene organization of the genome of a hyper-thermophilic archaebacterium, Pyrococcus horikoshii OT3.

Authors:  Y Kawarabayasi; M Sawada; H Horikawa; Y Haikawa; Y Hino; S Yamamoto; M Sekine; S Baba; H Kosugi; A Hosoyama; Y Nagai; M Sakai; K Ogura; R Otsuka; H Nakazawa; M Takamiya; Y Ohfuku; T Funahashi; T Tanaka; Y Kudoh; J Yamazaki; N Kushida; A Oguchi; K Aoki; H Kikuchi
Journal:  DNA Res       Date:  1998-04-30       Impact factor: 4.458

6.  How representative are the known structures of the proteins in a complete genome? A comprehensive structural census.

Authors:  M Gerstein
Journal:  Fold Des       Date:  1998

7.  Prediction and analysis of coiled-coil structures.

Authors:  A Lupas
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

8.  Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions.

Authors:  T Kaneko; S Sato; H Kotani; A Tanaka; E Asamizu; Y Nakamura; N Miyajima; M Hirosawa; M Sugiura; S Sasamoto; T Kimura; T Hosouchi; A Matsuno; A Muraki; N Nakazaki; K Naruo; S Okumura; S Shimpo; C Takeuchi; T Wada; A Watanabe; M Yamada; M Yasuda; S Tabata
Journal:  DNA Res       Date:  1996-06-30       Impact factor: 4.458

9.  Phylogenetic occurrence of coiled coil proteins: implications for tissue structure in metazoa via a coiled coil tissue matrix.

Authors:  P R Odgren; L W Harvie; E G Fey
Journal:  Proteins       Date:  1996-04

10.  The genome sequence of Rickettsia prowazekii and the origin of mitochondria.

Authors:  S G Andersson; A Zomorodipour; J O Andersson; T Sicheritz-Pontén; U C Alsmark; R M Podowski; A K Näslund; A S Eriksson; H H Winkler; C G Kurland
Journal:  Nature       Date:  1998-11-12       Impact factor: 49.962

View more
  91 in total

1.  Specificity in transmembrane helix-helix interactions can define a hierarchy of stability for sequence variants.

Authors:  K G Fleming; D M Engelman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-11-27       Impact factor: 11.205

2.  The Arabidopsis tail-anchored protein PEROXISOMAL AND MITOCHONDRIAL DIVISION FACTOR1 is involved in the morphogenesis and proliferation of peroxisomes and mitochondria.

Authors:  Kyaw Aung; Jianping Hu
Journal:  Plant Cell       Date:  2011-12-06       Impact factor: 11.277

3.  Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination.

Authors:  Gordana Apic; Wolfgang Huber; Sarah A Teichmann
Journal:  J Struct Funct Genomics       Date:  2003

4.  TMPDB: a database of experimentally-characterized transmembrane topologies.

Authors:  Masami Ikeda; Masafumi Arai; Toshikatsu Okuno; Toshio Shimizu
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

5.  Transmembrane helix predictions revisited.

Authors:  Chien Peter Chen; Andrew Kernytsky; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

6.  Sequence conserved for subcellular localization.

Authors:  Rajesh Nair; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

7.  Long membrane helices and short loops predicted less accurately.

Authors:  Chien Peter Chen; Burkhard Rost
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

8.  Servers for sequence-structure relationship analysis and prediction.

Authors:  Zsuzsanna Dosztányi; Csaba Magyar; Gábor E Tusnády; Miklós Cserzo; András Fiser; István Simon
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

9.  Global analysis of predicted proteomes: functional adaptation of physical properties.

Authors:  Christopher G Knight; Rees Kassen; Holger Hebestreit; Paul B Rainey
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-18       Impact factor: 11.205

10.  Proteome-wide functional classification and identification of prokaryotic transmembrane proteins by transmembrane topology similarity comparison.

Authors:  Masafumi Arai; Kosuke Okumura; Masanobu Satake; Toshio Shimizu
Journal:  Protein Sci       Date:  2004-08       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.