Literature DB >> 9927481

Distribution of protein folds in the three superkingdoms of life.

Y I Wolf1, S E Brenner, P A Bash, E V Koonin.   

Abstract

A sensitive protein-fold recognition procedure was developed on the basis of iterative database search using the PSI-BLAST program. A collection of 1193 position-dependent weight matrices that can be used as fold identifiers was produced. In the completely sequenced genomes, folds could be automatically identified for 20%-30% of the proteins, with 3%-6% more detectable by additional analysis of conserved motifs. The distribution of the most common folds is very similar in bacteria and archaea but distinct in eukaryotes. Within the bacteria, this distribution differs between parasitic and free-living species. In all analyzed genomes, the P-loop NTPases are the most abundant fold. In bacteria and archaea, the next most common folds are ferredoxin-like domains, TIM-barrels, and methyltransferases, whereas in eukaryotes, the second to fourth places belong to protein kinases, beta-propellers and TIM-barrels. The observed diversity of protein folds in different proteomes is approximately twice as high as it would be expected from a simple stochastic model describing a proteome as a finite sample from an infinite pool of proteins with an exponential distribution of the fold fractions. Distribution of the number of domains with different folds in one protein fits the geometric model, which is compatible with the evolution of multidomain proteins by random combination of domains. [Fold predictions for proteins from 14 proteomes are available on the World Wide Web at. The FIDs are available by anonymous ftp at the same location.]

Entities:  

Mesh:

Year:  1999        PMID: 9927481

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  56 in total

1.  Detection of protein fold similarity based on correlation of amino acid properties.

Authors:  I V Grigoriev; S H Kim
Journal:  Proc Natl Acad Sci U S A       Date:  1999-12-07       Impact factor: 11.205

2.  The ASTRAL compendium for protein structure and sequence analysis.

Authors:  S E Brenner; P Koehl; M Levitt
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Estimating the probability for a protein to have a new fold: A statistical computational model.

Authors:  E Portugaly; M Linial
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-09       Impact factor: 11.205

4.  Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels.

Authors:  J Lin; M Gerstein
Journal:  Genome Res       Date:  2000-06       Impact factor: 9.043

5.  Comparative genome analysis of the pathogenic spirochetes Borrelia burgdorferi and Treponema pallidum.

Authors:  G Subramanian; E V Koonin; L Aravind
Journal:  Infect Immun       Date:  2000-03       Impact factor: 3.441

6.  GTOP: a database of protein structures predicted from genome sequences.

Authors:  Takeshi Kawabata; Satoshi Fukuchi; Keiichi Homma; Motonori Ota; Jiro Araki; Takehiko Ito; Nobuyuki Ichiyoshi; Ken Nishikawa
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  Ab initio protein structure prediction on a genomic scale: application to the Mycoplasma genitalium genome.

Authors:  Daisuke Kihara; Yang Zhang; Hui Lu; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-16       Impact factor: 11.205

8.  Role of hydrophobic clusters and long-range contact networks in the folding of (alpha/beta)8 barrel proteins.

Authors:  S Selvaraj; M Michael Gromiha
Journal:  Biophys J       Date:  2003-03       Impact factor: 4.033

9.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

10.  Proteomics of Mycoplasma genitalium: identification and characterization of unannotated and atypical proteins in a small model genome.

Authors:  S Balasubramanian; T Schneider; M Gerstein; L Regan
Journal:  Nucleic Acids Res       Date:  2000-08-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.