Literature DB >> 10843846

Estimating the number of protein folds and families from complete genome data.

Y I Wolf1, N V Grishin, E V Koonin.   

Abstract

Using the data on proteins encoded in complete genomes, combined with a rigorous theory of the sampling process, we estimate the total number of protein folds and families, as well as the number of folds and families in each genome. The total number of folds in globular, water- soluble proteins is estimated at about 1000, with structural information currently available for about one-third of the number. The sequenced genomes of unicellular organisms encode from approximately 25%, for the minimal genomes of the Mycoplasmas, to 70-80% for larger genomes, such as Escherichia coli and yeast, of the total number of folds. The number of protein families with significant sequence conservation was estimated to be between 4000 and 7000, with structures available for about 20% of these. Copyright 2000 Academic Press.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10843846     DOI: 10.1006/jmbi.2000.3786

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  42 in total

1.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

2.  Lineage-specific gene expansions in bacterial and archaeal genomes.

Authors:  I K Jordan; K S Makarova; J L Spouge; Y I Wolf; E V Koonin
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

3.  Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination.

Authors:  Gordana Apic; Wolfgang Huber; Sarah A Teichmann
Journal:  J Struct Funct Genomics       Date:  2003

4.  FoldMiner: structural motif discovery using an improved superposition algorithm.

Authors:  Jessica Shapiro; Douglas Brutlag
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

Review 5.  Protein folds and protein folding.

Authors:  R Dustin Schaeffer; Valerie Daggett
Journal:  Protein Eng Des Sel       Date:  2010-11-03       Impact factor: 1.650

6.  Generation of a consensus protein domain dictionary.

Authors:  R Dustin Schaeffer; Amanda L Jonsson; Andrew M Simms; Valerie Daggett
Journal:  Bioinformatics       Date:  2010-11-09       Impact factor: 6.937

7.  Prokaryotic phylogenies inferred from protein structural domains.

Authors:  Eric J Deeds; Hooman Hennessey; Eugene I Shakhnovich
Journal:  Genome Res       Date:  2005-03       Impact factor: 9.043

8.  Evolutionary plasticity of protein families: coupling between sequence and structure variation.

Authors:  Anna R Panchenko; Yuri I Wolf; Larisa A Panchenko; Thomas Madej
Journal:  Proteins       Date:  2005-11-15

9.  A limited universe of membrane protein families and folds.

Authors:  Amit Oberai; Yungok Ihm; Sanguk Kim; James U Bowie
Journal:  Protein Sci       Date:  2006-07       Impact factor: 6.725

10.  Evolution of protein structural classes and protein sequence families.

Authors:  In-Geol Choi; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2006-09-07       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.