Literature DB >> 11428892

Domain combinations in archaeal, eubacterial and eukaryotic proteomes.

G Apic1, J Gough, S A Teichmann.   

Abstract

There is a limited repertoire of domain families that are duplicated and combined in different ways to form the set of proteins in a genome. Proteins are gene products, and at the level of genes, duplication, recombination, fusion and fission are the processes that produce new genes. We attempt to gain an overview of these processes by studying the evolutionary units in proteins, domains, in the protein sequences of 40 genomes. The domain and superfamily definitions in the Structural Classification of Proteins Database are used, so that we can view all pairs of adjacent domains in genome sequences in terms of their superfamily combinations. We find 783 out of the 859 superfamilies in SCOP in these genomes, and the 783 families occur in 1307 pairwise combinations. Most families are observed in combination with one or two other families, while a few families are very versatile in their combinatorial behaviour; 209 families do not make combinations with other families. This type of pattern can be described as a scale-free network. We also study the N to C-terminal orientation of domain pairs and domain repeats. The phylogenetic distribution of domain combinations is surveyed, to establish the extent of common and kingdom-specific combinations. Of the kingdom-specific combinations, significantly more combinations consist of families present in all three kingdoms than of families present in one or two kingdoms. Hence, we are led to conclude that recombination between common families, as compared to the invention of new families and recombination among these, has also been a major contribution to the evolution of kingdom-specific and species-specific functions in organisms in all three kingdoms. Finally, we compare the set of the domain combinations in the genomes to those in the RCSB Protein Data Bank, and discuss the implications for structural genomics. Copyright 2001 Academic Press.

Mesh:

Substances:

Year:  2001        PMID: 11428892     DOI: 10.1006/jmbi.2001.4776

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  176 in total

1.  Recombinatoric exploration of novel folded structures: a heteropolymer-based model of protein evolutionary landscapes.

Authors:  Yan Cui; Wing Hung Wong; Erich Bornberg-Bauer; Hue Sun Chan
Journal:  Proc Natl Acad Sci U S A       Date:  2002-01-22       Impact factor: 11.205

2.  SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.

Authors:  Julian Gough; Cyrus Chothia
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

4.  Structural characterization of the human proteome.

Authors:  Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

5.  Expanding protein universe and its origin from the biological Big Bang.

Authors:  Nikolay V Dokholyan; Boris Shakhnovich; Eugene I Shakhnovich
Journal:  Proc Natl Acad Sci U S A       Date:  2002-10-16       Impact factor: 11.205

6.  Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination.

Authors:  Gordana Apic; Wolfgang Huber; Sarah A Teichmann
Journal:  J Struct Funct Genomics       Date:  2003

7.  The SUPERFAMILY database in 2004: additions and improvements.

Authors:  Martin Madera; Christine Vogel; Sarah K Kummerfeld; Cyrus Chothia; Julian Gough
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

8.  3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes.

Authors:  Keiran Fleming; Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

9.  Transmembrane protein domains rarely use covalent domain recombination as an evolutionary mechanism.

Authors:  Yang Liu; Mark Gerstein; Donald M Engelman
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-01       Impact factor: 11.205

10.  Comparative analysis of protein domain organization.

Authors:  Yuzhen Ye; Adam Godzik
Journal:  Genome Res       Date:  2004-03       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.