Literature DB >> 11472996

An insight into domain combinations.

G Apic1, J Gough, S A Teichmann.   

Abstract

Domains are the building blocks of all globular proteins, and are units of compact three-dimensional structure as well as evolutionary units. There is a limited repertoire of domain families, so that these domain families are duplicated and combined in different ways to form the set of proteins in a genome. Proteins are gene products. The processes that produce new genes are duplication and recombination as well as gene fusion and fission. We attempt to gain an overview of these processes by studying the structural domains in the proteins of seven genomes from the three kingdoms of life: Eubacteria, Archaea and Eukaryota. We use here the domain and superfamily definitions in Structural Classification of Proteins Database (SCOP) in order to map pairs of adjacent domains in genome sequences in terms of their superfamily combinations. We find 624 out of the 764 superfamilies in SCOP in these genomes, and the 624 families occur in 585 pairwise combinations. Most families are observed in combination with one or two other families, while a few families are very versatile in their combinatorial behaviour. This type of pattern can be described by a scale-free network. Finally, we study domain repeats and we compare the set of the domain combinations in the genomes to those in PDB, and discuss the implications for structural genomics.

Mesh:

Substances:

Year:  2001        PMID: 11472996     DOI: 10.1093/bioinformatics/17.suppl_1.s83

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  36 in total

1.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

2.  Enhanced protein domain discovery by using language modeling techniques from speech recognition.

Authors:  Lachlan Coin; Alex Bateman; Richard Durbin
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-31       Impact factor: 11.205

3.  3D-GENOMICS: a database to compare structural and functional annotations of proteins between sequenced genomes.

Authors:  Keiran Fleming; Arne Müller; Robert M MacCallum; Michael J E Sternberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  An evolutionarily structured universe of protein architecture.

Authors:  Gustavo Caetano-Anollés; Derek Caetano-Anollés
Journal:  Genome Res       Date:  2003-07       Impact factor: 9.043

5.  The nonconserved wrapping of conserved protein folds reveals a trend toward increasing connectivity in proteomic networks.

Authors:  Ariel Fernández; Ridgway Scott; R Stephen Berry
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-20       Impact factor: 11.205

6.  Modeling the evolution of protein domain architectures using maximum parsimony.

Authors:  Jessica H Fong; Lewis Y Geer; Anna R Panchenko; Stephen H Bryant
Journal:  J Mol Biol       Date:  2006-11-10       Impact factor: 5.469

7.  Reductive evolution of proteomes and protein structures.

Authors:  Minglei Wang; Charles G Kurland; Gustavo Caetano-Anollés
Journal:  Proc Natl Acad Sci U S A       Date:  2011-07-05       Impact factor: 11.205

8.  CODA: accurate detection of functional associations between proteins in eukaryotic genomes using domain fusion.

Authors:  Adam J Reid; Juan A G Ranea; Andrew B Clegg; Christine A Orengo
Journal:  PLoS One       Date:  2010-06-01       Impact factor: 3.240

9.  The nature of protein domain evolution: shaping the interaction network.

Authors:  Christoph P Bagowski; Wouter Bruins; Aartjan J W Te Velthuis
Journal:  Curr Genomics       Date:  2010-08       Impact factor: 2.236

10.  Linking fold, function and phylogeny: a comparative genomics view on protein (domain) evolution.

Authors:  Aartjan J W Te Velthuis; Christoph P Bagowski
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.