Literature DB >> 12888524

Protein families and TRIBES in genome sequence space.

Anton J Enright1, Victor Kunin, Christos A Ouzounis.   

Abstract

Accurate detection of protein families allows assignment of protein function and the analysis of functional diversity in complete genomes. Recently, we presented a novel algorithm called TribeMCL for the detection of protein families that is both accurate and efficient. This method allows family analysis to be carried out on a very large scale. Using TribeMCL, we have generated a resource called TRIBES that contains protein family information, comprising annotations, protein sequence alignments and phylogenetic distributions describing 311 257 proteins from 83 completely sequenced genomes. The analysis of at least 60 934 detected protein families reveals that, with the essential families excluded, paralogy levels are similar between prokaryotes, irrespective of genome size. The number of essential families is estimated to be between 366 and 426. We also show that the currently known space of protein families is scale free and discuss the implications of this distribution. In addition, we show that smaller families are often formed by shorter proteins and discuss the reasons for this intriguing pattern. Finally, we analyse the functional diversity of protein families in entire genome sequences. The TRIBES protein family resource is accessible at http://www.ebi.ac.uk/research/cgg/tribes/.

Mesh:

Substances:

Year:  2003        PMID: 12888524      PMCID: PMC169885          DOI: 10.1093/nar/gkg495

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  27 in total

1.  ProtoMap: automatic classification of protein sequences and hierarchy of protein families.

Authors:  G Yona; N Linial; M Linial
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  CAST: an iterative algorithm for the complexity analysis of sequence tracts. Complexity analysis of sequence tracts.

Authors:  V J Promponas; A J Enright; S Tsoka; D P Kreil; C Leroy; S Hamodrakas; C Sander; C A Ouzounis
Journal:  Bioinformatics       Date:  2000-10       Impact factor: 6.937

3.  Universal protein families and the functional content of the last universal common ancestor.

Authors:  N Kyrpides; R Overbeek; C Ouzounis
Journal:  J Mol Evol       Date:  1999-10       Impact factor: 2.395

4.  IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.

Authors:  A A Schäffer; Y I Wolf; C P Ponting; E V Koonin; L Aravind; S F Altschul
Journal:  Bioinformatics       Date:  1999-12       Impact factor: 6.937

Review 5.  Protein function in the post-genomic era.

Authors:  D Eisenberg; E M Marcotte; I Xenarios; T O Yeates
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

6.  The GeneQuiz web server: protein functional analysis through the Web.

Authors:  S Hoersch; C Leroy; N P Brown; M A Andrade; C Sander
Journal:  Trends Biochem Sci       Date:  2000-01       Impact factor: 13.807

7.  Practical limits of function prediction.

Authors:  D Devos; A Valencia
Journal:  Proteins       Date:  2000-10-01

8.  GeneRAGE: a robust algorithm for sequence clustering and domain detection.

Authors:  A J Enright; C A Ouzounis
Journal:  Bioinformatics       Date:  2000-05       Impact factor: 6.937

9.  Functional classes in the three domains of life.

Authors:  M A Andrade; C Ouzounis; C Sander; J Tamames; A Valencia
Journal:  J Mol Evol       Date:  1999-11       Impact factor: 2.395

10.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

View more
  62 in total

1.  Origins and impact of constraints in evolution of gene families.

Authors:  Boris E Shakhnovich; Eugene V Koonin
Journal:  Genome Res       Date:  2006-10-19       Impact factor: 9.043

Review 2.  Computational prediction of protein-protein interactions.

Authors:  Lucy Skrabanek; Harpreet K Saini; Gary D Bader; Anton J Enright
Journal:  Mol Biotechnol       Date:  2007-08-14       Impact factor: 2.695

3.  Evolution of protein families: is it possible to distinguish between domains of life?

Authors:  Marta Sales-Pardo; Albert O B Chan; Luís A N Amaral; Roger Guimerà
Journal:  Gene       Date:  2007-08-14       Impact factor: 3.688

Review 4.  Exploiting protein structure data to explore the evolution of protein function and biological complexity.

Authors:  Russell L Marsden; Juan A G Ranea; Antonio Sillero; Oliver Redfern; Corin Yeats; Michael Maibaum; David Lee; Sarah Addou; Gabrielle A Reeves; Timothy J Dallman; Christine A Orengo
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-03-29       Impact factor: 6.237

5.  Integrated analysis and reconstruction of microbial transcriptional gene regulatory networks using CoryneRegNet.

Authors:  Jan Baumbach; Tobias Wittkop; Christiane Katja Kleindt; Andreas Tauch
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

6.  Whole-genome duplications in the ancestral vertebrate are detectable in the distribution of gene family sizes of tetrapod species.

Authors:  Timothy Hughes; David A Liberles
Journal:  J Mol Evol       Date:  2008-09-25       Impact factor: 2.395

7.  Network visualization and analysis of gene expression data using BioLayout Express(3D).

Authors:  Athanasios Theocharidis; Stjin van Dongen; Anton J Enright; Tom C Freeman
Journal:  Nat Protoc       Date:  2009-10-01       Impact factor: 13.491

8.  Comprehensive cluster analysis with Transitivity Clustering.

Authors:  Tobias Wittkop; Dorothea Emig; Anke Truss; Mario Albrecht; Sebastian Böcker; Jan Baumbach
Journal:  Nat Protoc       Date:  2011-02-10       Impact factor: 13.491

9.  Databases of homologous gene families for comparative genomics.

Authors:  Simon Penel; Anne-Muriel Arigon; Jean-François Dufayard; Anne-Sophie Sertier; Vincent Daubin; Laurent Duret; Manolo Gouy; Guy Perrière
Journal:  BMC Bioinformatics       Date:  2009-06-16       Impact factor: 3.169

10.  Stratification of co-evolving genomic groups using ranked phylogenetic profiles.

Authors:  Shiri Freilich; Leon Goldovsky; Assaf Gottlieb; Eric Blanc; Sophia Tsoka; Christos A Ouzounis
Journal:  BMC Bioinformatics       Date:  2009-10-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.