Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels.

J Lin, M Gerstein.

Abstract

We built whole-genome trees based on the presence or absence of particular molecular features, either orthologs or folds, in the genomes of a number of recently sequenced microorganisms. To put these genomic trees into perspective, we compared them to the traditional ribosomal phylogeny and also to trees based on the sequence similarity of individual orthologous proteins. We found that our genomic trees based on the overall occurrence of orthologs did not agree well with the traditional tree. This discrepancy, however, vanished when the tree was restricted to proteins involved in transcription and translation, excluding the problematic proteins involved in metabolism. Protein folds unite superficially unrelated sequence families and represent a most fundamental molecular unit described by genomes. We found that our genomic occurrence tree based on folds agreed fairly well with the traditional ribosomal phylogeny. Surprisingly, despite this overall agreement, certain classes of folds, particularly all-beta ones, had a somewhat different phylogenetic distribution. We also compared our occurrence trees to whole-genome clusters based on the composition of amino acids and dinucleotides. Finally, we analyzed some technical aspects of genomic trees, e.g., comparing parsimony versus distance-based approaches and examining the effects of increasing numbers of organisms. Additional information (e.g., clickable trees) is available from http://bioinfo.mbb.yale.edu/genome/trees.
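As a rough illustration of the distance-based occurrence trees mentioned in the abstract, the sketch below clusters genomes by the presence or absence of features. It is not the authors' method: the genome and feature names are hypothetical, the Jaccard distance is an assumed choice of metric, and average-linkage (UPGMA) clustering is swapped in as a simple stand-in for whatever tree-building procedure the paper used.

```python
from itertools import combinations

# Hypothetical presence/absence profiles: genome -> set of features
# (folds or ortholog groups) found in it. All names are illustrative.
profiles = {
    "genome_A": {"f1", "f2", "f3", "f4"},
    "genome_B": {"f1", "f2", "f3", "f5"},
    "genome_C": {"f1", "f6", "f7"},
    "genome_D": {"f1", "f6", "f8"},
}


def jaccard_distance(a, b):
    """Return 1 - |shared| / |union| (an assumed metric, not the paper's)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def upgma(profiles):
    """Average-linkage clustering; returns the tree as nested tuples."""
    # Each cluster key is a subtree; its value lists the member genomes.
    clusters = {name: [name] for name in profiles}
    # Pairwise distances between the original genomes.
    dist = {
        frozenset((x, y)): jaccard_distance(profiles[x], profiles[y])
        for x, y in combinations(profiles, 2)
    }

    def cluster_distance(x, y):
        # Mean distance over all cross-cluster genome pairs.
        pairs = [(i, j) for i in clusters[x] for j in clusters[y]]
        return sum(dist[frozenset(p)] for p in pairs) / len(pairs)

    while len(clusters) > 1:
        # Merge the two closest clusters into a new subtree.
        x, y = min(combinations(clusters, 2),
                   key=lambda pair: cluster_distance(*pair))
        members = clusters.pop(x) + clusters.pop(y)
        clusters[(x, y)] = members
    return next(iter(clusters))


tree = upgma(profiles)
print(tree)
```

Restricting the feature sets to one functional class (for example, only transcription and translation proteins) before calling `upgma` would mimic, in spirit, the kind of subset comparison the abstract describes.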

Year: 2000    PMID: 10854412    PMCID: PMC310900    DOI: 10.1101/gr.10.6.808

Source DB: PubMed    Journal: Genome Res    ISSN: 1088-9051    Impact factor: 9.043


