Literature DB >> 19664165

Trees in the web of life.

Kristen S Swithers1, J Peter Gogarten, Gregory P Fournier.   

Abstract

Reconstructing the 'Tree of Life' is complicated by extensive horizontal gene transfer between diverse groups of organisms. While numerous conceptual and technical obstacles remain, a report in this issue of Journal of Biology from Koonin and colleagues on the largest-scale prokaryotic genomic reconstruction yet attempted shows that such a tree is discernible, although its branches cannot be traced.

Entities:  

Mesh:

Year:  2009        PMID: 19664165      PMCID: PMC2737374          DOI: 10.1186/jbiol160

Source DB:  PubMed          Journal:  J Biol        ISSN: 1475-4924


The Tree of Life (ToL) is a widely used metaphor to describe the history of life on Earth. While Darwin argued that the 'Coral of Life' may be a more apt description (since only the surface remains alive, supported by the dead generations beneath it), relationships between organisms based on shared characters are best organized using the schematic representation of a tree. Use of molecular markers, in particular small-subunit ribosomal RNA, have allowed this metaphor to be extended to microorganisms; however, this has also presented unique challenges for notions of phylogeny and evolution. One of the most significant challenges is the impact of horizontal gene transfer, which causes genes that coexist in a genome to have different molecular phylogenies [1]. Despite these challenges, the increasing ease with which genomes can be sequenced has reinvigorated attempts to use genomic information to reconstruct the ToL.

Combining datasets: supertree and supermatrix methods

All microbial individuals arise as the result of a fission of a parent individual. Therefore, a vertical line of descent exists, and could theoretically be reconstructed as a purely bifurcating tree (that is, an organismal or cytoplasmic tree). However, while evolution presupposes and requires descent via reproduction, the two are not analogous. Evolution is, by definition, the change in the genetic material within a population of organisms across generations; therefore, any process by which genetic material within a population changes that is unrelated to the reproduction of individuals will show a history that is unrelated to the organismal vertical line of descent. This includes horizontal gene transfer. In many cases, the sum effect of these other genetic processes may completely obfuscate vertical descent, leaving only some measure of 'relatedness' based on overall genetic similarity. Two common approaches in constructing a genome-based ToL are supermatrix analyses, in which sequence alignments for individual gene families are concatenated into a single dataset that is then used to construct a tree [2], and supertree analyses, in which a consensus phylogeny is constructed from multiple gene trees [3]. In some cases, datasets are generated by finding orthologous genes in all organisms and removing all genes whose conflicting phylogenetic topologies seem to indicate horizontal gene transfer, and then using the remaining genes to reconstruct the presumed vertical lines of descent of the genomes (see, for example, [4-6]). This approach has an obvious shortcoming in that gene transfer and the resulting phylogenetic conflicts can only be inferred if each individual gene has retained sufficient phylogenetic information to enable its origin to be correctly assigned. Furthermore, the absence of evidence for gene transfer does not constitute evidence for the absence of gene transfer. Thus, combining genes with different histories into a single data set will almost certainly result in a phylogeny that represents neither the history of any individual gene, nor the history of the organism as a whole. Another problem with supermatrix and supertree analyses is that they often give equal weight to genes that have different histories of horizontal gene transfer. This results in an average or median phylogeny that may not represent organismal history; if there are 'highways' of gene sharing – that is, large numbers of genes have, for some reason, been shared between specific groups of otherwise phylogenetically distinct organisms – this can easily be mistaken for a consistent signal supporting an organismal tree. For example, because of such highways of gene sharing these types of analyses group members of the order Thermotogales with the Firmicutes, and the members of the Aquificales with the ε-Proteobacteria. In contrast, 16S rRNA gene phylogenies and concatenated ribosomal protein phylogenies strongly support these two orders as deeply branching bacterial lineages [7,8] (Figure 1).
Figure 1

The Tree of Life as impacted by horizontal gene transfer. (a) Extensive horizontal gene transfer at all phylogenetic levels combine to produce a 'Web of Life' that often obscures the lines of descent between groups (modified from [10]). Copyright (2008) National Academy of Sciences, USA. (b) Major microbial groups as defined by 16S ribosomal RNA phylogeny. Bands represent some avenues of extensive gene sharing involving Thermotogales, Aquificales, and Firmicutes. (c) Impact on relationships between Thermotogales and Aquificales of genome content changes due to extensive horizontal gene transfer. Grey clouds represent groups of shared genes between clades that are non-monophyletic in the 16S tree. The phylogeny based on these 'gene content' clouds is quite distinct from that of 16S or other ribosome-based trees.

The Tree of Life as impacted by horizontal gene transfer. (a) Extensive horizontal gene transfer at all phylogenetic levels combine to produce a 'Web of Life' that often obscures the lines of descent between groups (modified from [10]). Copyright (2008) National Academy of Sciences, USA. (b) Major microbial groups as defined by 16S ribosomal RNA phylogeny. Bands represent some avenues of extensive gene sharing involving Thermotogales, Aquificales, and Firmicutes. (c) Impact on relationships between Thermotogales and Aquificales of genome content changes due to extensive horizontal gene transfer. Grey clouds represent groups of shared genes between clades that are non-monophyletic in the 16S tree. The phylogeny based on these 'gene content' clouds is quite distinct from that of 16S or other ribosome-based trees.

Ribosomal trees and the 'genome core'

If stringent criteria are applied to remove or down-weigh transferred genes from supertree or supermatrix analyses, the resulting trees at best represent the history of only a minor fraction of the genome, largely consisting of ribosomal proteins, effectively a 'tree of one percent' [9]. Even if this remaining 'genome core' retains a strong signal of vertical descent, this does not capture the true evolutionary history of genomes; that is, a web where different strands depict the history of different genes. A ribosomal tree of life has other shortcomings, in that within taxonomic orders many recombination and lineage sorting events may occur, and ribosomal genes are so highly conserved that such events at the tips of the tree may not be detectable. However, it can still provide a useful backbone for a reticulated genomic or organismal phylogeny [10,11], especially with respect to sets of genes that clearly have undergone horizontal transfer between more distantly related groups. While ribosomal protein and RNA encoding genes have been transferred in the past (see discussion in [12]), these genes are resistant to transfer [13], with most transfers occurring between close relatives. These properties make a phylogenetic reconstruction using ribosomal RNA and proteins an ideal scaffold upon which to map horizontal gene transfers, clearly depicting their distinct contribution to genomic (and organismal) evolution. Several attempts have been made to capture this web-like genome history (see, for example, [10,11] using ribosomal rRNA as a backbone (Figure 1). Conceptually, this method is distinct from any 'tree of one percent' [9] or genome averaging approach in that rather than being discarded, genes undergoing horizontal transfer are included in the final reconstruction without obscuring the vertical signal, even if that vertical signal is preserved only in a minority of genes.

The Forest of Life

In this issue, Puigbo, Wolf and Koonin [14] present an approach for salvaging the ToL that is a variant on other supertree methods, in which nearly 7,000 phylogenetic trees of prokaryotic genes (a 'Forest of Life') are compared in order to determine a central tendency in their topologies. The trees are built from clusters of orthologous groups of proteins (COGs), and the central tendency is deduced from a set of nearly universal trees (NUTs), defined by Puigbo et al. as those trees generated from a set of COGs that are represented in >90% of the analyzed prokaryote taxa. What distinguishes their approach from earlier supertree analyses – apart from the very large number of genes included in the comparison – is that it does not depend on a concatenation of highly conserved proteins or rRNAs, or on a supertree generated by 'pruning' down to those genes giving a consistent topology, to determine a central tendency. Instead, Puigbo et al. calculate an 'inconsistency score' that is a measure of how representative a particular topology of each tree is to the rest of the trees in the Forest of Life. In reconstructing the central tendency in such a broad distribution of gene phylogenies, the work by Puigbo et al. also shows the difficulty in resolving deep branches, which often simply collapse into radiations without any topological structure. In confronting this problem, they show that the relationship between phylogenetic depth and resolution supports a tree-like structure for these deep branches. This result is significant in that it suggests that there is no need to postulate exotic 'big bang' radiations early in evolution; rather, deep phylogenies can still be represented as bifurcating evolutionary events, albeit with extremely short branches that can prove difficult (or sometimes impossible) to resolve. Integrating the vertical descent of organisms and their genomes with the myriad phylogenetic patterns produced by horizontal gene transfer is essential for a truly comprehensive understanding of evolution. A new method that acknowledges and promotes this integration, even if falling short of fully encompassing the intricate details of a complex genome-based biological reality, represents progress towards this goal, and it now appears that a vertical signal can be discerned, if not clearly resolved.
  14 in total

Review 1.  Phylogenomics and the reconstruction of the tree of life.

Authors:  Frédéric Delsuc; Henner Brinkmann; Hervé Philippe
Journal:  Nat Rev Genet       Date:  2005-05       Impact factor: 53.242

Review 2.  Horizontal gene transfer, genome innovation and evolution.

Authors:  J Peter Gogarten; Jeffrey P Townsend
Journal:  Nat Rev Microbiol       Date:  2005-09       Impact factor: 60.633

3.  The evolution of supertrees.

Authors:  Olaf R P Bininda-Emonds
Journal:  Trends Ecol Evol       Date:  2004-06       Impact factor: 17.712

4.  Toward automatic reconstruction of a highly resolved tree of life.

Authors:  Francesca D Ciccarelli; Tobias Doerks; Christian von Mering; Christopher J Creevey; Berend Snel; Peer Bork
Journal:  Science       Date:  2006-03-03       Impact factor: 47.728

5.  Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution.

Authors:  Tal Dagan; Yael Artzy-Randrup; William Martin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-07-16       Impact factor: 11.205

6.  Dealing with incongruence in phylogenomic analyses.

Authors:  Nicolas Galtier; Vincent Daubin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

7.  The early evolution of cellular life.

Authors:  J P Gogarten
Journal:  Trends Ecol Evol       Date:  1995-04       Impact factor: 17.712

8.  Search for a 'Tree of Life' in the thicket of the phylogenetic forest.

Authors:  Pere Puigbò; Yuri I Wolf; Eugene V Koonin
Journal:  J Biol       Date:  2009-07-13

9.  On the chimeric nature, thermophilic origin, and phylogenetic placement of the Thermotogales.

Authors:  Olga Zhaxybayeva; Kristen S Swithers; Pascal Lapierre; Gregory P Fournier; Derek M Bickhart; Robert T DeBoy; Karen E Nelson; Camilla L Nesbø; W Ford Doolittle; J Peter Gogarten; Kenneth M Noll
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-23       Impact factor: 11.205

10.  Accounting for horizontal gene transfers explains conflicting hypotheses regarding the position of aquificales in the phylogeny of Bacteria.

Authors:  Bastien Boussau; Laurent Guéguen; Manolo Gouy
Journal:  BMC Evol Biol       Date:  2008-10-03       Impact factor: 3.260

View more
  7 in total

1.  Biased gene transfer mimics patterns created through shared ancestry.

Authors:  Cheryl P Andam; David Williams; J Peter Gogarten
Journal:  Proc Natl Acad Sci U S A       Date:  2010-05-21       Impact factor: 11.205

2.  Of primordial genomes and cooperative kittens.

Authors:  Miranda Robertson
Journal:  J Biol       Date:  2009-07-20

3.  Filling the gaps in the genomic landscape.

Authors:  David Williams; J Peter Gogarten; Pascal Lapierre
Journal:  Genome Biol       Date:  2010-02-16       Impact factor: 13.583

4.  Seeing the Tree of Life behind the phylogenetic forest.

Authors:  Pere Puigbò; Yuri I Wolf; Eugene V Koonin
Journal:  BMC Biol       Date:  2013-04-15       Impact factor: 7.431

5.  A rooted net of life.

Authors:  David Williams; Gregory P Fournier; Pascal Lapierre; Kristen S Swithers; Anna G Green; Cheryl P Andam; J Peter Gogarten
Journal:  Biol Direct       Date:  2011-09-21       Impact factor: 4.540

Review 6.  Protein based molecular markers provide reliable means to understand prokaryotic phylogeny and support Darwinian mode of evolution.

Authors:  Vaibhav Bhandari; Hafiz S Naushad; Radhey S Gupta
Journal:  Front Cell Infect Microbiol       Date:  2012-07-26       Impact factor: 5.293

7.  Detection of horizontal gene transfer in the genome of the choanoflagellate Salpingoeca rosetta.

Authors:  Danielle M Matriano; Rosanna A Alegado; Cecilia Conaco
Journal:  Sci Rep       Date:  2021-03-16       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.