| Literature DB >> 18662388 |
Cuong Than1, Derek Ruths, Luay Nakhleh.
Abstract
BACKGROUND: Phylogenies, i.e., the evolutionary histories of groups of taxa, play a major role in representing the interrelationships among biological entities. Many software tools for reconstructing and evaluating such phylogenies have been proposed, almost all of which assume the underlying evolutionary history to be a tree. While trees give a satisfactory first-order approximation for many families of organisms, other families exhibit evolutionary mechanisms that cannot be represented by trees. Processes such as horizontal gene transfer (HGT), hybrid speciation, and interspecific recombination, collectively referred to as reticulate evolutionary events, result in networks, rather than trees, of relationships. Various software tools have been recently developed to analyze reticulate evolutionary relationships, which include SplitsTree4, LatTrans, EEEP, HorizStory, and T-REX.Entities:
Mesh:
Year: 2008 PMID: 18662388 PMCID: PMC2533029 DOI: 10.1186/1471-2105-9-322
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Evolutionary networks and gene trees. Gene trees T and T' within species networks N. (a) The gene whose tree is depicted with a dashed line is transferred from the genome of species C to that of species B. (b) Species B and C exchanged the two genes whose trees are T and T'. (c) Species B is a hybrid whose two parents are species A and C; each gene in the genome of species B has an evolutionary tree that is either T or T'.
Figure 2Sample evolutionary networks. Two evolutionary networks N1 and N2, each with eight leaves (labeled A,..., H) and two network nodes X and Y. Shown are the orientation of the network edges; all other edges are directed away from the root (toward the leaves) Notice that the difference between the two networks is that node X in N1 has lineage G as one of its parents, whereas node X in N2 has lineage H as one of its parents.
Figure 3A modified Newick format. Three trees, N', X, and Y, along with their Newick representation. These three trees form the tree decomposition of the evolutionary network N1 in Figure 2. The eNewick representation of N is the triplet ⟨N'; X; Y⟩.
Figure 4The eNewick format. A modified Newick format for representing evolutionary networks. The figure is taken from the paper by Morin et. al. [22].
Figure 5Trees within networks. The sets and of all eight trees induced by the two networks N1 and N2, respectively, in Figure 2.
The clusters of the two networks in Figure 2.
| Network | Network |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
| { | { |
A table of the (nontrivial) clusters of the two networks N1 and N2 in Figure 2, denoted by (N1) and (N2), respectively, in the text. Highlighted are rows corresponding to clusters that differ between the two networks.
The tripartitions of the two networks in Figure 2.
| Edge Label | Network | Network |
| 1 | ⟨{ | ⟨{ |
| 2 | ⟨{ | ⟨{ |
| 3 | ⟨{ | ⟨{ |
| ⟨{ | ⟨{ | |
| 5 | ⟨{ | ⟨{ |
| 6 | ⟨{ | ⟨{ |
| 7 | ⟨{ | ⟨{ |
| 8 | ⟨{ | ⟨{ |
| 9 | ⟨{ | ⟨{ |
| 10 | ⟨{ | ⟨{ |
| 11 | ⟨{ | ⟨{ |
| 12 | ⟨{ | ⟨{ |
A table of the (nontrivial) tripartitions of the two networks N1 and N2 in Figure 2, denoted by θ(N1) and θ(N2), respectively, in the text. Highlighted are rows corresponding to tripartitions that differ between the two networks.
Figure 6Tree-based comparison of networks. Illustration of the tree-based network comparison measure. (a) The weighted bipartite graph G that is constructed from the two networks N1 and N2 in Figure 2. On the left are four nodes that correspond to the four trees in (N1) and on the right are four nodes that correspond to the four trees in (N2). The weight of an edge between and is the values of the Robinson-Foulds (RF) distance between the two trees, which is computed as the number of clusters present in one but not both of the trees, divided by 2. (b) The edges that comprise the minimum-weight edge cover of the bipartite graph G. The weight of this cover is 2, which is the sum of the weights of the edges in the cover; therefore, m(N1, N2) = 2.
Figure 7Screenshot of the graphical output of RIATA-HGT. (a) Screen captures of the graphical output of RIATA-HGT on the pair of trees (((a, b), c), (d, (e, f))) and (((a, c), b), ((d, f), e)). (b) The eNewick representations of the two selected networks.
Figure 8An illustration of computing the support value of an HGT edge. An illustration of computing the support value of an HGT edge. In this case, the support of HGT edge X → Y added on the species tree (a), is calculated based on the bootstrap of the branches that separate Y (or B) from A in the gene tree (b). The value for the event X → Y is 80.
Figure 9The cox2 trees. The species tree (a): (((Pallavicinia, (Porella, Trichocolea)I15)I16, Marchantia)I17, (Thuidium, Brachythecium, Hypnum)I14, (((Amborella_V, Amborella_H_M, Amborella_H_E1, Amborella_H_E2)I9, ((Eichhornia, (Zea, Oryza)I6)I7, Philodendron, Agave)I8, ((Daucus, Petunia)I4, Beta, (Oenothera, (Brassica, Arabidopsis)I2)I3)I5, ((Piper, Asarum)I10, (Liriodendron, Laurus)I11)I1)I12, (Pinus, Zamia)I13)I0)I18; and gene tree (b): ((((((((Petunia, Amborella_H_E1, (((Arabidopsis, Brassica)I2:99.0, Amborella_H_E2), Oenothera, Daucus)), Beta):73.0, ((Agave, Eichhornia, Philodendron), (Oryza, Zea)I6:100.0)), (Asarum, Piper)I10), (Laurus, Liriodendron)I11), Amborella_V), (Pinus, Zamia)I13:72.0), (((Thuidium, Hypnum, Amborella_H_M):91.0, Brachythecium):98.0, (Marchantia, ((Porella, Trichocolea):97.0, Pallavicinia))I17)); for gene cox2. Bootstrap values for the branches in the gene tree that are greater than 50.0% are included in the tree Newick representation. The species tree branches do not have bootstrap values.
Figure 10An example of RIATA-HGT output. The output of RIATA-HGT on the species tree and cox2 gene tree in Figure 9. RIATA-HGT finds 4 solutions, summarized in terms of two components, so that each solution is the union of exactly one subsolution from each component.
List of tools and their description.
| charnet | Computing clusters, trees and tripartitions in a network |
| cmpnets | Computing the distance between two networks |
| lca | Finding the last common ancestor of a set of nodes |
| mast | Computing the maximum agreement subtree |
| netpars | Scoring the parsimony of sequences on a pair of networks |
| riatahgt | Reconstructing HGT events from a pair of species/gene trees |
| rf | Computing the Robinson-Foulds tree measure |
A table of the tools currently implemented in PhyloNet. With the exception of the three phylogenetic trees tools lca, mast, and rf, all the other tools are for analyzing reticulate evolutionary relationships.