| Literature DB >> 20487540 |
Miguel Arenas1, Mateus Patricio, David Posada, Gabriel Valiente.
Abstract
BACKGROUND: Typical evolutionary events like recombination, hybridization or gene transfer make necessary the use of phylogenetic networks to properly depict the evolution of DNA and protein sequences. Although several theoretical classes have been proposed to characterize these networks, they make stringent assumptions that will likely not be met by the evolutionary process. We have recently shown that the complexity of simulated networks is a function of the population recombination rate, and that at moderate and large recombination rates the resulting networks cannot be categorized. However, we do not know whether these results extend to networks estimated from real data.Entities:
Mesh:
Year: 2010 PMID: 20487540 PMCID: PMC2880032 DOI: 10.1186/1471-2105-11-268
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Characterization of networks from real data sets. Steps for the characterization of empirical networks: Data sets are aligned and fed to TCS in order to get a reticulate phylogenetic network, which can be input to NetTest for its characterization.
Characterization of TCS networks.
| Nature of data sets | POL | COX1 | 18S | ITS1 | |
|---|---|---|---|---|---|
| Number of data sets | 566 | 1417 | 2303 | 191 | 516 |
| Number of networks generated by TCS | 1535 | 3889 | 6041 | 531 | 1035 |
| Hybrid nodes | 495 0.32 | 891 0.23 | 914 0.15 | 117 0.22 | 124 0.12 |
| Tree | 1326 | 3428 | 5505 | 465 | 952 |
| Galled-tree | 1443 | 3738 | 5873 | 507 | 1015 |
| Tree-child | 1495 | 3799 | 5941 | 518 | 1022 |
| Tree-sibling | 1523 | 3865 | 6019 | 523 | 1033 |
| Uncharacterizable | 12 | 24 | 22 | 8 | 2 |
For each gene, the table indicates the total number of data sets studied (both databases), the total number of networks generated by TCS, the total number and mean (per network) of reticulate nodes and the number of networks classified as tree, galled-tree, tree-child, and tree-sibling. In parenthesis we indicate the number of networks in that category that did not belong to the previous category and the percentage of characterized networks. In brackets we indicated the number and mean (per network) of reticulate nodes. The last row indicates the number of uncharacterizable networks (i.e., more complex than tree-sibling).
Figure 2Network complexity and intra/interspecific level. Characterization of TCS networks derived from the PopSet data sets according to their intraspecific (black bars) and interspecific (white bars) composition.
Figure 3Network complexity and diversity. Characterization of TCS networks derived from the PopSet data sets according to nucleotide diversity. Error bars indicate 95% confidence intervals.