| Literature DB >> 19204821 |
Abstract
Minimum contradiction matrices are a useful complement to distance-based phylogenies. A minimum contradiction matrix represents phylogenetic information under the form of an ordered distance matrix Y(i) (,) (j) (n). A matrix element corresponds to the distance from a reference vertex n to the path (i, j). For an X-tree or a split network, the minimum contradiction matrix is a Robinson matrix. It therefore fulfills all the inequalities defining perfect order: Y(i) (,) (j) (n) >or= Y(i) (,) (k) (n) (,)Y(k j) (n) >or= Y(k) (,) (I) (n), i <or= j <or= k < n. In real phylogenetic data, some taxa may contradict the inequalities for perfect order. Contradictions to perfect order correspond to deviations from a tree or from a split network topology. Efficient algorithms that search for the best order are presented and tested on whole genome phylogenies with 184 taxa including many Bacteria, Archaea and Eukaryota. After optimization, taxa are classified in their correct domain and phyla. Several significant deviations from perfect order correspond to well-documented evolutionary events.Entities:
Keywords: minimum contradiction; phylogenetic trees; split network; whole genome phylogeny
Year: 2008 PMID: 19204821 PMCID: PMC2614196 DOI: 10.4137/ebo.s909
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1The distance matrix Y corresponds to the distance between the leaf n and the path P(i, j ).
Figure 2The inequalities Y, ≥ Y are fulfilled for any reference vertex n with n ≥ k or n ≤ i.
Figure 3Minimum contradiction matrices corresponding to the best order found after optimization with Eq. 6,7. The contradiction is minimized over the lines of the matrix (left) and the columns (right).
Best Order (Fig. 3, 4).
| 1. α-Proteobacteria | 1–14 |
| 2. γ-Proteobacteria | 15–18 |
| 3. β-Proteobacteria | 19–29 |
| 4. γ-Proteobacteria | 30–54 |
| 5. ɛ-Proteobacteria | 55–59 |
| 6. Aquificae | 60 |
| 7. δ-Proteobacteria | 61–63 |
| 8. Chlorobi | 64 |
| 9. Bacteroidetes | 65–66 |
| 10. Spirochetes | 67–71 |
| 11. Thermotogae | 72 |
| 12. Fusobacteria | 73 |
| 13. Firmicutes | 74–116 |
| 14. Eukaryota | 117–135 |
| 15. Archaea | 136–152 |
| 16. Actinobacteria | 153–166 |
| 17. Deinococcus-Thermus | 167–168 |
| 18. Cyanobacteria | 169–176 |
| 19. Planctomycetes | 177 |
| 20. Chlamydiae | 178–184 |
(see annex for detailed list of taxa).
Figure 4Distance matrix Yi, j using the best order in Figure 3 and Pirellula (taxon 177) as reference taxon.
Figure 5Distance matrix Yi, j for a) Rickettsiales (Taxa 1–4) as reference taxa and taxa 72–152 in Figure 3. b) Eukaryota using Cyanobacteria as reference taxa. The arrow points to Arabidopsis and Cyanidioschyzon.