Mukul S Bansal, Oliver Eulenstein.
Abstract
MOTIVATION: Deciphering the location of gene duplications and multiple gene duplication episodes on the Tree of Life is fundamental to understanding the way gene families and genomes evolve. The multiple gene duplication problem provides a framework for placing gene duplication events onto nodes of a given species tree, and detecting episodes of multiple gene duplication. One version of the multiple gene duplication problem was defined by Guigó et al. in 1996. Several heuristic solutions have since been proposed for this problem, but no exact algorithms were known.
Year: 2008 PMID: 18586705 PMCID: PMC2718628 DOI: 10.1093/bioinformatics/btn150
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1. A gene tree G and a comparable species tree S are depicted. The bold vertices in G are duplications, and their intervals represent their allowed locations in the species tree S.
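To make the notion of a duplication vertex concrete, the following is a minimal sketch of the standard LCA-mapping test used in gene tree/species tree reconciliation: a gene tree vertex is flagged as a duplication when it maps to the same species tree node as one of its children. This is not the ExactMGD algorithm, and it does not compute the allowed intervals shown in the figure. The nested-tuple tree encoding, the function names, and the toy trees `S` and `G` are assumptions made for illustration only.

```python
# Trees are nested 2-tuples; leaves are strings (hypothetical encoding).
# Each gene tree leaf carries the name of the species it was sampled from.

def leaf_set(tree):
    """Set of species names found at the leaves below this node."""
    if isinstance(tree, str):
        return frozenset([tree])
    left, right = tree
    return leaf_set(left) | leaf_set(right)

def species_clusters(species_tree):
    """Leaf set (cluster) of every node of the species tree."""
    clusters = []
    def walk(node):
        clusters.append(leaf_set(node))
        if not isinstance(node, str):
            for child in node:
                walk(child)
    walk(species_tree)
    return clusters

def lca_map(gene_node, clusters):
    """LCA mapping: the smallest species cluster containing the node's species."""
    species = leaf_set(gene_node)
    return min((c for c in clusters if species <= c), key=len)

def duplications(gene_tree, species_tree):
    """Gene tree vertices that map to the same species node as a child."""
    clusters = species_clusters(species_tree)
    found = []
    def walk(node):
        if isinstance(node, str):
            return
        here = lca_map(node, clusters)
        if any(lca_map(child, clusters) == here for child in node):
            found.append((node, here))
        for child in node:
            walk(child)
    walk(gene_tree)
    return found

# Toy example (made up, not taken from the paper's figure).
S = (("a", "b"), "c")
G = (("a", "b"), ("a", "c"))
dups = duplications(G, S)
for vertex, location in dups:
    print(vertex, "is a duplication mapped to", sorted(location))
```

In this toy input only the root of `G` is flagged, because its child `("a", "c")` already maps to the root of `S`. The sketch recomputes leaf sets on every call, which is quadratic in tree size; it is meant to show the definition rather than an efficient implementation.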
Performance of EXACTMGD on simulated datasets
| Dataset | Unoptimized | ApproxMGD | ExactMGD |
|---|---|---|---|
| 50 taxa | 30 | 28 | 25 |
| 100 taxa | 47 | 38 | 35 |
| 200 taxa | 64 | 54 | 49 |
| 400 taxa | 65 | 45 | 40 |
Performance of EXACTMGD on empirical datasets
| Dataset | Unoptimized | ApproxMGD | ExactMGD |
|---|---|---|---|
| Guigó | 9 | 7 | 5 |
| Burleigh | 1180 | 1152 | 1042 |