| Literature DB >> 18541057 |
Mao-Zu Guo1, Jian-Fu Li, Yang Liu.
Abstract
BACKGROUND: Inference of evolutionary trees using the maximum likelihood principle is NP-hard. Therefore, all practical methods rely on heuristics. The topological transformations often used in heuristics are Nearest Neighbor Interchange (NNI), Subtree Prune and Regraft (SPR) and Tree Bisection and Reconnection (TBR). However, these topological transformations often fall easily into local optima, since there are not many trees accessible in one step from any given tree. Another more exhaustive topological transformation is p-Edge Contraction and Refinement (p-ECR). However, due to its high computation complexity, p-ECR has rarely been used in practice.Entities:
Mesh:
Year: 2008 PMID: 18541057 PMCID: PMC2423445 DOI: 10.1186/1471-2105-9-S6-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1NNI.
Figure 2SPR and TBR.
Figure 3The illustration of a 2-ECR process.
Figure 4The illustration of NJ.
Real datasets
| 1 | MouseLemurs | 35 | 115 |
| 2 | 4DAT | 35 | 452 |
| 3 | 3DAT | 39 | 1116 |
| 4 | 42 | 42 | 1167 |
| 5 | Rbcl55 | 55 | 1315 |
| 6 | 101_SC | 101 | 1858 |
| 7 | 132 | 132 | 1881 |
| 8 | 150_SC | 150 | 1269 |
| 9 | 150_ARB | 150 | 3188 |
| 10 | 218_RDPII | 218 | 4182 |
| 11 | 250_ARB | 250 | 3638 |
| 12 | 500_ZILLA | 500 | 759 |
Likelihood values of BioNJ, PHYML, RaxML, fastDNAml and ECRML on different real datasets
| likelihood | Δ | likelihood | Δ | likelihood | Δ | likelihood | Δ | ||
| 1 | -10753 | -6902 | -5119 | -1268 | -4959 | -1108 | -4019 | -168 | -3851 |
| 2 | -1082 | -1 | -1089 | -8 | -1093 | -12 | -1082 | -1 | -1081 |
| 3 | -2861 | -26 | -2843 | -8 | -2842 | -7 | -2942 | -107 | -2835 |
| 4 | -7866 | -783 | -7250 | -167 | -7281 | -198 | -7310 | -227 | -7083 |
| 5 | -22552 | -299 | -22561 | -308 | -22382 | -129 | -22603 | -350 | -22253 |
| 6 | -67480 | -1311 | -66695 | -526 | -66576 | -407 | -66481 | -312 | -66169 |
| 7 | -46930 | -3293 | -43924 | -287 | -43641 | -4 | -43773 | -136 | -43637 |
| 8 | -41090 | -623 | -40520 | -53 | -40660 | -193 | -40495 | -28 | -40467 |
| 9 | -72423 | -1329 | -71100 | -6 | -71159 | -65 | -71178 | -84 | -71094 |
| 10 | -138942 | -2035 | -137074 | -167 | -136921 | -161 | -136998 | -91 | -136907 |
| 11 | -120315 | -2627 | -117869 | -181 | -118035 | -347 | **** | **** | -117688 |
| 12 | -21917 | -588 | -22380 | -1051 | -21879 | -550 | **** | **** | -21329 |
Likelihood values of various tree building algorithms on different real datasets
| likelihood | Δ | likelihood | Δ | ||
| 1 | -5119 | -1275 | -3851 | -7 | -3844 |
| 2 | -1089 | -8 | -1081 | 0 | -1081 |
| 3 | -2843 | -13 | -2835 | -5 | -2830 |
| 4 | -7250 | -222 | -7083 | -55 | -7028 |
| 5 | -22561 | -677 | -22253 | -369 | -21884 |
| 6 | -66695 | -511 | -66169 | 15 | -66184 |
| 7 | -43924 | -292 | -43637 | -5 | -43632 |
| 8 | -40520 | -80 | -40467 | -27 | -40440 |
| 9 | -71100 | -76 | -71094 | -18 | -71076 |
| 10 | -137074 | -207 | -136907 | -40 | 136867 |
| 11 | -117869 | -260 | -117688 | -79 | -117609 |
| 12 | -22380 | -2059 | -21329 | -1008 | -20321 |
Computing time(seconds) of various tree building algorithms on different real datasets
| MouseLemurs | 3 | 14 | 7 | 187 | 142 | 276 |
| 4DAT | 1 | 2 | 2 | 362 | 35 | 55 |
| 3DAT | 1 | 7 | 5 | 1582 | 135 | 205 |
| 42 | 2 | 31 | 16 | 666 | 449 | 833 |
| Rbcl55 | 4 | 40 | 89 | 1586 | 1340 | 1733 |
| 101_SC | 10 | 155 | 622 | 26287 | 4421 | 5926 |
| 132 | 8 | 205 | 1255 | 20012 | 10623 | 13171 |
| 150_sc | 24 | 163 | 399 | 26408 | 7206 | 9163 |
| 150_ARB | 24 | 319 | 187 | 54788 | 25217 | 28857 |
| 218_RDPII | 42 | 429 | 6779 | 102388 | 14236 | 19897 |
| 250_ARB | 74 | 799 | 1103 | **** | 20788 | 29804 |
| 500_ZILLA | 92 | 2456 | 29975 | **** | 24528 | 30016 |
Figure 5The illustration of a 2-ECRNJ, where the black nodes denote supernodes, white nodes denote leaves or internal nodes, solid and dashed lines represent branches, dashed lines denotes branch to be deleted and the nodes in dashed cycles denote the ones to be refined.
Figure 6The heuristic ECRML which is based on p-ECRNJ and hill climbing.