| Literature DB >> 27095266 |
Joseph E O'Reilly1, Mark N Puttick1, Luke Parry1, Alastair R Tanner2, James E Tarver1, James Fleming1, Davide Pisani3, Philip C J Donoghue4.
Abstract
Different analytical methods can yield competing interpretations of evolutionary history and, currently, there is no definitive method for phylogenetic reconstruction using morphological data. Parsimony has been the primary method for analysing morphological data, but there has been a resurgence of interest in the likelihood-based Mk-model. Here, we test the performance of the Bayesian implementation of the Mk-model relative to both equal and implied-weight implementations of parsimony. Using simulated morphological data, we demonstrate that the Mk-model outperforms equal-weights parsimony in terms of topological accuracy, and implied-weights performs the most poorly. However, the Mk-model produces phylogenies that have less resolution than parsimony methods. This difference in the accuracy and precision of parsimony and Bayesian approaches to topology estimation needs to be considered when selecting a method for phylogeny reconstruction.Entities:
Keywords: Bayesian; likelihood; morphology; parsimony; phylogenetics
Mesh:
Year: 2016 PMID: 27095266 PMCID: PMC4881353 DOI: 10.1098/rsbl.2016.0081
Source DB: PubMed Journal: Biol Lett ISSN: 1744-9561 Impact factor: 3.703
The differences in median and the 95th percentile range of Robinson–Foulds values between the Mk and both parsimony models are greater in the full dataset compared with the realistic homoplasy subsets. mk, Bayesian Mk model; ew, equal-weights parsimony; iw, implied weights parsimony and its attendant K values.
| 100 characters | 100 characters CI | 350 characters | 350 characters CI | 1000 characters | 1000 characters CI | |
|---|---|---|---|---|---|---|
| mk | 45 (29–64) | 40.5 (28.2–62.5) | 20 (10–51) | 19.5 (10.2–57.3) | 19.5 (10.2–57.3) | 11 (5–27.8) |
| ew | 61 (31–98) | 53 (29–91.8) | 27 (12–70) | 28 (12–74.8) | 28 (12–74.8) | 16 (6.2–43.7) |
| iw k2 | 89 (39–119) | 77 (38.2–117.7) | 36 (18–76) | 36 (17.2–81.3) | 36 (17.2–81.3) | 19.5 (10–35.7) |
| iw k3 | 76 (38–112) | 69 (36.4–108) | 32 (16–69) | 34 (15.2–70) | 34 (15.2–70) | 18 (9.2–35.7) |
| iw k5 | 68 (36–104) | 61 (32.2–102) | 30 (14–66) | 31.5 (15.2–68) | 31.5 (15.2–68) | 18 (9–34) |
| iw k10 | 63 (34–100) | 55.5 (32–98) | 28 (13–68) | 30 (15.2–69.7) | 30 (15.2–69.7) | 16 (8–34) |
| iw k20 | 64 (34–100) | 53 (33–97.8) | 28 (14–68) | 30 (13.2–71.7) | 30 (13.2–71.7) | 17 (8–39.3) |
| iw k200 | 65 (34–100) | 55 (32.2–97.7) | 28 (14–72) | 30.5 (15–76) | 30.5 (15–76) | 18 (8–44) |
Figure 1.Mk tree reconstructions (blue) outperform equal-weights parsimony (grey) and implied-weights parsimony (green) for 100, 350 and 1000 characters (a,c,e,g), and these differences remain in the subset of the simulated data matrices that exhibit realistic levels of homoplasy (b,d,f,h). Bars above the plots mark the 95th percentile range for each method, and dashed vertical lines show the median values. Percentage topology error (g,h) is the Robinson–Foulds value of the reconstructed tree compared with the worst possible value, as shown in [5].
Figure 2.The Mk model exhibits higher accuracy with lower precision than parsimony methods; these results are less clear as more characters are added. Contour plots of Robinson–Foulds distances against the number of resolved nodes in each tree; the contours represent the density of the distribution of trees.