| Literature DB >> 29637019 |
Nicholas J Matzke1,2, Randall B Irmis3,4.
Abstract
Tip-dating, where fossils are included as dated terminal taxa in Bayesian dating inference, is an increasingly popular method. Data for these studies often come from morphological character matrices originally developed for non-dated, and usually parsimony, analyses. In parsimony, only shared derived characters (synapomorphies) provide grouping information, so many character matrices have an ascertainment bias: they omit autapomorphies (unique derived character states), which are considered uninformative. There has been no study of the effect of this ascertainment bias in tip-dating, but autapomorphies can be informative in model-based inference. We expected that excluding autapomorphies would shorten the morphological branchlengths of terminal branches, and thus bias downwards the time branchlengths inferred in tip-dating. We tested for this effect using a matrix for Carboniferous-Permian eureptiles where all autapomorphies had been deliberately coded. Surprisingly, date estimates are virtually unchanged when autapomorphies are excluded, although we find large changes in morphological rate estimates and small effects on topological and dating confidence. We hypothesized that the puzzling lack of effect on dating was caused by the non-clock nature of the eureptile data. We confirm this explanation by simulating strict clock and non-clock datasets, showing that autapomorphy exclusion biases dating only for the clocklike case. A theoretical solution to ascertainment bias is computing the ascertainment bias correction (Mkparsinf), but we explore this correction in detail, and show that it is computationally impractical for typical datasets with many character states and taxa. Therefore we recommend that palaeontologists collect autapomorphies whenever possible when assembling character matrices.Entities:
Keywords: Ascertainment bias; Autapomorphies; BEASTmasteR; Bayesian phylogenetics; Eureptilia; Morphological clock; Parsimony; Phylogenetic dating; Tip-dating; Total-evidence dating
Year: 2018 PMID: 29637019 PMCID: PMC5890724 DOI: 10.7717/peerj.4553
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Comparison of the tip-dated phylogenies of early eureptiles inferred when excluding (A) or including (B) autapomorphies, under Mkv ascertainment bias correction.
Numbers are posterior probabilities. Bars represent the 95% HPD.
Comparison of summary statistics from the five Beast2 runs using “best-practices” tip dates.
| Run # | |||||
|---|---|---|---|---|---|
| Data | Including autapomorphies | Excluding autapomorphies | |||
| Model | M | M | M | M | M |
| Ln posterior | −1393.4 | −1362.2 | −1154.2 | −1144.9 | −1134.4 |
| ESS | 1,801 | 1,485 | 1,801 | 1,801 | 1,801 |
| Root age | 332.6 [330.2, 335.3] | 332.5 [330.0, 335.1] | 332.6 [330.1, 335.1] | 332.6 [330.1, 335.1] | 332.6 [330.0, 335.1] |
| Birth | 0.360 [0.0355, 1.316] | 0.424 [0.0405, 1.708] | 0.342 [0.0463, 1.221] | 0.381 [0.0402, 1.377] | 0.564 [0.0444, 2.841] |
| Death | 0.336 [9.17e−5, 1.315] | 0.3995 [1.13e−4, 1.723] | 0.318 [4.97e−6, 1.220] | 0.357 [2.57e−4, 1.391] | 0.541 [6.37e−4, 2.843] |
| Sampling | 0.0271 [7.90e−4, 0.0626] | 0.0264 [0.00104, 0.0650] | 0.0271 [8.85e−4, 0.063] | 0.0261 [9.96e−4, 0.0634] | 0.0256 [7.66e−4, 0.0643] |
| Clock rate mean | 0.0782 [0.015, 0.159] | 0.0376 [0.0074, 0.0840] | 0.788 [0.0305, 3.982] | 0.550 [0.0228, 2.655] | 0.235 [0.0142, 0.664] |
| Clock rate SD | 1.747 [1.201, 2.399] | 1.712 [1.111, 2.309] | 2.436 [1.572, 3.477] | 2.341 [1.488, 3.379] | 2.079 [1.318, 2.984] |
Figure 2Simulation procedure and results.
Simulation procedure (A) and results (B–E). The lack of an effect of excluding autapomorphies on dating in the empirical eureptile result is similar to the result on non-clock data shown in (E).
Number of patterns that are unobservable in the Mkparsinf model.
| 10 | 63 | 292 | 1,045 | 3,006 | ||
| 12 | 93 | 544 | 2,505 | 9,276 | ||
| 22 | 333 | 4,084 | 42,505 | 381,546 | ||
| 42 | 1,263 | 32,164 | 730,005 | 15,085,086 | ||
| 102 | 7,653 | 500,404 | 30,062,505 | 1,698,527,706 | ||
| 202 | 30,303 | 4,000,804 | 490,250,005 | 57,089,105,406 | ||
| 402 | 120,603 | 32,001,604 | 7,921,000,005 | 1.87E+12 | ||
| 1,002 | 751,503 | 500,004,004 | 3.11E+11 | 1.86E+14 | ||
| 2,002 | 3,003,003 | 4,000,008,004 | 4.99E+12 | 5.97E+15 |