| Literature DB >> 25429012 |
Rachel C M Warnock1, James F Parham2, Walter G Joyce3, Tyler R Lyson4, Philip C J Donoghue5.
Abstract
Calibration is the rate-determining step in every molecular clock analysis and, hence, considerable effort has been expended in the development of approaches to distinguish good from bad calibrations. These can be categorized into a priori evaluation of the intrinsic fossil evidence, and a posteriori evaluation of congruence through cross-validation. We contrasted these competing approaches and explored the impact of different interpretations of the fossil evidence upon Bayesian divergence time estimation. The results demonstrate that a posteriori approaches can lead to the selection of erroneous calibrations. Bayesian posterior estimates are also shown to be extremely sensitive to the probabilistic interpretation of temporal constraints. Furthermore, the effective time priors implemented within an analysis differ for individual calibrations when employed alone and in differing combination with others. This compromises the implicit assumption of all calibration consistency methods, that the impact of an individual calibration is the same when used alone or in unison with others. Thus, the most effective means of establishing the quality of fossil-based calibrations is through a priori evaluation of the intrinsic palaeontological, stratigraphic, geochronological and phylogenetic data. However, effort expended in establishing calibrations will not be rewarded unless they are implemented faithfully in divergence time analyses.Entities:
Keywords: Bayesian; calibration; cross-validation; fossil record; molecular clock; priors
Mesh:
Year: 2015 PMID: 25429012 PMCID: PMC4262156 DOI: 10.1098/rspb.2014.1013
Source DB: PubMed Journal: Proc Biol Sci ISSN: 0962-8452 Impact factor: 5.349
Figure 1.The posterior mean estimates obtained when the truncated Cauchy distribution was used to approximate the time of divergence relative to fossil-based minima in MCMCTree. Results are shown for two values of the location parameter p (0.1, 0.5) and four values of the scale parameter c (0.1, 0.5, 1, 2) and results are contrasted to those obtained when a uniform distribution is used to constrain node ages between the fossil-based minima and maxima. The branching order (and corresponding node labels) is the same as those shown in figure 3. The branch lengths represent the posterior means of the node ages. Four nodes are connected across the analyses to facilitate comparison. The 95% higher posterior density is indicated at the root of the tree. Ages are presented in millions of years before present. (Online version in colour.)
Figure 3.Plots contrasting the user-specified uniform calibration priors (dashed lines), firstly, with the effective marginal priors observed when each node is used for calibration during independent rounds of cross-validation (grey plots) and secondly, with the effective priors observed when all calibrations are combined in the final analysis in MCMCTree (black lines). This diagram illustrates how the interaction between different calibrations in the joint prior can result in effective priors that deviate substantially from the initial user-specified distributions.
Figure 2.(a–d) Histograms showing the average difference () (a,b) and the sum of squared differences (SS) (c,d) between molecular- and fossil-based estimates obtained during each round of cross-validation. The values obtained when fossil-based minima are used to calculate (a) and SS (c), and the values obtained when both minima and maxima are considered in the estimation of (b) and SS (d). The error bars shown in (a,b) reflect the standard error of the mean. (e–f) Plots showing the impact of sequentially removing calibrations on the average squared deviation(s) between molecular—and fossil—estimates considering minima only (e) and both minima and maxima (f). Sequential removal of the most inconsistent calibrations revealed a consistent, albeit statistically non-significant increase in s when only the fossil-based minima are used in the estimation of SS (f). By contrast, there is a steady albeit statistically non-significant decline in s when minimum and maximum temporal constraints are used to calculate SS (e). The value of s will increase if the rate of change of the denominator, n(n – 1) in the equation used to calculate s, exceeds the rate of change of the numerator ∑SS, where n represents the total number of calibrations [38]. (g) Illustrates the percentage change in ∑SS with the sequential removal of calibrations, estimated using minima (black circles) or both minima and maxima (black squares). This is contrasted to the rate of change of n(n – 1), where n represents to total number of calibrations used to calculate s (open circles). The rate of change of n(n – 1) always exceeds the rate of change of ∑SS when fossil minima are used in the calculation, hence we observe an increase in s. The converse is true when both minimum and maximum constraints are incorporated into the estimation of SS. Changes in s fluctuate markedly with the removal of calibrations as the value of s approaches zero; SS = 0 with the inclusion of the final 10 calibrations. (h) The effect of truncation in the establishment of the joint time prior at the root (node 1, figure 3) in BEAST (black) and MCMCTree (red). The dashed line represents the user-specified uniform prior. (a–g) All values were estimated and are presented in terms of millions of years. (Online version in colour.)
A posteriori versus a priori approaches to assessing calibration quality. Dash symbols denote n.a.
| nodea | divergence | fossilb | minimumc | maximumd | minimum | maximum | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Near | Marshall [ | Dornburg | Joyce | |||||||
| 1 (1) | Testudines | 210F/F/M | — | consistent | — | omitted | phyl. misplaced | 155.6 | 251.4 | |
| 2 (3) | Pleurodira | 110F−M/F/M | 134.4 | consistent | — | consistent | accurate minima | 111 | 165.2 | |
| 3 (4) | Pelomedusoides | 110F–M/F/M | 134.4 | consistent | — | consistent | accurate minima | 92.8 | 149.5 | |
| 4 (14) | Pelomedusidae | 18F/F/M | 22.0 | consistent | — | inconsistent | inaccurately dated | 5.3 | 149.5 | |
| 5 (11) | Chelidae | 71F/F/M | 86.7 | inconsistent | — | consistent | accurate minima | 65.2 | 149.5 | |
| 6 (15) | Chelodininae | 15F/F/M | 18.3 | inconsistent | — | inconsistent | accurate minima | 11.6 | 149.5 | |
| 7 (16) | 11.6F/F/M | 14.2 | inconsistent | — | inconsistent | accurate minima | 13.4 | 149.5 | ||
| 8 (2) | Cryptodira | 110F–M/F/M | — | consistent | — | omitted | phyl. misplaced | 124 | 200.2 | |
| 10 (6) | Trionychidae | 100F–M/F/M | 122.1 | consistent | selected for calibration | consistent | phyl. untested | 17.3 | 149.5 | |
| 13 (5) | Chelonioidea | 110F/F/M | 134.3 | inconsistent | inconsistent | consistent/ inconsistent | phyl. misplaced | 48.4 | 149.5 | |
| 15 (7) | Kinosternoidea | 65F/F/M | 79.4 | consistent | — | consistent | accurate minima, alternative fossils now available | 70 | 149.5 | |
| 16 (10) | Kinosternidae | 50F/F/M | 61.1 | consistent | — | consistent | accurate minima | 52.8 | 149.5 | |
| 19 (12) | Emydidae | 34F/F/M | 41.5 | consistent | — | consistent | accurate minima | 32 | 100.5 | |
| 20 (17) | 5F/F/M | 6.1 | inconsistent | — | inconsistent | phyl. untested | 3 | 34 | ||
| 21 (8) | Testuguria | 52F/F/M | 63.5 | consistent | — | consistent | accurate minima | 50.3 | 100.5 | |
| 22 (9) | “ | 50F/F/M | 61.1 | inconsistent | — | consistent | phyl. untested | 5.3 | 65.8 | |
| — (13) | — | Lindholmemydidae | 90F/F/M | 109.9 | consistent | — | consistent | phyl. untested | – | – |
| 9 (18) | Trionychia | — | — | — | — | — | — | — | 124 | 177.6 |
| 11 (19) | Durocryptodira | — | — | — | — | — | — | — | 88.6 | 149.5 |
| 12 (20) | Americhelydia | — | — | — | — | — | — | — | 70 | 149.5 |
| 14 (21) | Chelydroidea | — | — | — | — | — | — | — | 70 | 149.5 |
| 17 (—) | Testudinoidea | — | — | — | — | — | — | — | 50.3 | 149.5 |
| 18 (—) | Emydidae– | — | — | — | — | — | — | — | 32 | 100.5 |
aNode ID corresponds to those shown in figure 3. Numbers in brackets refer to the corresponding ID in Near et al. [26].
bFossil specimens used to assess calibration quality a posteriori.
cMinimum (M) or fixed (F) age constraint used to assess calibration quality a posteriori in Near et al. [26], Marshall [24] and Dornburg et al. [25].
dSoft maximum age constraint used to assess calibration quality a posteriori in Dornburg et al. [25]. Maxima established using Marshall [24]; eqn. (11).
eA priori assessment of minimum fossil constraints only. Assessments indicate whether fossil specimens have been phylogenetically (phy.) misplaced or untested, inaccurately dated or provide appropriate minimum constraints.
Figure 4.Plot contrasting the prior and posterior estimates (mean and 95% credibility intervals) obtained using BEAST (black) and MCMCTree (red). Node labels correspond to those shown in figure 3. (Online version in colour.)