| Literature DB >> 27611687 |
Jamie McCann1, Gerald M Schneeweiss1, Tod F Stuessy1,2, Jose L Villaseñor3, Hanna Weiss-Schneeweiss1.
Abstract
Chromosome number change (polyploidy and dysploidy) plays an important role in plant diversification and speciation. Investigating chromosome number evolution commonly entails ancestral state reconstruction performed within a phylogenetic framework, which is, however, prone to uncertainty, whose effects on evolutionary inferences are insufficiently understood. Using the chromosomally diverse plant genus Melampodium (Asteraceae) as model group, we assess the impact of reconstruction method (maximum parsimony, maximum likelihood, Bayesian methods), branch length model (phylograms versus chronograms) and phylogenetic uncertainty (topological and branch length uncertainty) on the inference of chromosome number evolution. We also address the suitability of the maximum clade credibility (MCC) tree as single representative topology for chromosome number reconstruction. Each of the listed factors causes considerable incongruence among chromosome number reconstructions. Discrepancies between inferences on the MCC tree from those made by integrating over a set of trees are moderate for ancestral chromosome numbers, but severe for the difference of chromosome gains and losses, a measure of the directionality of dysploidy. Therefore, reliance on single trees, such as the MCC tree, is strongly discouraged and model averaging, taking both phylogenetic and model uncertainty into account, is recommended. For studying chromosome number evolution, dedicated models implemented in the program ChromEvol and ordered maximum parsimony may be most appropriate. Chromosome number evolution in Melampodium follows a pattern of bidirectional dysploidy (starting from x = 11 to x = 9 and x = 14, respectively) with no prevailing direction.Entities:
Mesh:
Year: 2016 PMID: 27611687 PMCID: PMC5017664 DOI: 10.1371/journal.pone.0162299
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Chromosome number reconstructions in Melampodium.
Chromosome number reconstructions plotted on 95% majority rule-consensus trees from phylogenetic analysis of (A) nuclear sequence data using BEAST (ITS-B, left) and using MrBayes (ITS-MB, right) and of (B) plastid sequence data using BEAST (matK-B, left) and using MrBayes (matK-MB, right). At each node, the average and, in case of maximum likelihood reconstructions, model-weighted probabilities of ancestral chromosome base numbers are shown (from top to bottom): ordered maximum parsimony (MP), maximum likelihood using ChromEvol (ML-CE), maximum likelihood using BayesTraits (ML-BT), Bayesian Inference using Reversible Jump (BI-RJ). The pie charts represent the fraction of probability that is associated with a particular chromosome number.
Minimum and maximum ΔAICs for the Constant Rate—No Duplication (CRND) model against other ChromEvol models and, in parentheses, the proportion of cases where they are better than the CRND model.
| Dataset | Models | ||||||
|---|---|---|---|---|---|---|---|
| CRD | CRDD | CRDE | LRND | LRD | LRDD | LRDE | |
| ITS-B | -0.954 | -8.609 | -7.979 | 1.038 | 3.038 | -4.045 | -3.571 |
| 4.543 | 4.189 | 5.953 | 9.094 | 11.094 | 11.0943 | 13.517 | |
| (0.02) | (10.82) | (7.24) | (0.27) | (0.20) | |||
| ITS-MB | 1.943 | -6.863 | -6.234 | 2.667 | 4.667 | -3.301 | -2.648 |
| 2.001 | 2.001 | 4.521 | 6.010 | 7.399 | 8.010 | 14.544 | |
| (39.24) | (28.04) | (1.42) | (0.36) | ||||
| matK-B | -1.733 | -8.274 | -7.646 | 0.344 | 2.344 | -4.368 | -3.125 |
| 5.682 | 6.570 | 7.957 | 10.158 | 13.980 | 13.980 | 13.611 | |
| (0.47) | (1.11) | (0.64) | (0.18) | (0.18) | |||
| matK-MB | -0.584 | -7.861 | -7.239 | 0.890 | 2.890 | -4.189 | -3.569 |
| 4.543 | 4.662 | 7.508 | 7.705 | 9.862 | 9.466 | 11.751 | |
| (0.18) | (0.84) | (0.60) | (0.27) | (0.20) | |||
Each data set (ITS-B—nuclear sequence data analyzed using BEAST; ITS-MB—nuclear sequence data set analyzed using MrBayes; matK-B—plastid sequence data analyzed using BEAST; matK-MB—plastid sequence data analyzed using MrBayes) has been analyzed under each of eight models implemented in ChromEvol 2. The predominantly supported model (CRND—Constant Rate—No Duplication model) has been compared against the remaining models (CRD—Constant Rate—Duplication only; CRDD—Constant Rate—identical Demi-duplication and Duplication; CRDE—Constant Rate—Demi-duplication Estimated; LRND—Linear Rate—No Duplication; LRD—Linear Rate—Duplication only; LRDD—Linear Rate—identical Demi-duplication and Duplication; LRDE—Linear Rate—Demi-duplication Estimated; see text for details).
Model uncertainty and model selection uncertainty in maximum likelihood analyses using ChromEvol (values are given as averages and, in parentheses, ranges).
| Dataset | Akaike Weight | Number of Excluded Models | |
|---|---|---|---|
| Best Model | 2nd Best / Best Model | ||
| ITS-B | 0.447 (0.245–0.737) | 0.434 (0.168–1.000) | 2.195 (1–6) |
| ITS-MB | 0.374 (0.266–0.493) | 0.595 (0.368–1.000) | 1.400 (1–4) |
| matK-B | 0.464 (0.234–0.699) | 0.388 (0.130–1.000) | 2.171 (1–5) |
| matK-MB | 0.454 (0.238–0.650) | 0.379 (0.140–1.000) | 2.112 (1–4) |
For each data set (abbreviations as in Table 1) model uncertainty has been quantified using the best model’s Akaike weight (ranging from 0 to 1: the higher the weight the lower model uncertainty; column “Best Model”); model selection uncertainty has been quantified using the ratio of Akaike weights from the second best against the best model (ranging from 0 to 1: the higher the value the higher model selection uncertainty; column “2nd Best / Best Model”).
Minimum and maximum ΔAICs for the one-rate model against other BayesTraits models and, in parentheses, the proportion of cases where they are better than the one-rate model.
| Dataset | Models | |
|---|---|---|
| ITS-B | -61.048 | -69.774 |
| 12.442 | 2.000 | |
| (1.11) | (4.82) | |
| ITS-MB | -60.881 | -70.982 |
| 10.498 | 1.597 | |
| (1.80) | (9.87) | |
| matK-B | -33.241 | -43.139 |
| 12.823 | 2.000 | |
| (0.44) | (3.20) | |
| matK-MB | -29.479 | -37.926 |
| 12.148 | 2.000 | |
| (1.09) | (6.36) | |
Each data set (abbreviations as in Table 1) has been analyzed under each of three models implemented in BayesTraits 2. The predominantly supported model (one-rate model) has been compared against the remaining models (two-rate model, multi-rate model; see text for details).
Model uncertainty and model selection uncertainty in maximum likelihood analyses using BayesTraits (values are given as averages and, in parentheses, ranges).
| Dataset | Akaike Weight | Number of Excluded Models | |
|---|---|---|---|
| Best Model | 2nd Best / Best Model | ||
| ITS-B | 0.662 (0.500–1.000) | 0.434 (0.002–0.992) | 1.014 (1–2) |
| ITS-MB | 0.607 (0.481–0.995) | 0.648 (0.005–1.000) | 1.017 (0–2) |
| matK-B | 0.674 (0.499–0.993) | 0.485 (0.006–0.995) | 1.008 (1–2) |
| matK-MB | 0.650 (0.496–0.996) | 0.539 (0.004–1.000) | 1.013 (1–2) |
For each data set (abbreviations as in Table 1) model uncertainty has been quantified using the best model’s Akaike weight (ranging from 0 to 1: the higher the weight the lower model uncertainty; column “Best Model”); model selection uncertainty has been quantified using the ratio of Akaike weights from the second best against the best model (ranging from 0 to 1: the higher the value the higher model selection uncertainty; column “2nd Best / Best Model”).
Tree-wide and node-wise reconstruction precision (RP and RP).
| Dataset | Method | Reconstruction Precision | ||
|---|---|---|---|---|
| ITS-B | MP | 0.978 | 0.946 | 0.741–1.000 |
| ML-CE | 0.970 | 0.984 | 0.814–1.000 | |
| ML-BT | 0.926 | 0.925 | 0.500–0.999 | |
| BI-RJ | 0.949 | 0.954 | 0.574–1.000 | |
| ITS-MB | MP | 0.992 | 1.000 | 0.898–1.000 |
| ML-CE | 0.980 | 0.992 | 0.871–1.000 | |
| ML-BT | 0.932 | 0.928 | 0.413–1.000 | |
| BI-RJ | 0.937 | 0.962 | 0.489–1.000 | |
| matK-B | MP | 0.996 | 1.000 | 0.898–1.000 |
| ML-CE | 0.949 | 0.983 | 0.720–1.000 | |
| ML-BT | 0.948 | 0.956 | 0.588–1.000 | |
| BI-RJ | 0.954 | 0.960 | 0.612–1.000 | |
| matK-MB | MP | 1.000 | 1.000 | 0.993–1.000 |
| ML-CE | 0.951 | 0.979 | 0.713–1.000 | |
| ML-BT | 0.947 | 0.971 | 0.578–1.000 | |
| BI-RJ | 0.967 | 0.978 | 0.623–1.000 | |
Each data set (abbreviations as in Table 1) has been analyzed using each of four methods (MP—ordered Maximum Parsimony; ML-CE—Maximum Likelihood using ChromEvol; ML-BT—Maximum Likelihood using BayesTraits; BI-RJ—Bayesian Inference using Reversible Jump). Tree-wide reconstruction precision (RP) has been calculated on the majority rule consensus tree (MRC) and on the maximum clade credibility tree (MCC); node-wise reconstruction precision (RP) has been calculated for each of the posterior trees and is given as ranges.
Characteristics of the distribution of chromosome gains minus chromosome losses (G-L).
| Dataset | Method | Mean / Median (Range) | Mode | MCC Tree |
|---|---|---|---|---|
| ITS-B | MP | -0.341 / 0.000 (-4.000–4.000) | -3.000 | -1.000 |
| ML-CE | 0.802 / -0.414 (-5.012–15.900) | -2.615 | 6.365 | |
| ML-BT | -13.080 / -17.960 (-29.050–25.700) | -17.949 | -19.394 | |
| BI | -6.222 / -7.956 (-188.589–151.621) | -8.253 | n.a. | |
| ITS-MB | MP | -2.179 / -2.000 (-3.000–2.000) | -2.000 | -3.000 |
| ML-CE | -2.348 / -2.314 (-5.022–0.826) | -2.400 | -2.474 | |
| ML-BT | -17.021 / -17.131 (-48.6761–-0.799) | -16.476 | -16.116 | |
| BI | -15.262 / -14.203 (-131.142–70.216) | -11.441 | n.a. | |
| matK-B | MP | 0.1300 / -0.330 (-2.300–5.670) | -0.305 | -0.330 |
| ML-CE | 3.912 / 5.405 (-4.602–18.620) | -1.278 | 5.924 | |
| ML-BT | -9.952 / -11.697 (-27.108–52.297) | -10.986 | -10.506 | |
| BI | -0.737 / -2.622 (-163.757–130.269) | -7.996 | n.a. | |
| matK-MB | MP | -0.327 / -0.330 (-2.330–3.990) | -0.239 | -1.330 |
| ML-CE | -0.229 / -1.500 (-5.233–18.300) | -2.262 | -0.743 | |
| ML-BT | -12.690 / -13.720 (-24.560–31.230) | -16.276 | -13.940 | |
| BI | -4.957 / -5.899 (-106.285–91.412) | -5.303 | n.a. |
Each data set (abbreviations as in Table 1) has been analyzed using each of four methods (MP—ordered Maximum Parsimony; ML-CE—Maximum Likelihood using ChromEvol; ML-BT—Maximum Likelihood using BayesTraits; BI-RJ—Bayesian Inference using Reversible Jump). For the distributions of the test statistic G-L (difference between chromosome gains and chromosome losses) mean, median, range and mode (calculated using the Chernoff mode estimator with bandwidth of 0.5 as implemented in the R package modeest) are given as well as the G-L values for the maximum clade credibility (MCC) trees.
Fig 2Distributions of the number of chromosome gains minus the number of chromosome losses (G-L) in Melampodium.
G-L distributions reconstructed on phylogenetic trees obtained from analyses of (A) nuclear sequence data using BEAST (ITS-B), (B) nuclear sequence data using MrBayes (ITS-MB), (C) plastid sequence data using BEAST (matK-B) and (D) plastid sequence data using MrBayes (matK-MB). Methods of chromosome number reconstruction are indicated by colors: black—ordered maximum parsimony (MP); purple—maximum likelihood using ChromEvol (ML-CE; results are shown for the Constant-Rate No Duplication (CRND) model); white—maximum likelihood using BayesTraits (ML-BT; results are shown for the two-rate model); grey—Bayesian Inference (BI; results are shown for the two-rate model). Arrows indicate positions of the Maximum Clade Credibility (MCC) trees. Inserts show the full G-L distributions from the BI analysis, which are truncated in the main figure to aid legibility.
Colless Imbalance and Stemminess Indices (given as mean / median and, in parentheses, range) of the phylogenetic trees used in the analyses.
| Dataset | Colless Imbalance | Stemminess | |
|---|---|---|---|
| Original | Ultrametricised | ||
| ITS-B | 0.188 / 0.191 (0.084–0.350) | 1.487 / 0.604 (0.104–288.930) | |
| ITS-MB | 0.230 / 0.243 (0.115–0.316) | 1.302 / 0.589 (0.264–341.094) | 1.086 / 0.672 (0.241–262.604) |
| matK-B | 0.162 / 0.153 (0.106–0.311) | 1.453 / 0.668 (0.148–689.416) | |
| matK-MB | 0.188 / 0.189 (0.117–0.292) | 4.687 / 1.279 (0.346–2293.74) | 2.567 / 0.809 (0.239–799.423) |
For the trees of each data set (abbreviations as in Table 1) Colless Imbalance and Stemminess have been calculated (see text for details), the latter both for trees with the original branch lengths (column “Original”) and (in case of trees obtained from MrBayes) for trees, whose branch lengths have been modified so that trees become ultrametric using PATHd8 (column “Ultrametricized”).