| Literature DB >> 23813979 |
Claire C Morgan1, Peter G Foster, Andrew E Webb, Davide Pisani, James O McInerney, Mary J O'Connell.
Abstract
Heterogeneity among life traits in mammals has resulted in considerable phylogenetic conflict, particularly concerning the position of the placental root. Layered upon this are gene- and lineage-specific variation in amino acid substitution rates and compositional biases. Life trait variations that may impact upon mutational rates are longevity, metabolic rate, body size, and germ line generation time. Over the past 12 years, three main conflicting hypotheses have emerged for the placement of the placental root. These hypotheses place the Atlantogenata (common ancestor of Xenarthra plus Afrotheria), the Afrotheria, or the Xenarthra as the sister group to all other placental mammals. Model adequacy is critical for accurate tree reconstruction and by failing to account for these compositional and character exchange heterogeneities across the tree and data set, previous studies have not provided a strongly supported hypothesis for the placental root. For the first time, models that accommodate both tree and data set heterogeneity have been applied to mammal data. Here, we show the impact of accurate model assignment and the importance of data sets in accommodating model parameters while maintaining the power to reject competing hypotheses. Through these sophisticated methods, we demonstrate the importance of model adequacy, data set power and provide strong support for the Atlantogenata over other competing hypotheses for the position of the placental root.Entities:
Keywords: evolutionary models; heterogeneous modeling; mammal phylogeny; phylogenetic reconstruction; placental root
Mesh:
Year: 2013 PMID: 23813979 PMCID: PMC3748356 DOI: 10.1093/molbev/mst117
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FThree major rooting hypotheses for placental mammals. The three major competing hypotheses for the position of the root are depicted. (A) The Afrotheria root, (B) the Xenarthra root, and (C) the Atlantogenata root.
Summary of Data Sets Analyzed.
| Data Sets | Data Set Type | Alignment Length |
|---|---|---|
| 66TaxonSet_nuc | Nucleotide | 9,789 bp |
| 66TaxonSet_aa | Amino acid | 2,190 aa |
| 66TaxonSet_day | Dayhoff recoded 6 | 2,190 characters |
| 39TaxonSet_aa | Amino acid | 23,900 aa |
| 39TaxonSet_day | Dayhoff | 23,900 |
Note.—The name of each data set as per main text, the data type and corresponding alignment lengths are shown.
Models Applied Using P4.
| Model | Number of GTR Rate Matrices | Number of Composition Vectors | Parameters |
|---|---|---|---|
| 1GTR + 1C + I + 4Γ | 1 | 1 | 12 |
| 2GTR + 1C + I + 4Γ | 2 | 1 | 18 |
| 3GTR + 1C + I + 4Γ | 3 | 1 | 24 |
| 4GTR + 1C + I + 4Γ | 4 | 1 | 30 |
| 5GTR + 1C + I + 4Γ | 5 | 1 | 36 |
| 6GTR + 1C + I + 4Γ | 6 | 1 | 42 |
| 1GTR + 2C + I + 4Γ | 1 | 2 | 16 |
| 1GTR + 3C + I + 4Γ | 1 | 3 | 20 |
| 1GTR + 4C + I + 4Γ | 1 | 4 | 24 |
| 1GTR + 5C + I + 4Γ | 1 | 5 | 28 |
| 1GTR + 6C + I + 4Γ | 1 | 6 | 32 |
| 1GTR + 7C + I + 4Γ | 1 | 7 | 36 |
| 2GTR + 5C + I + 4Γ | 2 | 5 | 34 |
| 3GTR + 5C + I + 4Γ | 3 | 5 | 40 |
| 4GTR + 5C + I + 4Γ | 4 | 5 | 46 |
Note.—The model codes used in this study; the number of GTR rate matrices applied to the data; the number of composition vectors estimated; and the number of free parameters estimated for each model. All parameters were free to vary. In all cases, the proportion of invariable sites (+I) = 0.3 and the gamma distributed associated rate variation (+Γ) = 0.7.
FFit of alternative models applied to 66TaxonSet_nuc, and posterior predictive simulations for the homogeneous model and heterogeneous model of “best-fit” for 66TaxonSet_nuc. Models are shown on x axis. The specifications of each model, that is, the number of rate matrices and composition vectors, are given inside the bar that represents that model. The y axis shows the ln L score, the higher the bar the worse the fit of the model. Inset (A) shows posterior predictive simulations for the homogeneous model (1GTR + 1C + I + 4Γ). Inset (B) shows posterior predictive simulation for the best-fit heterogeneous model (3GTR + 5C + I + 4Γ). The black bar graphs in insets (A) and (B) represent the posterior predictive simulations of these models on 66TaxonSet_nuc, and the arrow in each inset represents the χ2 position in the simulation for the real data.
Summary of Data Quality and Phylogenetic Tests Applied to 66TaxonSet and 39TaxonSet.
| Data Sets | Compositional Heterogeneity Detected | P4 | PhyloBayes | Root Position (P4/PhyloBayes) | Reject Competing Hypotheses 2Ln(BF) = 6–10 | Single strongly Supported Hypothesis |
|---|---|---|---|---|---|---|
| 66TaxonSet_nuc | Yes | 3GTR + 5C + I + 4Γ | 1GTR + 1C + 4Γ | Atlantogenata | Afrotheria and Xenarthra | None |
| 66TaxonSet_aa | Yes | 1JTT + 4C + I + 4Γ | CAT | Xenarthra | Afrotheria and Atlantogenata | None |
| 66TaxonSet_day | No | 5GTR + 2C + I + 4Γ | 1GTR + 1C + 4Γ | Xenarthra/Xenarthra | None | None |
| 39TaxonSet_nuc | Yes | 2GTR + 4C + 4Γ | CAT | Atlantogenata/Atlantogenata | None | None |
| 39TaxonSet_aa | Yes | 1JTT + 5C + 4Γ | CAT_GTR | Atlantogenata/Atlantogenata | — | Atlantogenata |
| 39TaxonSet _day | Yes | 2GTR + 4C + 4Γ | CAT_GTR | Atlantogenata/Atlantogenata | Afrotheria and Xenarthra | Atlantogenata |
Note.—The results of the χ2 test of compositional homogeneity are shown. The model of best fit from P4 and PhyloBayes analyses are given. The placental root supported by each data set is indicated for both the P4 and the PhyloBayes analyses. The ability of each data set to reject alternative rooting hypotheses is given (2(ln BF) > 6 indicates alternative rooting hypotheses can be strongly rejected).
aRooting hypotheses that have a PP support value of <0.50, that is, poor support. A hypothesis is denoted as being the “single strongly supported hypothesis” if it has both 1) a high PP at the placental root and 2) all other hypotheses could be rejected.
bThis model did not fit the data.
FMammal phylogeny reconstructed using two independent approaches. Phylogeny reconstruction carried out using CAT-GTR model on the 39TaxonSet_aa in PhlyoBayes v3.2 and on 39TaxonSet_day using the heterogeneous model 2GTR + 4C + 4Γ in P4. The support values for both methods are given at each node: the numerator is the Bayesian support value for nodes based on CAT-GTR in PhyloBayes; the denominator is the support value based on the 2GTR + 4C + 4Γ model in P4. Support values for the 2GTR + 4C + 4Γ model are shown where they are in agreement with the CAT-GTR model topology. The color scheme represents the major groups of mammals: red, Euarchontoglires; blue, Laurasiatheria; green, Afrotheria; purple, Xenarthra; gray, Marsupials (* represents compositionally heterogeneous taxa). Gallus gallus and Ornithorhynchus anatinus were both compositionally heterogeneous but were excluded in this figure to retain clarity of placental mammal branch lengths.