| Literature DB >> 26733575 |
James E Tarver1, Mario Dos Reis2, Siavash Mirarab3, Raymond J Moran4, Sean Parker5, Joseph E O'Reilly5, Benjamin L King6, Mary J O'Connell4, Robert J Asher7, Tandy Warnow8, Kevin J Peterson9, Philip C J Donoghue5, Davide Pisani10.
Abstract
Placental mammals comprise three principal clades: Afrotheria (e.g., elephants and tenrecs), Xenarthra (e.g., armadillos and sloths), and Boreoeutheria (all other placental mammals), the relationships among which are the subject of controversy and a touchstone for debate on the limits of phylogenetic inference. Previous analyses have found support for all three hypotheses, leading some to conclude that this phylogenetic problem might be impossible to resolve due to the compounded effects of incomplete lineage sorting (ILS) and a rapid radiation. Here we show, using a genome scale nucleotide data set, microRNAs, and the reanalysis of the three largest previously published amino acid data sets, that the root of Placentalia lies between Atlantogenata and Boreoeutheria. Although we found evidence for ILS in early placental evolution, we are able to reject previous conclusions that the placental root is a hard polytomy that cannot be resolved. Reanalyses of previous data sets recover Atlantogenata + Boreoeutheria and show that contradictory results are a consequence of poorly fitting evolutionary models; instead, when the evolutionary process is better-modeled, all data sets converge on Atlantogenata. Our Bayesian molecular clock analysis estimates that marsupials diverged from placentals 157-170 Ma, crown Placentalia diverged 86-100 Ma, and crown Atlantogenata diverged 84-97 Ma. Our results are compatible with placental diversification being driven by dispersal rather than vicariance mechanisms, postdating early phases in the protracted opening of the Atlantic Ocean.Entities:
Keywords: genome; mammalian; microRNA; palaeontology; phylogeny; placental
Mesh:
Substances:
Year: 2016 PMID: 26733575 PMCID: PMC4779606 DOI: 10.1093/gbe/evv261
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.The three principal competing hypotheses for the higher-level relationships among placental mammals, with either (a) Afrotheria, (b) Xenarthra, or (c) Atlantogenata being the sister taxon to all other placentals.
Total Size of All Five Data Sets Analyzed and the Percentage of Missing Data in Each
| Data Set | Total Sites | % Missing |
|---|---|---|
| miRNAs | 674,100 | 22 |
| Nucleotide | 770,794,920 | 39 |
| 7,116,417 | 21 | |
| 522,790 | 8 | |
| 1,065,012 | 52 |
All 23 Fossil Calibrations Used in This Study
| Node | Minimum Soft Bound | Maximum Soft Bound | References | |
|---|---|---|---|---|
| 37 | Mammalia—Root | 201.1 | 252.23 | Herein—see below |
| 38 | Theria | 156.3 | 169.6 | |
| 39 | Marsupialia | 47.6 | 131.3 | |
| 40 | Placentalia | — | 164.6 | |
| 42 | Xenarthra | 47.6 | — | |
| 43 | Afrotheria | 56.0 | — | |
| 47 | Eulipotyphla | 61.6 | — | |
| 49 | Chiroptera | 45.0 | 58.9 | |
| 51 | Carnivora | 37.3 | 66.0 | |
| 52 | Euungulata | 62.5 | — | |
| 53 | Artiodactyla | — | 66.0 | |
| 55 | Dolphin/Cow | 52.4 | — | |
| 56 | Euarchontoglires | 61.6 | — | |
| 59 | Lagomorpha | 47.6 | 66.0 | |
| 60 | Rodentia | 56.0 | 66.0 | |
| 61 | Guinea Pig/Rat | 47.6 | 59.2 | |
| 63 | Muridae | 10.4 | 14.0 | |
| 64 | Primates | 56.0 | — | |
| 65 | Strepsirrhini | 33.9 | 56.0 | |
| 67 | Anthropoidea | 33.9 | — | |
| 68 | Catarrhini | 24.44 | 33.9 | |
| 69 | Hominidae | 11.6 | — | |
| 71 | Hominini | 6.5 | 10.0 | |
Note.—There are 12 joint (maximum and minimum), two maximum and nine minimum bounds with all maximum and minimum bounds being ‘soft’. Although many of the same nodes are calibrated as in dos Reis et al. (2012), only three of the calibrations are retained with all of the others being revised due to:
aChange to a different but existing fossil.
bDiscovery of a new fossil.
cRevision of timescale.
dRevision of phylogeny.
Fig. 2.Results from four of the phylogenetic analyses with each one providing support for Atlantogenata as the sister taxon to all other eutherians. (a) The 21.4 million whole-genome nucleotide alignment analyzed using Phylobayes (CAT–GTR+G), RAxML and ASTRAL with support values for almost all nodes being either 1 or 100. (b) The single concatenated nucleotide alignment for the pre-mir sequences analyzed under GTR+G in Phylobayes. Laurasiatheria is shown collapsed as the interrelationships among the constituent taxa vary between data sets.
Results from the Likelihood Tests of the 21.4m Nucleotide Data Set
| Topology | GTR+G4, 1st+2nd Sites, 1 Partition | GTR+G4, 1st +2nd+3rd Sites, 1 Gene Per Partition | AIC Test of Significance | ||||
|---|---|---|---|---|---|---|---|
| lnL | Delta lnL | AU test | lnL | Delta lnL | % | % | |
| Afrotheria | 1125 | 664 | 33.84 | 0.12 | |||
| Xenarthra | 1518 | 1113 | 29.90 | 0.22 | |||
| Indecisive | 2.30 | 99.46 | |||||
Note.—The total log likelihoods for the single partition (1st and 2nd sites) were calculated using BASEML under a GTR+G4 model, with the AU test being conducted on these log likelihoods, and showing unequivocal statistical support for Atlantogenata. Additional log likelihoods were then calculated for each individual gene from a reduced dataset of 11,169 genes (see methods) using a GTR+G4 model with all sites included; given the size of this dataset it is computationally impossible to conduct the AU test (as above) although it is clear that Atlantogenata is the most highly supported topology based on the Delta lnL values. Intriguingly, this topology was not supported by a majority of the genes with approximately 30–33% of genes supporting each alternate topology. However, results of the the AIC test of significance show that 99.46% of genes were unable to distinguish between the three competing hypotheses, while the distribution of support for competing topologies reflects the weak phylogenetic signal present in single gene alignments.
Fig. 3Results from the discordance analysis of the unbinned gene trees with a threshold bootstrap support value of 50% (“left”) and 75% (“right”). These results clearly show that Atlantogenata is the preferred topology, and that much of the incongruence observed across gene trees is due to stochastic errors and not ILS.
Shows the Number of Genes, Either Binned or Unbinned Which Support One of Five Outcomes
| Binned (50%) | Unbinned (50%) | Binned (75%) | Unbinned (75%) | |
|---|---|---|---|---|
| Reject all three hypotheses | 2994 | 2418 | 1673 | 763 |
| Indecisive | 1788 | 5256 | 5293 | 9079 |
| Xenarthra | 1751 | 876 | 1113 | 281 |
| Afrotheria | 1969 | 926 | 1199 | 303 |
| Atlantogenata | 2667 | 1693 | 1891 | 743 |
| Sum of three hypotheses | 6387 | 3495 | 4203 | 1327 |
| % Supporting Atlantogenata | 0.417566933 | 0.484406295 | 0.449916726 | 0.55990957 |
| Length in coalescent units | 0.135075898 | 0.256971108 | 0.192220497 | 0.415309943 |
Note.—A gene tree can either reject all three hypotheses (i.e., when Xenarthra, Afrotheria, Boreoeutheria, or the branch uniting the three outgroups are rejected), or be indecisive (i.e., be compatible with all three hypotheses; this happens when in the collapsed gene tree, the relationship between Xenarthra, Afrotheria, Boreoeutheria is unresolved), or can support one the three hypotheses. The number of genes that support Atlantogenata is divided by the total number of gene trees that support one of the three hypotheses giving a percentage which can then be used to calculate branch lengths in coalescent units following Degnan and Rosenberg (2009), see Materials and Methods.
Posterior Predictive Analyses Conducted to Assess the Fit of the Model to the Data
| Nucleotide | miRNAs | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| JTT+G | CAT–GTR+G | WAG2000–G+I | CAT–GTR+G | LG+G | CAT–GTR+G | GTR+G | CAT–GTR+G | GTR+G | CAT–GTR+G | |
| Observed Diversity | 3.1336 | 3.1336 | 1.8485 | 1.8485 | 2.1998 | 2.1998 | 3.1998 | 3.1998 | 1.3715 | 1.3715 |
| Posterior Predictive | 3.5652 | 3.1694 | 2.0711 | 1.8597 | 2.3331 | 2.2086 | 3.2733 | 3.2038 | 1.3297 | 1.4800 |
| PP Value | 0 | 0.12 | 0 | 0.2381 | 0 | 0.4367 | 0 | 0.3333 | 0.9588 | 0.0557 |
Note.—For each of the three previously published data sets, the models used in the original studies, JTT+G, WAG2000+G and LG+G, did not adequately fit the data. In comparison the CAT–GTR+G model, which we used in the reanalyses was an adequate fit to the data. For our nucleotide and miRNAs data sets the CAT–GTR+G model was compared with a GTR+G model, for the nucleotide analysis CAT+GTR+G was found to be the best fitting model, while for the miRNAs data set it was the GTR+G model, in both instances the better fitting model was used.
Fig. 4.Results from the molecular clock analysis showing the divergence times for placental lineages with all posterior probabilities shown in “green” and overlaid on the joint prior shown in “red,” with both shaded to show values of highest likelihood (see table 6 for the 95% HPD values). Current biogeographic reconstructions for the breakup of Pangea at 180, 120, and 90 Ma, from “left to right,” respectively, with hotter colors (“red”) indicating faster rates of sea floor formation than colder colors (“blue”) based on Seton et al. (2012) and downloadable from http://www.earthbyte.org/Resources/global_plate_model_ESR12.html. Both the Northern and Southern hemisphere continents have separated by 90 Ma, highlighting the role of dispersal, rather than vicariance, for the biogeographic distribution of crown placentals as the breakup of Pangaea predates current molecular clock estimates for the divergence of crown placentals.
Prior and Posterior Divergence Times for All Nodes in the Mammal Tree
| Node | Marginal Prior | Posterior | |||||
|---|---|---|---|---|---|---|---|
| Mean | 95% HPD | Mean | 95% HPD | ||||
| Lower | Upper | Lower | Upper | ||||
| 37 | Mammalia—Root | 226.58 | 200.99 | 252.03 | 223.75 | 200.47 | 251.31 |
| 38 | Theria | 163.55 | 156.45 | 169.68 | 163.92 | 156.67 | 169.79 |
| 39 | Marsupialia | 95.24 | 49.77 | 132.30 | 78.28 | 49.11 | 104.26 |
| 40 | Placentalia | 144.55 | 115.18 | 166.80 | 92.96 | 86.43 | 99.91 |
| 41 | Atlantogenata | 119.81 | 78.70 | 160.81 | 90.32 | 83.73 | 96.54 |
| 42 | Xenarthra | 80.50 | 47.46 | 125.59 | 67.08 | 56.62 | 76.83 |
| 43 | Afrotheria | 90.58 | 55.92 | 134.03 | 72.54 | 64.85 | 79.20 |
| 44 | Paenungulata | 49.22 | 0.12 | 91.45 | 60.21 | 51.45 | 67.79 |
| 45 | Boreotheria | 135.49 | 104.09 | 163.06 | 85.07 | 79.93 | 90.42 |
| 46 | Laurasiatheria | 121.96 | 84.99 | 157.18 | 77.74 | 73.75 | 81.96 |
| 47 | Eulipotyphla | 92.65 | 61.31 | 132.29 | 64.35 | 61.49 | 67.63 |
| 48 | Scrotifera | 107.66 | 70.35 | 146.41 | 74.82 | 71.17 | 78.50 |
| 49 | Chiroptera | 52.07 | 44.98 | 58.88 | 58.23 | 55.71 | 60.28 |
| 50 | Carnivora/Euungulata | 95.27 | 64.18 | 131.20 | 73.65 | 70.18 | 77.18 |
| 51 | Carnivora | 52.25 | 37.43 | 66.15 | 52.61 | 45.22 | 59.70 |
| 52 | Euungulata | 81.43 | 62.35 | 110.69 | 71.35 | 68.16 | 74.58 |
| 53 | Artiodactyla | 63.37 | 55.61 | 69.78 | 61.40 | 59.38 | 63.40 |
| 54 | Pig/Cow | 59.83 | 53.00 | 66.50 | 58.62 | 56.92 | 60.23 |
| 55 | Dolphin/Cow | 56.60 | 52.21 | 63.53 | 52.98 | 52.10 | 54.26 |
| 56 | Euarchontoglires | 119.78 | 84.01 | 155.50 | 76.69 | 72.63 | 80.60 |
| 57 | Tree Shrew/Glires | 96.61 | 60.66 | 136.15 | 74.93 | 71.26 | 78.77 |
| 58 | Glires | 78.48 | 56.43 | 115.21 | 70.34 | 67.22 | 73.53 |
| 59 | Lagomorpha | 56.43 | 47.29 | 65.59 | 48.57 | 46.13 | 51.65 |
| 60 | Rodentia | 60.89 | 55.93 | 65.84 | 61.97 | 60.07 | 63.78 |
| 61 | Guinea Pig/Rat | 53.92 | 47.79 | 59.20 | 58.76 | 56.91 | 60.28 |
| 62 | Kangarro Rat/Rat | 36.64 | 11.75 | 56.26 | 51.91 | 49.14 | 54.13 |
| 63 | Muridae | 12.18 | 10.40 | 13.98 | 12.21 | 10.46 | 13.98 |
| 64 | Primates | 93.18 | 57.25 | 130.31 | 69.27 | 65.64 | 72.96 |
| 65 | Strepsirrhini | 44.92 | 34.04 | 55.71 | 52.77 | 47.91 | 56.69 |
| 66 | Haplorrhini | 68.64 | 39.642 | 108.87 | 64.96 | 61.18 | 68.72 |
| 67 | Anthropoidea | 50.22 | 33.85 | 77.91 | 38.35 | 33.95 | 42.77 |
| 68 | Catarrhini | 29.49 | 24.53 | 33.97 | 26.84 | 24.11 | 30.32 |
| 69 | Hominidae | 19.04 | 11.59 | 28.17 | 17.85 | 15.40 | 20.52 |
| 70 | Homininae | 13.06 | 6.85 | 21.03 | 10.12 | 8.53 | 11.45 |
| 71 | Hominini | 8.26 | 6.51 | 10.01 | 8.94 | 7.51 | 10.10 |
aThe marginal prior was constructed for each node using either the fossil calibrations or from a birth–death process if no calibration was available.
bPosterior time estimates for each node based upon the calibrations and the 14k gene data set.