Literature DB >> 19430604

Mammoth and elephant phylogenetic relationships: Mammut americanum, the missing outgroup.

Ludovic Orlando1, Catherine Hänni, Christophe J Douady.   

Abstract

At the morphological level, the woolly mammoth has most often been considered as the sister-species of Asian elephants, but at the DNA level, different studies have found support for proximity with African elephants. Recent reports have increased the available sequence data and apparently solved the discrepancy, finding mammoths to be most closely related to Asian elephants. However, we demonstrate here that the three competing topologies have similar likelihood, bayesian and parsimony supports. The analysis further suggests the inadequacy of using Sirenia or Hyracoidea as outgroups. We therefore argue that orthologous sequences from the extinct American mastodon will be required to definitively solve this long-standing question.

Entities:  

Keywords:  Ancient DNA; Elephantidae; Mammoth; Phylogeny

Year:  2007        PMID: 19430604      PMCID: PMC2674638     

Source DB:  PubMed          Journal:  Evol Bioinform Online        ISSN: 1176-9343            Impact factor:   1.625


Introduction

The mammoth lineage offers one of the most complete palaeontological records among vertebrates (Lister and Sher, 2001). Large sampling and dating evidence have contributed to set morphological adaptive changes in a precise geographical and temporal framework (Lister et al. 2005). The elephant and mammoth lineages probably diverged some 4–6 million years ago (MYA) in Africa (Shoshani and Tassy, 2005; Todd, 2006), but it is around 3 MYA that mammoths spread across the temperate and wooded habitats from Europe to China (Lister et al. 2005). Populations from China and Northern Siberia, adapted to cold and steppe conditions, progressively supplanted older forms (Lister and Sher, 2001). By 200 KYA, the woolly mammoth stage (Mammuthus primigenius) was reached in Northern Siberia and started to spread westwards to Europe. It spread later eastwards across the Beringia into Northern America, where descendants of ancestral forms adapted to the temperate grasslands already lived, Mammuthus columbi as well as pigmy mammoths (Mammuthus exilis) (Agenbroad, 2005). The cooling from the end of the Last Ice Age considerably restricted their habitat and precipitated their extinction; at the beginning of the Holocene, mammoths only survived in small refugial islands from the Arctic and Bering Sea and by 3.7 KYA the very last specimen disappeared (Vartanyan et al. 1993; Guthrie et al. 2004; Stuart et al. 2004). Several points in this impressively well documented model are still debated though. Among these, the tempo and mode for mammoth extinction, especially with regards to possible overkilling by hunters, is perhaps the most controversial issue (Agenbroad, 2005; Stuart, 2005). But the question of the origin of the mammoth lineage has also received much attention in the last decade. Palaeontologists have long found support in morphological characters for a sister group relationship between mammoths and Asian elephants (Elephas), rather than African elephants (Loxodonta) (Maglio, 1973). This model has received additional support from the analysis of new characters, such as the hyoid apparatus (an association of nine bones connected to the cranium, the tongue and the larynx) (Shoshani and Marchant, 2001). Surprisingly, the very first mammoth DNA sequence exhibited minimum genetic distance with extant African but not Asian elephants (Hagelberg et al. 1994; Table 1). Some larger sequence datasets grouped mammoth and Elephas (Yang et al. 1996; Ozawa et al. 1997; Table 1). But reanalysis of these data found again a grouping of Mammuthus and Loxodonta. These results called into question (i) the validity of morphological synapomorphies between mammoths and Asian elephants (Noro et al. 1998; Thomas et al. 2000; Debruyne et al. 2003), and (ii) the authenticity of some of the previously reported sequences (Yang et al. 1996; Thomas et al. 2000; Debruyne et al. 2003). Most recently, partial nuclear gene sequences (Capelli et al. 2006) and complete mitochondrial genomes (Krause et al. 2006; Rogaev et al. 2006) have revived the debate, showing support for the (Mammuthus, Elephas) clade. Despite some claims to the contrary, Mammuthus affinities are still far from being conclusively settled, with different topologies supported by Rogaev et al. (2006), Krause et al. (2006), and Capelli et al. (2006; summarized in Table 1). In this study, we evaluate for the first time the phylogenetic signal contained in all the data, by combining all the available nuclear and mitochondrial genes.
Table 1.

Molecular support for unravelling intra-Elephantidae phylogenetic relationships.

ReferenceGeneLength (nt)SitesTopologyaType of analysisBootstrap (%) / Posterior ProbabilityRoota
Hagelberg et al. 1994Cyt b278all(M,L),EDistance--
Höss et al. 199416S rDNA93alluninf.Distance--
Yang et al. 1996Cyt b228all(M,E),LMP74Ma
Ozawa et al. 1997Cyt b6701+2(M,E),LNJ, MP72, 72Dd, Hg, Tm, Pc
Cyt b330aa(M,E),LNJ, MP90, 91Dd, Hg, Tm, Pc
Noro et al. 1998Cyt b1137allb(M,L),ENJ, MP92, 73Dd, Db, Eg, Bt
12S rDNA961all(M,L),ENJ, MP55, 81Dd, Tm, Pc, Ddo, Db, Eg, Bt
Greenwood et al. 1999Cyt b305all(M,L),EDistance--
16S rDNA94all(M,E),LDistance--
28S rDNA138all(M,E),LDistance--
IRBP43all(M,E),LDistance--
A2AB57alluninf.Distance--
vWF114alluninf.Distance--
Thomas et al. 2000Cyt b255all(M,L),EMP, ML84, 67cDd
Cyt b453all(M,L),EML, BI67, 0.43dMidpoint rooting
Debruyne et al. 2003Cyt b228call(M,L),ENJ, MP<50, 25Ma
Cyt b561all(M,L),ENJ, MP63, 88Dd, Hg, Tm, Pc
Cyt b561all(M,E),LMLnot provideddDd, Hg, Tm, Pc
Capelli et al. 20065 nuclears701all(E,L),MNJ, MP, ML70, 61, 77dPc
5 nuclears677all(M,E),LNJ, MP, ML100, 100, 100Midpoint rooting
Krause et al. 2006mt genome16770all(M,L),ENJ, ML, BI73, 56, 0.97Dd
mt genome16770all(M,E),LMP62Dd
mt genome16770all(M,E),LNJ, MP, ML, BI83, 93, 79, 0.91Pc
mt genome16770all(M,E),LNJ, MP, BI87, 90, 1.0Dd, Pc
mt genome16770all(M,L),EML54Dd, Pc
mt genome16770all(M,E),LML, BIe97, 0.998Midpoint rooting
Rogaev et al. 2006mt genome16842all(M,E),LfMP, ML, BI95, 8, 0.88–1.0gDd, Pc
12S rDNA962all(M,E),LBI0.72Dd, Pc
ATP6669all(M,E),LBI0.68Dd, Pc
COX11551all(M,E),LBI0.90Dd, Pc
COX3784all(M,E),LBI0.92Dd, Pc
Cyt b1137all(M,E),LBI0.76Dd, Pc
ND1957all(M,E),LBI0.85Dd, Pc
ND4L297all(M,E),LBI0.99Dd, Pc
ND6528all(M,E),LBI0.96Dd, Pc
COX2684all(M,L),EBI0.56Dd, Pc
ND3346all(M,L),EBI0.92Dd, Pc
ND51812all(M,L),EBI0.89Dd, Pc
16S rDNA1566all(E,L),MBI0.88Dd, Pc
ND21044all(E,L),MBI0.94Dd, Pc
ND41368all(E,L),MBI1.0Dd, Pc
ATP8201alluninf.BI-Dd, Pc

M: Mammuthus primigenius; E: Elephas maximus; L Loxodonta africana; Ma: Mammut americanum; Dd: Dugong dugon; Tm: Trichechus manatus; Pc: Procavia capensis; Hg: Hydrodamalis gigas; Db: Diceros bicornis; Eg: Equus grevyi; Bt: Bos taurus; Ddo: Dendrohyrax dorsalis.

but (M,E),L when considering only transversions or translated sequences

casts doubt on the 228-nt sequence reported in Yang et al. 1996

support for this topology but similar likelihood / probability for alternative topologies

trifurcation rejected based on parsimony statistics

significant ML ratio test

two different models were used for BI inference

Material and Methods

Data construction

Mitochondrial sequences were retrieved from Genbank and manually aligned using the Seaview software (Galtier et al. 1996). An alignment of the nuclear sequences available for elephantids was kindly provided by A.D. Greenwood. The complete data sets as a whole and 16 different partitions were further analyzed (Table 2).
Table 2.

Likelihood values and number of substitutions in favor of each topology.

GeneLength (nt)Sites(M,E),L-Ln L(E,L),MNumber of synapomorphies
(M,L),E(M,E)(M,L)(E,L)
AllmtDNA+nucDNA (n = 48)17917all56081.7556081.1956082.85181145153
mtDNA (n = 38)17072all54272.1554272.0954273.97176144150
nucDNA (n = 10)845all1715.071715.481714.53513
ProteicmtDNA+nucDNA (n = 18)11699all38164.7938165.5638166.03144121127
mtDNA (n = 13)11396all37446.0637447.5237447.99141121126
nucDNA (n = 5)303all623.53623.57623.57301
mtDNA+nucDNA (n = 18)78211+219861.7519859.5019857.60323038
mtDNA (n = 13)76191+219490.5819488.6119486.74313038
nucDNA (n = 5)2021+2346.94346.94346.94100
mtDNA+nucDNA (n = 18)392028102.648104.738104.96965
mtDNA (n = 13)381927928.227930.317930.51865
nucDNA (n = 5)1012163.90163.90163.90100
RibosomalmtDNA (n = 2)2526all6613.816613.426613.7717k4k6k
tRNAmtDNA (n = 22)1522all4112.114110.834114.72a11115
Non codingmtDNA+nucDNA2224all6861.466861.466859.509915
mtDNA1682all5736.275736.275734.487813
nucDNA542all1073.181074.231072.65212

indicates topologies that are significantly worst than the most likely alternative based on AU test.

indicates a significantly different number of synapomorphy. Mp, Em, La, Dd, Pc stand respectively for Mammoth, Asian and African elephants, Dugong and Hyrax accession numbers: Complete mtDNA = DQ316067(Mp) DQ188829(Mp) DQ316068(Em) LAAJ4821(La) DQ316069(La) AY075116(Dd) DDU421723(Dd) AB096865(Pc); BGN 5’ = DQ267154(Mp) DQ265809(Em) DQ265820(La) DQ265813(Pc); BGN 3’ = DQ265811(Mp) DQ265809(Em) DQ265820(La) DQ265813(Pc); CHRNA1 5’ = DQ267155(Mp) DQ265827(Em) DQ265838(La) DQ265831(Pc); CHRNA1 3’ = DQ267156(Mp) DQ265827(Em) DQ265838(La); GBA = DQ265846(Mp) DQ265844(Em) DQ265843(La) DQ265848(Pc); LEPR = DQ265868(Mp) DQ265866(Em) DQ265888(La) DQ265871(Pc); VWF 5’= AF154875(Mp) DQ265898(Em) DQ265919(La) DDU31608(Dd) DQ265902(Pc); VWF 3’ = consensus of AF154873/AF154874(Mp) DQ265898(Em) DQ265919(La) DDU31608(Dd) DQ265902(Pc); IRBP = AF155042(Mp) AY243443(Em) LAU48711(La) DDU48583(Dd) PCU48586(Pc); A2AB = AF154876(Mp), Y12525, (Em), AF154877(La), Y15947(Dd), Y12523(Pc).

Most likely topologies and number of synapomorphies

For all data sets and partition subsets we estimated the most likely topologies in favor of (Mammuthus, Elephas), (Mammuth, Loxodonta) and (Elephas, Loxodonta) clustering. All computations were done using PAUP (Swofford, 2002) and the best fitted model according to Akaike criterion (Akaike, 1974) as implemented in Modeltest (Posada and Crandall, 1998). Approximatively Unbiased (Shimodaira, 2002), Kishino-Hasegawa (1989) and Shimodaira-Hasegawa (1999) tests were done using CONSEL (Shimodaira and Hasegawa, 2001). The number of synapomorphies for each of these alternatives were all collected via direct pairwise comparisons and possible significant differences were evaluated using a Chi-square test.

Partitioned bayesian inferences and partition contents

In order to test for phylogenetic support in a partitioned Bayesian framework we analyzed two partition schemes using MRBAYES v3.1.2. Both analyses employed a GTR model of evolution assuming a fraction of invariant sites and a rate heterogeneity across sites. For each, two sets of four chains sampled every 100 generations were ran until the average standard deviation of split frequencies between the two set fell below the default critical value of 0.01 using a burn-in fraction of 25%. To ensure that consensus trees were based on a rather large collection of trees, average standard deviation of split frequencies were only evaluated every 100000 generations. Our first partition scheme was designed to ensure that sites under different evolutionary dynamic received independent evaluation. Thus we defined 10 partitions after one for each nuclear codon position, one for each mitochondrial codon positions, one for mitochondrial ribosomal RNA positions, one for mitochondrial transfer RNA positions, one for nuclear non coding positions and one for mitochondrial non coding positions. Our second scheme allowed independent estimation of model parameters for each gene or gene fragment. Mitochondrial non coding positions (to the exception of the D-loop positions) were all lump together, leading to a total of 46 partitions). These schemes are referred as P10 and P46 respectively. Finally the information content of the different genes were estimated using the hidden branch support approach described in Gatesy et al. (1999). These estimations were done both in a parsimony and likelihood framework using PAUP (Swofford, 2002).

Results/Discussion

We first started by computing the likelihood of the three alternative topologies under the best-fitting model of molecular evolution using different sets of sequences (Table 2: protein coding genes, rRNA genes, tRNA genes, whole mtDNA, and all mitochondrial and nuclear data merged). Strikingly, all topologies have almost identical likelihood values, resulting in largely non significant likelihood tests. Elephas-Mammuthus could be rejected only for the tRNA data partition (at p-value < 0.05 under an Approximatively Unbiased test; Table 2). Interestingly none of the tests performed on complete mitochondrial genome data sets was able to corroborate Rogaev et al. (2006) reports of significant support for the Mammuthus-Elephas clade. Noteworthy, Rogaev et al. (2006) did not provide any details on how likelihood ratio tests were performed to discriminate between alternative topologies (while this procedure is still unknown to most phylogeneticists; see Felsenstein, 2004 for a discussion of the inadequacy of likelihood ratio in testing alternative topologies). We then decided to count the total number of synapomorphies of the three possible pairs of taxa, using the state of Hyrax and Dugong sequences to polarize character changes (Table 2). Merging all data, none of the three pairs exhibits significant deviation from the mean number of synapomorphies (Chi-square, p-value = 0.774), suggesting similar parsimony support for the three alternatives. All but ribosomal RNA (p = 0.004) partitions of the data yielded non-significant deviations as well (0.158 < p-value < 0.819). Such a pattern might be indicative either of lineage-sorting effects among mammoths (one gene leading to a first phylogenetic signature whereas another one to the opposite), of differential parallel or convergent evolution in some genes, or of poor polarization of characters. Indeed, Hyrax and Dugong have diverged from the elephant lineage about 65 MYA. In theory, Bayesian analyses reported in Rogaev et al. (2006) could support the lineage-sorting hypothesis (Table 1) since different topologies were supported by different genes. Since mitochondria are mostly non-recombinant in animal (e.g. Barr, Neiman and Taylor, 2005), this explanation is unlikely. Furthermore, while the Mammuthus-Elephas relationship was supported by both partitioned Bayesian analyses (p = 0.931 and 0.751 for P46 and P10 respectively), the Mammuthus-Loxodonta and Elephas-Loxodonta (P10 only) alternatives were both present in the 95% credibility interval. Similarly, none of the three possible pairs of taxa exhibits significant differences in the number of synapomorphies (Figure 1). Moreover, whatever the framework or alternatives, hidden branch support is systematically detected in our data set. Such results are rather unexpected since alternatives are mutually incompatible. One possible explanation for the strong and incompatible Bayesian posterior probabilities reported by Rogaev et al. (2006) would imply a hard polytomy between Mammuthus-Loxodonta and Elephas. Lewis and co-workers (2005) have indeed convincingly demonstrated that standard MCMC procedure tends to become unpredictable, including strong shift from one to another alternative, when the true phylogeny is a hard or near hard polytomy (but see Kolaczkowski and Thornton, 2006). Alternatively, overall results could be due to inadequate polarization of the data. This finding is corroborated in littera since different rooting procedures (or phylogenetic methods) often come to opposite conclusions (Table 1).
Figure 1.

Absolute (Left) and relative (Right) number of synapomorphies for the ((E,L),M) (Black), ((M,L),E) (Grey) and ((M,E),L) (White) topologies. Each bar represents a given gene in the dataset.

In that context, one possibility would be to build unrooted trees and check the stability of molecular clock along the branches, to infer the most probable midpoint position of the root. This strategy has already been followed in several studies (Table 1) but leaves no possibility to test if the resulting topology is better than the alternatives. We thus consider tree rooting as a prerequisite before drawing any definite conclusion with regards to the phylogenetic relationships of mammoths and elephants and possible hard polytomy. The American mastodon (Mammut americanum) lineage would be very useful for that purpose since it diverged from the lineage of mammoth and elephants about 24 MYA, that is 40 MY later than the Hyracoidea (e.g. Hyrax)—Sirenia (e.g. Dugong) —Proboscidia (elephantids) split (Shoshani and Tassy, 1996). Moreover, the American mastodon (Mammut americanum) became extinct at the Late Glacial Maximum, and specimens from that time range are compatible with ancient DNA recovery (Gilbert et al. 2005). Such a mastodon DNA sequence (circa 10 KYA) was used in Yang et al. (1996), but several authors have raised concerns on the authenticity of the mammoth sequences recovered (Thomas et al. 2000; Debruyne et al. 2003). Finally, we should bear in mind that star trees are the null hypothesis in phylogenetic reconstructions. Thus if the adaptive radiation among elephantids occurred very rapidly (ca. 500 KY) as recently suggested (Krause et al. 2006), this near hard polytomy might require extensive data to be solved. One fruitful strategy would most probably be to take advantage of the ongoing Loxodonta genome project and of the 13 Mbp of the mammoth genome already published (Poinar et al. 2006). In silico analyses might identify most-informative candidate genes and enable to design primers in order to collect orthologous sequences in both Elephas and mastodons. New technological advances in two-round multiplex-PCR (Krause et al. 2006; Römpler et al. 2006) and large-scale sequencing (Poinar et al. 2006), combined with the exceptionally-well preserved mammoth specimens from permafrost, make this an exceptional model for the genomic study of speciation.
Table 3:

Hidden Branch Support.

Parsimony
Likelihood
(M,E),L(M,L),E(E,L),M(M,E),L(M,L),E(E,L),M
BSind2–45–40–10,92–12,05–15,64
BS28–32–28–0,560,56–1,65
HBS26131210,3712,613,98

∑BSind : sum of branch support (BS) scores for that node from each data partition. BS : difference in the number of character steps (or likelihood difference) between the best topology with and without that node. HBS (Hidden branch support) = BS-∑BSind. For further details see Gatesy et al. 1999.

  26 in total

1.  CONSEL: for assessing the confidence of phylogenetic tree selection.

Authors:  H Shimodaira; M Hasegawa
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

2.  An approximately unbiased test of phylogenetic tree selection.

Authors:  Hidetoshi Shimodaira
Journal:  Syst Biol       Date:  2002-06       Impact factor: 15.683

3.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  Assessing ancient DNA studies.

Authors:  M Thomas P Gilbert; Hans-Jürgen Bandelt; Michael Hofreiter; Ian Barnes
Journal:  Trends Ecol Evol       Date:  2005-07-21       Impact factor: 17.712

5.  Nuclear gene indicates coat-color polymorphism in mammoths.

Authors:  Holger Römpler; Nadin Rohland; Carles Lalueza-Fox; Eske Willerslev; Tatyana Kuznetsova; Gernot Rabeder; Jaume Bertranpetit; Torsten Schöneberg; Michael Hofreiter
Journal:  Science       Date:  2006-07-07       Impact factor: 47.728

6.  The origin and evolution of the woolly mammoth.

Authors:  A M Lister; A V Sher
Journal:  Science       Date:  2001-11-02       Impact factor: 47.728

7.  Nuclear DNA sequences from late Pleistocene megafauna.

Authors:  A D Greenwood; C Capelli; G Possnert; S Pääbo
Journal:  Mol Biol Evol       Date:  1999-11       Impact factor: 16.240

8.  Mitochondrial cytochrome b of the Lyakhov mammoth (Proboscidea, Mammalia): new data and phylogenetic analyses of Elephantidae.

Authors:  Régis Debruyne; Véronique Barriel; Pascal Tassy
Journal:  Mol Phylogenet Evol       Date:  2003-03       Impact factor: 4.286

9.  Pleistocene to Holocene extinction dynamics in giant deer and woolly mammoth.

Authors:  A J Stuart; P A Kosintsev; T F G Higham; A M Lister
Journal:  Nature       Date:  2004-10-07       Impact factor: 49.962

10.  Complete mitochondrial genome and phylogeny of Pleistocene mammoth Mammuthus primigenius.

Authors:  Evgeny I Rogaev; Yuri K Moliaka; Boris A Malyarchuk; Fyodor A Kondrashov; Miroslava V Derenko; Ilya Chumakov; Anastasia P Grigorenko
Journal:  PLoS Biol       Date:  2006-02-07       Impact factor: 8.029

View more
  2 in total

1.  Revising the recent evolutionary history of equids using ancient DNA.

Authors:  Ludovic Orlando; Jessica L Metcalf; Maria T Alberdi; Miguel Telles-Antunes; Dominique Bonjean; Marcel Otte; Fabiana Martin; Véra Eisenmann; Marjan Mashkour; Flavia Morello; Jose L Prado; Rodolfo Salas-Gismondi; Bruce J Shockey; Patrick J Wrinn; Sergei K Vasil'ev; Nikolai D Ovodov; Michael I Cherry; Blair Hopwood; Dean Male; Jeremy J Austin; Catherine Hänni; Alan Cooper
Journal:  Proc Natl Acad Sci U S A       Date:  2009-12-09       Impact factor: 11.205

2.  Revisiting traditional SSR based methodologies available for elephant genetic studies.

Authors:  M S L R P Marasinghe; R M R Nilanthi; H A B M Hathurusinghe; M G C Sooriyabandara; C H W M R B Chandrasekara; K A N C Jayawardana; M M Kodagoda; R C Rajapakse; P C G Bandaranayake
Journal:  Sci Rep       Date:  2021-04-22       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.