| Literature DB >> 21203580 |
Nadin Rohland1, David Reich, Swapan Mallick, Matthias Meyer, Richard E Green, Nicholas J Georgiadis, Alfred L Roca, Michael Hofreiter.
Abstract
To elucidate the history of living and extinct elephantids, we generated 39,763 bp of aligned nuclear DNA sequence across 375 loci for African savanna elephant, African forest elephant, Asian elephant, the extinct American mastodon, and the woolly mammoth. Our data establish that the Asian elephant is the closest living relative of the extinct mammoth in the nuclear genome, extending previous findings from mitochondrial DNA analyses. We also find that savanna and forest elephants, which some have argued are the same species, are as or more divergent in the nuclear genome as mammoths and Asian elephants, which are considered to be distinct genera, thus resolving a long-standing debate about the appropriate taxonomic classification of the African elephants. Finally, we document a much larger effective population size in forest elephants compared with the other elephantid taxa, likely reflecting species differences in ancient geographic structure and range and differences in life history traits such as variance in male reproductive success.Entities:
Mesh:
Year: 2010 PMID: 21203580 PMCID: PMC3006346 DOI: 10.1371/journal.pbio.1000564
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Strategy for obtaining overlapping DNA from four elephantids and a mastodon.
(a) Mastodon shotgun 454 sequencing. We ligated 454-adaptors (green and blue) to the ends of the DNA molecules and sequenced the libraries on a Roche 454 GS. (b) Bioinformatic analysis of shotgun 454 sequences. To identify proboscidean sequence, we compared the sequences to databases consisting of the savanna elephant draft genome (loxAfr1), the human genome (hg18), the mouse genome (mm8), NCBI's nucleotide database of environmental samples (env), and NCBI's non-redundant nucleotide database (nr). The 454 sequences with a best match to loxAfr1 (in red) were aligned to loxAfr1. Alignments of at least 90 bp in length and with a similarity higher than 87% were used for primer design after filtering out known repeat elements (using the UCSC RepeatMasker database). Primers were based on loxAfr1 sequence flanking the mastodon sequence. (c) Multiplex PCR and sequencing of the targeted loci in modern elephants and mammoth. We show the protocol for the first of four rounds of the project (Table S3 provides details of the further rounds). A total of 213 primer pairs were randomly divided into 5 multiplex primer mixes with 41–44 primer pairs per mix. These mixes were used for the first step of the two-step multiplex PCR approach, for each of the 5 samples (La, Loxodonta africana; Lc, L. cyclotis; Em 1, Elephas maximus 1; Em 2, E. maximus 2; Mp, Mammuthus primigenius). Dilutions of these products were used as templates to amplify the loci individually in the second step (shown for L. africana), resulting in 213 distinct products per sample. These products were quantified, normalized, and merged into one pool per sample. A 454 library was prepared and sequenced on 1/16th of a picotiter plate of a Roche 454 GS.
Genetic divergence and heterozygosity estimates for the elephantids.
| First Taxon | Second Taxon | Genetic Divergence (Heterozygosity If Within Taxa) Normalized by Savanna-Asian Genetic Divergence | ±1 Standard Deviation | Rate Test for More Substitutions in One Taxon Than the Other |
|
| ||||
| Savanna | Forest | 74% | 6% |
|
| Savanna | Mammoth | 92% | 5% |
|
| Savanna | Asian | 100% | n/a |
|
| Forest | Mammoth | 96% | 7% |
|
| Forest | Asian | 103% | 5% |
|
| Mammoth | Asian | 65% | 5% |
|
|
| ||||
| Savanna | Savanna | 8% | 2% | n/a |
| Forest | Forest | 30% | 4% | n/a |
| Mammoth | Mammoth | 9% | 2% | n/a |
| Asian | Asian | 15% | 3% | n/a |
We calculated genetic divergences based on 549 sites that are polymorphic among the elephantids, normalizing by savanna-Asian elephant genetic divergence. Standard errors are from a Weighted Jackknife (calculated in this way, savanna-Asian genetic divergence has no uncertainty since it is used for normalization). The results show that savanna and forest elephants are sister groups (>4 standard deviations less diverged than savanna-Asian) and that Asian elephants and mammoths are also sister groups (>6 standard deviations less diverged than savanna-Asian).
Incomplete lineage sorting: More deeply coalescing lineages between forest-savanna than Asian-mammoth.
| 4-taxon alignment: 1-2-3-Mastodon | 1 (only) | 2 (only) | 3 (only) | 12 (cluster) | 13 (cluster) | 23 (cluster) | Mastodon | Genetic Divergence of 1 & 2 (Divided by Savanna-Asian Divergence) | Rate of 13+23 Sites Suggesting ILS (Divided by Savanna-Asian Divergence) | ||
| Savanna-Forest-Eurasian-Mastodon | 84.8 | 89.7 | 124.3 | 39.0 | 15.4 | 12.1 | 1,257.2 | 74%±6% | 0.082±0.020 | ||
| Asian-Mammoth-Loxodonta-Mastodon | 91.5 | 74.8 | 121.2 | 55.6 | 4.4 | 7.3 | 1,264.8 | 65%±5% | 0.027±0.009 | ||
|
|
|
| |||||||||
The outgroup is the mastodon. To calculate the rate of any class of sites, we used the product of the relevant allele frequencies; for example, the expected rate of a “12” site where 1 and 2 share the derived allele to the exclusion of 3 is (f1)(f2)(1−f3). Values are summed over 1,775 sites. Standard errors are from a Weighted Jackknife.
*p values for the Asian-mammoth divergence being less than that of forest-savanna are based on a Weighted Jackknife. The Incomplete Lineage Sorting analysis in the last column, which is based on rare “13” and “23” divergent site classes, shows that there is a significantly higher probability of forest and savanna elephant alleles being unrelated all the way back to the time of their common ancestry with the Eurasian elephantids than is the case for Asian elephants and mammoth.
Figure 2Demographic model for the history of the Elephantidae.
Demographic model that is fit by MCMCcoal, in which all population splits are instantaneous (without subsequent gene flow), and all population sizes are assumed to be constant over intervals. Here, T FS refers to forest-savanna elephant population divergence time, T AM refers to Asian elephant-mammoth population divergence time, T Lox-Eur refers to African-Eurasian population divergence time, and T Elephantid-Mastodon refers to elephantid-mastodon population divergence time, presented here in millions of years. The Ν quantities refer to constant diploid effective population sizes ancestral to each of these splits (in thousands). For obtaining estimates of years and population sizes, we assume that the elephantids have an average of 31 years per generation, based on estimates of 17–20 years for females [53],[54] and 40–49 years for males [43],[55]. A lower or higher number of years per generation would produce a proportionate effect on the population size estimates. For each parameter, two sets of numbers are shown. The upper set shows the range consistent with the fossil record, calibrating to an assumed African-Eurasian population split of T Lox-Eur = 4.2–9 Mya (justified in Text S5). For example for forest-savanna population divergence, this leads to T FS = 2.6–5.6 Mya given that MCMCcoal estimates T FS /T Lox-Eur = 62%. The lower set of numbers (in parentheses) provides MCMCcoal's 90% credible interval for the parameters as a fraction of the best estimate (e.g. 76%–126% for T FS). In the main text, we conservatively quote a range that combines the uncertainty from the fossil record and from MCMCcoal (e.g. T FS = 1.9–7.1 Mya).
Estimates of demographic parameters from MCMCcoal.
| Quantity | Populations Analyzed | Estimate (90% Credible Interval) |
|
| ||
|
| Forest-Savanna | 0.00135 (0.00102, 0.00170) |
|
| Asian-Mammoth | 0.00131 (0.00104, 0.00161) |
|
| African-Eurasian | 0.00220 (0.00193, 0.00248) |
|
| ||
|
| Forest (current) | 0.00238 (0.00185, 0.00298) |
|
| Savanna (current) | 0.00068 (0.00050, 0.00089) |
|
| Asian (current) | 0.00113 (0.00087, 0.00142) |
|
| Mammoth (current) | 0.00093 (0.00070, 0.00119) |
|
| Asian-Mammoth (ancestral) | 0.00181 (0.00107, 0.00265) |
|
| Forest-Savanna (ancestral) | 0.00263 (0.00152, 0.00384) |
|
| African-Eurasian (ancestral) | 0.00259 (0.00200, 0.00323) |
All estimates are from MCMCcoal and are scaled in coalescent units; that is, a demographic parameter times a mutation rate. Abbreviations: F, forest elephant; S, savanna elephant; A, Asian elephant; M, woolly mammoth; Lox, Loxodonta (African forest and savanna elephant); Eur, “Eurasian” (Asian elephant and mammoth).
Relative values of population divergence times estimated by MCMCcoal.
|
|
|
| |
|
| 1 | 0.96 (0.69–1.36) | 1.67 (1.27–2.21) |
|
| 1.05 (0.73–1.42) | 1 | 1.70 (1.36–2.15) |
|
| 0.62 (0.45–0.79) | 0.60 (0.46–0.74) | 1 |
All quantities are expressed as a column-to-row ratio with a 90% credible interval. For example, the ratio of forest-savanna population split time T FS in years to African-Eurasian split time T Lox-Eur is Τ FS/Τ Lox-Eur = 0.62 (0.45–0.79). We do not include the T Elephantid-Mastodon parameter ancestral to elephant-mastodon divergence as it is not estimated in a stable way. Abbreviations: F, forest elephant; S, savanna elephant; A, Asian elephant; M, woolly mammoth; Lox, Loxodonta (African forest plus savanna elephant); Eur, “Eurasian” (Asian elephant plus mammoth).
Relative values of effective population sizes estimated by MCMCcoal.
|
|
|
|
|
|
|
| |
|
| 1 | 0.29 (0.20–0.40) | 0.48 (0.34–0.67) | 0.40 (0.27–0.55) | 1.15 (0.57–1.89) | 0.78 (0.44–1.22) | 1.11 (0.78–1.52) |
|
| 3.58 (2.53–4.92) | 1 | 1.71 (1.15–2.42) | 1.40 (0.93–2.01) | 4.03 (2.01–6.69) | 2.74 (1.49–4.33) | 3.92 (2.65–5.54) |
|
| 2.15 (1.50–2.96) | 0.62 (0.41–0.87) | 1 | 0.84 (0.59–1.14) | 2.38 (1.26–3.65) | 1.65 (0.87–2.69) | 2.34 (1.63–3.22) |
|
| 2.63 (1.80–3.66) | 0.75 (0.50–1.07) | 1.24 (0.88–1.68) | 1 | 2.91 (1.57–4.54) | 2.02 (1.05–3.30) | 2.86 (1.96–4.00) |
|
| 1.00 (0.53–1.75) | 0.28 (0.15–0.50) | 0.47 (0.27–0.77) | 0.38 (0.22–0.64) | 1 | 0.74 (0.38–1.26) | 1.08 (0.59–1.89) |
|
| 1.42 (0.83–2.31) | 0.41 (0.23–0.67) | 0.68 (0.37–1.15) | 0.56 (0.30–0.95) | 1.55 (0.80–2.62) | 1 | 1.56 (0.86–2.66) |
|
| 0.94 (0.66–1.28) | 0.27 (0.18–0.38) | 0.45 (0.31–0.61) | 0.37 (0.25–0.51) | 1.05 (0.53–1.71) | 0.72 (0.38–1.16) | 1 |
All quantities are expressed as a column-to-row ratio with a 90% credible interval. For example, the ratio of effective population sizes of forest to savanna populations = N F/N S is 3.58 (2.53–4.92). We do not include the N Elephantid-Mastodon parameter ancestral to elephant-mastodon divergence as it is not estimated in a stable way. Abbreviations: F, forest elephant; S, savanna elephant; A, Asian elephant; M, woolly mammoth; Lox, Loxodonta (African forest plus savanna elephant); Eur, “Eurasian” (Asian elephant plus mammoth).