| Literature DB >> 21978175 |
Satish Kumar1, Claire Bellis, Mark Zlojutro, Phillip E Melton, John Blangero, Joanne E Curran.
Abstract
BACKGROUND: The Asian origin of Native Americans is largely accepted. However uncertainties persist regarding the source population(s) within Asia, the divergence and arrival time(s) of the founder groups, the number of expansion events, and migration routes into the New World. mtDNA data, presented over the past two decades, have been used to suggest a single-migration model for which the Beringian land mass plays an important role.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21978175 PMCID: PMC3217880 DOI: 10.1186/1471-2148-11-293
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Schematic representation of mtDNA phylogenetic tree of Native American haplogroups A2 and B2 and immediate Siberian-Asian sister clades (A2a, A2b, A4a, A4b and A4c). Coalescent age calculated in thousand years (ky) as per the slow mutation rate of Mishmar et al. [58] and as per calibrated mutation rate of Soares et al. [59] are indicated in blue and red color respectively. The founder age wherever calculated are italicized. The geographical locations of the samples are identified with colors. For more details see complete phylogenetic reconstruction in additional file 2 (panels A-B) and additional file 3.
Figure 2Schematic representation of mtDNA phylogenetic tree of Native American haplogroups C1, C4c, D1, and D4e1c and their immediate Siberian-Asian sister clades (C1a, C4a, C4b, D2 and D4e1a). Coalescent age calculated in thousand years (ky) as per the slow mutation rate of Mishmar et al. [58] and as per calibrated mutation rate of Soares et al. [59] are indicated in blue and red color respectively. The founder age wherever calculated are italicized. The geographical locations of the samples are identified with colors. For more details see complete phylogenetic reconstruction in additional file 2 (panels C-D) and additional file 3.
Haplogroup coalescence time estimates
| All substitutions | Synonymous transitions | Synonymous substitutions | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Haplogroup | |||||||||||||
| 171 | 5.36 | 1.33 | 27.5 ± 6.8 | 2.86 | 0.94 | 19.4 ± 6.4 | 2.88 | 0.94 | 22.7 ± 7.4 | 8.2-37.2 | |||
| A2 | 158 | 3.48 | 0.24 | 17.9 ± 1.2 | 1.89 | 0.19 | 12.8 ± 1.3 | 1.92 | 0.19 | 15.1 ± 1.5 | 12.2-18.0 | ||
| 133 | 3.80 | 0.25 | 19.5 ± 1.3 | 2.03 | 0.19 | 13.7 ± 1.3 | 2.05 | 0.19 | 16.1 ± 1.5 | 13.2-19.1 | |||
| A4a | 6 | 1.83 | 0.83 | 9.4 ± 4.3 | 1.50 | 0.80 | 10.1 ± 5.4 | 1.50 | 0.80 | 11.8 ± 6.3 | 0-24.2 | ||
| A4b | 3 | 3.00 | 1.20 | 15.4 ± 6.2 | 0.66 | 0.47 | 4.5 ± 3.2 | 1.00 | 0.58 | 7.9 ± 4.6 | 0-16.8 | ||
| A4c | 4 | 4.50 | 1.06 | 23.1 ± 5.5 | 2.25 | 0.75 | 15.2 ± 5.1 | 2.25 | 0.75 | 17.7 ± 5.9 | 6.1-29.2 | ||
| 96 | 4.05 | 0.39 | 20.8 ± 2.0 | 2.21 | 0.31 | 14.9 ± 2.1 | 2.30 | 0.31 | 18.1 ± 2.4 | 13.3-22.9 | |||
| B2b | 8 | 4.50 | 0.94 | 23.1 ± 4.8 | 2.13 | 0.62 | 14.4 ± 4.2 | 2.13 | 0.62 | 16.8 ± 4.9 | 7.2-26.3 | ||
| 35 | 4.09 | 0.90 | 21.0 ± 4.6 | 2.54 | 0.81 | 17.2 ± 5.5 | 2.54 | 0.81 | 20.0 ± 6.4 | 7.5-32.5 | |||
| C4a | 12 | 5.00 | 1.44 | 25.7 ± 7.4 | 2.75 | 1.05 | 18.6 ± 7.1 | 2.75 | 1.05 | 21.6 ± 8.3 | 5.4-37.8 | ||
| C4b | 21 | 1.81 | 0.43 | 9.3 ± 2.2 | 0.86 | 0.36 | 5.8 ± 2.4 | 0.86 | 0.36 | 6.8 ± 2.8 | 1.2-12.3 | ||
| C4c** | 2 | 4.00 | 1.41 | 20.6 ± 7.3 | 1.50 | 0.87 | 10.1 ± 5.9 | 1.50 | 0.87 | 11.8 ± 6.8 | 0-25.2 | ||
| 160 | 4.20 | 0.28 | 21.6 ± 2.7 | 2.056 | 0.19 | 13.9 ± 1.3 | 2.08 | 0.19 | 16.4 ± 1.5 | 13.5-19.3 | |||
| C1a | 4 | 1.50 | 0.61 | 7.7 ± 3.2 | 0.25 | 0.25 | 1.7 ± 1.7 | 0.25 | 0.25 | 1.9 ± 1.9 | 0-5.8 | ||
| C1b | 47 | 3.49 | 0.44 | 17.9 ± 2.3 | 2.34 | 0.39 | 15.8 ± 2.6 | 2.34 | 0.39 | 18.4 ± 3.1 | 12.4-24.4 | ||
| C1c | 28 | 4.32 | 0.65 | 22.2 ± 3.3 | 2.64 | 0.54 | 17.9 ± 3.7 | 2.71 | 0.55 | 21.3 ± 4.3 | 12.8-29.8 | ||
| C1d | 81 | 3.91 | 0.86 | 20.1 ± 4.4 | 1.70 | 0.21 | 11.5 ± 1.4 | 1.72 | 0.21 | 13.5 ± 1.7 | 10.2-16.8 | ||
| 53 | 4.70 | 1.76 | 24.1 ± 9.0 | 2.28 | 1.27 | 15.4 ± 8.6 | 2.28 | 1.27 | 17.9 ± 10 | 0-37.5 | |||
| D2 | 50 | 2.84 | 1.23 | 14.6 ± 6.3 | 1.38 | 0.90 | 9.3 ± 6.1 | 1.38 | 0.90 | 10.9 ± 7.1 | 0-24.8 | ||
| D4e1c** | 2 | 0.50 | 0.50 | 2.6 ± 2.6 | - | - | - | - | - | - | |||
| 49 | 3.35 | 0.39 | 17.2 ± 2.0 | 1.84 | 0.28 | 12.4 ± 1.9 | 1.90 | 0.28 | 14.9 ± 2.2 | 10.6-19.3 | |||
| D1f | 4 | 3.75 | 1.15 | 19.3 ± 5.9 | 1.75 | 0.75 | 11.8 ± 5.1 | 1.75 | 0.75 | 13.8 ± 5.9 | 2.2-25.3 | ||
a mutation rate: one base substitution in the coding region per 5, 140 years [58].
b mutation rate: one synonymous transition per 6, 764 year [76].
c Calculated using as per calibrated mutation rate of Soares et al. [59].
d 95% confidence interval: negative values are zeroed.
* Excluding branches A2a and A2b.
** Only two sequences hence coalescent age not reliable.
Diversity Indices and Neutrality tests for geographical region Beringia, North America, Central America and South America based on mtDNA coding region (np 577 - 16023) sequences.
| Population/Geographic Region | Nucleotide Diversity (SD)%c | Tejima's | Fu's | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Beringia | 66 | 0.0959 (0.0482) | 14.198 (8.4 - 23.4) | 11.136 (3.241) | 14.811 (7.441) | 1.021 | 0.892 | 0.448 | 0.645 |
| North America | 73 | 0.1747 (0.0858) | 1338.057 (469.4 - 4380.0) | 56.465 (14.649) | 27.006 (13.258) | -2.042 | 0.004 | -24.041 | 0.000 |
| Central America | 139 | 0.1735 (0.0846) | 917.489 (532.7 - 1664.5) | 70.373 (16.098) | 26.820 (13.079) | -2.195 | 0.000 | -23.787 | 0.004 |
| South America | 75 | 0.1834 (0.0899) | 413.514 (203.6 - 908.3) | 52.168 (13.563) | 28.326 (13.890) | -1.847 | 0.011 | -24.048 | 0.001 |
| North America | 29 | 0.0615 (0.0323) | 415.445 (108.5 - 1745.1) | 24.485 (7.837) | 9.494 (4.986) | -2.569 | 0.000 | -21.684 | 0.000 |
| Central America | 86 | 0.0495 (0.0258) | 465.938 (239.7 - 972.2) | 40.591 (10.376) | 7.651 (3.991) | -2.816 | 0.000 | -24.854 | 0.000 |
| South America | 17 | 0.0602 (0.0326) | 141.445 (36.0 - 601.3) | 21.001 (7.691) | 9.294 (5.029) | -2.376 | 0.001 | -9.819 | 0.001 |
| North America | 18 | 0.0668 (0.0358) | 158.779 (40.6 - 673.9) | 20.351 (7.364) | 10.320 (5.525) | -2.471 | 0.000 | -10.079 | 0.001 |
| Central America | 55 | 0.0539 (0.0281) | 261.236 (120.8 - 614.9) | 28.850 (8.115) | 8.330 (4.346) | -2.598 | 0.000 | -24.826 | 0.000 |
| B2 South America | 22 | 0.0586 (0.0313) | 216.779 (55.9 - 916.5) | 18.928 (6.565) | 9.048 (4.832) | -2.415 | 0.000 | -12.381 | 0.000 |
| North America | 14 | 0.0487 (0.0271) | 82.112 (20.6 - 352.2) | 13.836 (5.432) | 7.516 (4.193) | -1.994 | 0.009 | -5.519 | 0.005 |
| Central America | 61 | 0.0501 (0.0262) | 221.898 (112.1 - 468.9) | 27.351 (7.564) | 7.741 (4.054) | -2.510 | 0.000 | -24.908 | 0.000 |
| South America | 17 | 0.0527 (0.0288) | 57.228 (18.5 - 196.2) | 14.790 (5.520) | 8.147 (4.451) | -1.908 | 0.017 | -5.787 | 0.012 |
| North America | 12 | 0.0363 (0.0211) | 69.779 (17.4 - 300.2) | 10.596 (4.402) | 5.606 (3.259) | -2.125 | 0.001 | -7.565 | 0.000 |
| Central America | 17 | 0.0425 (0.0237) | 141.445 (36.0 - 601.3) | 12.719 (4.796) | 6.574 (3.657) | -2.235 | 0.002 | -12.457 | 0.000 |
| South America | 19 | 0.0491 (0.0268) | 45.013 (16.5 - 133.6) | 12.017 (4.436) | 7.591 (4.141) | -1.488 | 0.047 | -6.217 | 0.004 |
Randomly selected from total 219 sequences to overcome computational limitation.
Sample size.
Nucleotide diversity (standard deviation) multiplied with 100.
Significant P-values < 0.05 (for Tajima's D) and < 0.02 (for Fu's Fs).
Samples belonging to sub haplogroups (A2a, A2b and D2) in Chukchi, Eskimos, and Aleuts populations from Siberia and Commander Island.
Samples belonging to pan American haplogroups (A2, B2, C1 and D1) only
SD: Standard deviation
Figure 3Bayesian skyline plot showing Effective population size (. The thick solid line is the median estimate and the thin lines show the 95% highest posterior density limits. In this analysis N, where Ne is the population of breeding females and τ is the generation length is plotted on Y axis. Time estimates obtained using a log-normal relaxed clock with the standard substitution rate of 1.26 × 10-8 sites per year [58] are plotted on X axis and limited to < 45 kya. Panel 'A' - The Bayesian skyline plot (m = 10) for North America (n = 75). Panel 'B' - The Bayesian skyline plot (m = 10) for Central America (n = 151, randomly selected from total 215 sequences). Panel 'C' - The Bayesian skyline plot (m = 10) for South America (n = 76). Panel 'D' - The Bayesian skyline plot (m = 10) for Beringia (n = 66).
Figure 4Schematic presentation of the average divergence of the American clades from Siberian-Asian sister clades and their dispersal into the America. Horizontal bars indicate the coalescent age estimates (at the center) ± Standard errors for each haplogroup shown on the Y axis. The average divergence time and the average time of the peopling of America across the haplogroups are also superimposed. X axis shows the time in thousand years ago. The coalescent estimates were calculated by Rho (ρ) statistics and three different mutation rates (i) panel 'A': one base substitution (i.e. one mutation other than indel) in the coding region (577 - 16023) per 5, 140 years [58], (ii) panel 'B': calibrated mutation rate of Soares et al. [59] based on all synonymous substitutions, (iii) panel 'C': one synonymous transition per 6, 764 year [76]