| Literature DB >> 26875094 |
Andreia Brandão1,2,3,4, Ken Khong Eng5,6, Teresa Rito1,7,8, Bruno Cavadas1,2, David Bulbeck9, Francesca Gandini3, Maria Pala3, Maru Mormina5,10, Bob Hudson11, Joyce White12, Tsang-Ming Ko13, Mokhtar Saidin6, Zainuddin Zafarina14,15, Stephen Oppenheimer16, Martin B Richards17,18, Luísa Pereira1,2,19, Pedro Soares1,2,5,20.
Abstract
There has been a long-standing debate concerning the extent to which the spread of Neolithic ceramics and Malay-Polynesian languages in Island Southeast Asia (ISEA) were coupled to an agriculturally driven demic dispersal out of Taiwan 4000 years ago (4 ka). We previously addressed this question using founder analysis of mitochondrial DNA (mtDNA) control-region sequences to identify major lineage clusters most likely to have dispersed from Taiwan into ISEA, proposing that the dispersal had a relatively minor impact on the extant genetic structure of ISEA, and that the role of agriculture in the expansion of the Austronesian languages was therefore likely to have been correspondingly minor. Here we test these conclusions by sequencing whole mtDNAs from across Taiwan and ISEA, using their higher chronological precision to resolve the overall proportion that participated in the "out-of-Taiwan" mid-Holocene dispersal as opposed to earlier, postglacial expansions in the Early Holocene. We show that, in total, about 20% of mtDNA lineages in the modern ISEA pool result from the "out-of-Taiwan" dispersal, with most of the remainder signifying earlier processes, mainly due to sea-level rises after the Last Glacial Maximum. Notably, we show that every one of these founder clusters previously entered Taiwan from China, 6-7 ka, where rice-farming originated, and remained distinct from the indigenous Taiwanese population until after the subsequent dispersal into ISEA.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26875094 PMCID: PMC4796337 DOI: 10.1007/s00439-016-1640-3
Source DB: PubMed Journal: Hum Genet ISSN: 0340-6717 Impact factor: 4.132
Fig. 1Schematic tree of the subclades most representative in SEA belonging to haplogroups B4b1, B4c1, B5b, D5, F1a4, F3, N9a, R9b, R9c and Y2. The higher-frequency lineages B4a1a, E1, E2 are not shown in the figure, since they were analysed previously by Soares et al. (2016) Tree scaled using maximum likelihood and time-dependent molecular clock for whole-mtDNA genome (in ka). The shading represents the geographic distribution of the subclades. Details of age estimates are shown in Table 1
Age estimates using rho (ρ) and ML for major subclades in ISEA for haplogroups B4b1, B4c1, B5b, D5, F1a4, F3, N9a, R9b, R9c and Y2. Ages and 95 % confidence intervals (CI) in thousands of years
| mtDNA lineage |
| PAML | Rho | ||||
|---|---|---|---|---|---|---|---|
| Total | Synonymous | ||||||
| Age | 95 % confidence interval | Age | 95 % confidence interval | Age | 95 % confidence interval | ||
| B4b1 | 129 | 25,100 | [17,000–33,600] | 22,600 | [12,600–33,200] | 32,600 | [9400–55,900] |
| B4b1a2 | 89 | 9300 | [6800–11,800] | 8700 | [6400–11,000] | 9100 | [5300–12,900] |
| B4c1b2a | 56 | 14,500 | [6400–23,000] | 11,200 | [3800–19,000] | 15,500 | [0–31,400] |
| B4c1b2a2 | 53 | 8000 | [5600–10,500] | 5800 | [3500–8100] | 7700 | [1900–13,600] |
| B5b | 89 | 29,900 | [20,700–39,300] | 34,300 | [24,300–44,700] | 43,500 | [24,500–62,500] |
| B5b1 | 54 | 23,900 | [13,900–34,300] | 27,000 | [15,800–38800] | 44,700 | [19,200–70,300] |
| B5b1a | 15 | 19,000 | [8,800–29,800] | 20,500 | [9300–32,400] | 12,600 | [0–25,800] |
| D5 | 174 | 33,300 | [24,600–42,200] | 34,500 | [23,000–46,500] | 35,600 | [15,500–55,800] |
| D5b1c1 | 15 | 9100 | [4000–14,400] | 12,300 | [4300–20,600] | 21,000 | [900–41,100] |
| D5b1c1a | 11 | 6000 | [0–13,800] | 7400 | [1200–13,700] | 8600 | [0–19,500] |
| D5b3 | 40 | 10,900 | [5600–16,400] | 9900 | [2400–17,700] | 17,700 | [0–37,100] |
| F1a4 | 26 | 16,300 | [7000–25,900] | 18,600 | [6500–31,400] | 11,500 | [0–26,100] |
| F1a4a | 25 | 11,700 | [3000–20,900] | 10,600 | [2600–19,000] | 11,700 | [0–26,800] |
| F1a4a1 | 23 | 4300 | [1800–6800] | 5200 | [1500–9000] | 3900 | [800–7100] |
| F1a4a1a | 15 | 3300 | [1300–5300] | 3500 | [1500–5500] | 3700 | [1000–6400] |
| F3 | 88 | 31,700 | [21,500–42,300] | 37,900 | [22,900–53,600] | 35,300 | [12,400–58,400] |
| F3a | 20 | 26,600 | [16,500–37,200] | 31,500 | [18,700–44,900] | 26,400 | [6000–46,800] |
| F3a1 | 16 | 16,600 | [9000–24,500] | 15,600 | [9300–22,100] | 13,300 | [1300–25,300] |
| F3b | 68 | 25,200 | [15,400–35,400] | 28,900 | [13,600–45,100] | 27,800 | [3500–52,200] |
| F3b1 | 65 | 12,400 | [5200–20,000] | 12,000 | [4700–19,700] | 12,100 | [0–25,100] |
| N9 | 254 | 50,600 | [37,100–64,600] | 38,500 | [27,700–49,600] | 36,100 | [20,700–51,400] |
| N9a | 127 | 20,000 | [14,500–25,500] | 17,500 | [13,000–22,100] | 18,400 | [10,900–25,900] |
| N9a10 | 18 | 16,600 | [11,000–22,400] | 14,600 | [8800–20,600] | 14,900 | [4600–25,200] |
| N9a10a | 9 | 10,000 | [4700–15,400] | 9100 | [4100–14,200] | 14,000 | [1600–26,400] |
| N9a10a1 | 6 | 6300 | [200–12,500] | 5200 | [1200–9300] | 6600 | [0–14,300] |
| N9a6 | 45 | 14,800 | [9900–19,800] | 12,700 | [7100–18,500] | 9600 | [2900–16,400] |
| R9b | 45 | 38,700 | [23,900–54,300] | 32,800 | [20,400–45,900] | 31,400 | [13,600–49,200] |
| R9b1a | 32 | 18,600 | [10,900–26,700] | 20,700 | [13,200–28,500] | 23,900 | [9800–38,100] |
| R9b1a1 | 18 | 11,600 | [6000–17,300] | 13,400 | [6600–20,300] | 9600 | [3800–15,500] |
| R9b2 | 4 | 5700 | [1300–10,200] | 5200 | [1600–8900] | 5900 | [0–12,600] |
| R9c1 | 45 | 28,500 | [17,200–40,300] | 25,200 | [13,200–37,800] | 33,100 | [9600–56,600] |
| R9c1a | 33 | 5900 | [3700–8300] | 5100 | [2600–7700] | 6900 | [400–13,500] |
| Y | 98 | 28,000 | [16,100–40,500] | 24,250 | [14,000–34,900] | 31,200 | [11,600–50,900] |
| Y2 | 50 | 9600 | [5000–14,400] | 9250 | [3100–15,600] | 8800 | [0–20,100] |
| Y2a | 43 | 6100 | [3200–9100] | 6700 | [2000–11,500] | 8400 | [0–21,400] |
| Y2a1 | 36 | 4100 | [2300–5900] | 4500 | [2400–6600] | 1500 | [400–2700] |
Peaks of population size through time as obtained from BSPs for all haplogroups examined in this study, and the overall data for ISEA
| Data set | Peak (ka) | Range of increment (ka) | Increment ratio |
|---|---|---|---|
| Haplogroup | |||
| N9 | 11.9 | 9.8–14.9 | 0.5 |
| 6.8 | 3.8–7.7 | 1.8 | |
| B4b1 | 6.1 | 4.9–7.0 | 2 |
| 0.8 | 0–2.0 | 6 | |
| B4c1 | 6.1 | 5.0–7.6 | 2.5 |
| Y2 | 3.1 | 2.2–4.2 | 1.1 |
| F3 | 7.4 | 5.2–10.3 | 0.1 |
| 2.5 | 4.1–1.2 | 1.1 | |
| R9b | 5.4 | 0.9–7.1 | 1.5 |
| B5b | 16.8 | 15.8–18.9 | 0.7 |
| 3.9 | 3.3–5.0 | 7.9 | |
| D5 | 7.7 | 3.5–13.1 | 2.1 |
| R9c | 3.6 | 2.1–5.2 | 1.7 |
| F1a4 | 2.4 | 1.7–4.0 | 0.3 |
| Region | |||
| ISEA | 9.0 | 8.2–10.6 | 10.1 |
| 2.6 | 2.3–4.1 | 76.9 | |
Fig. 2Analysis of maternal genetic flow into Taiwan. a Probabilistic distribution of founders from mainland Asia, assuming three migrations, using ƒ1 criterion; b scan of migration time into Taiwan from South China (orange line) and ISEA (black line). c Frequency distribution maps of Taiwan based on HVS-I data: c1 Pooled frequency of candidate postglacial mainland South China haplogroups (B5a2, B4a2, D5b3 and R9b1a2); c2 Pooled frequency of candidate Neolithic South China haplogroups (B4b1a2, F1a4, M7c3, Y2a, F4b, N9a10, M7b3a and M7b1d3); c3 Pooled frequency of candidate ISEA influx haplogroups (B4a1a, B5b1c, F3b1a, B4c1b2a2, E1 and E2). The map of Taiwan was adapted under the terms of the GNU Free Documentation License
Fig. 3Estimated contributions of Taiwanese “out-of-Taiwan” mtDNA lineages in the ISEA and Taiwanese aboriginals gene pool. The grey bar represents the overall frequency of those lineages in each population and the second bar represents the relative frequency of those haplogroups within each population
Fig. 4Outline of maternal lineages involved in the main human migrations in the region of Southeast Asia and Taiwan. Includes those discussed here and also those described previously in Soares et al. (2016), including B5a and F1a1a, which were inferred to have dispersed from MSEA with the Neolithic. Dark shading represents the modern coastlines and the extent of Sundaland at the LGM is represented by the light shading. The map was obtained from the website http://www.outline-world-map.com