| Literature DB >> 15339343 |
Mait Metspalu1, Toomas Kivisild, Ene Metspalu, Jüri Parik, Georgi Hudjashov, Katrin Kaldma, Piia Serk, Monika Karmin, Doron M Behar, M Thomas P Gilbert, Phillip Endicott, Sarabjit Mastana, Surinder S Papiha, Karl Skorecki, Antonio Torroni, Richard Villems.
Abstract
BACKGROUND: Recent advances in the understanding of the maternal and paternal heritage of south and southwest Asian populations have highlighted their role in the colonization of Eurasia by anatomically modern humans. Further understanding requires a deeper insight into the topology of the branches of the Indian mtDNA phylogenetic tree, which should be contextualized within the phylogeography of the neighboring regional mtDNA variation. Accordingly, we have analyzed mtDNA control and coding region variation in 796 Indian (including both tribal and caste populations from different parts of India) and 436 Iranian mtDNAs. The results were integrated and analyzed together with published data from South, Southeast Asia and West Eurasia.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15339343 PMCID: PMC516768 DOI: 10.1186/1471-2156-5-26
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Characteristics of the Indian and Iranian population samples whose mtDNA variation has been determined in the course of this study.
| State / Region | Socio-cultural affiliation | Linguistic affiliation | Population | Code | n | N | |
| West Bengal | Caste | Indo-European | Mixed caste people | Ben | 50 | 80 Million | |
| Uttar Pradesh | Tribe | Indo-European | Bhoksa | Bho | 5 | 32,000 | |
| Kerala | Caste | Indo-European | Mixed caste people from Cochin | Co | 55 | 600,000 | |
| Kerala | -a | Indo-European | Cochin Jews | CoJ | 45 | 5,000 | (in Israel) |
| Gujarat | Caste | Indo-European | Mixed caste people | Guj | 53 | 50 Million | |
| Himachal | Tribe | Tibeto-Burman | Kanet | Kan | 37 | 33,000 | |
| Maharashtra | Caste | Indo-European | Konkanastha Brahmin | Kon | 58 | N/A | |
| West Bengal | Caste | Indo-European | Kurmi | Kur | 55 | N/A | |
| West Bengal | Tribe | Austro-Asiatic | Lodha | Lod | 56 | 59,000 | |
| Sri Lanka | Caste | Dravidic | Moor | Mo | 50 | 3.4 Million | |
| Maharashtra | -a | Indo-European | Parsi | Par | 55 | 76,000 | |
| Punjab | Caste | Indo-European | Mixed caste people | Pun | 109 | 24 Million | |
| Rajasthan | Caste | Indo-European | Rajput | Raj | 35 | 5 Million | (in Rajasthan) |
| Sri Lanka | Caste | Indo-European | Sinhalese | Sin | 82 | 14.6 Million | |
| Uttar Pradesh | Tribe | Indo-European | Tharu | Tha | 26 | 96,000 | (in India) |
| Uttar Pradesh | Caste | Indo-European | Uttar Pradesh Brahmin | UPb | 25 | 2.5 Million | |
| total | 796 | ||||||
| northwest | 226 | ||||||
| southwest | 138 | ||||||
| northeast | 30 | ||||||
| southeast | 6 | ||||||
| central | 36 | ||||||
| total | Indo-European | Iranian | IR | 436 | ca. 68 Million | ||
a For some analyzes these populations where grouped with the caste populations. (see Table 1 in the Supplementary Material for the complete set of studied populations including those whose mtDNA variation was previously published)
Geographic, linguistic and socio-cultural distribution of major Indian-specific mtDNA haplogroups
| HAPLOGROUP FREQUENCY (95% CR FOR PROPORTION) | |||||||||||||||
| n | Mb | M2 | M3a | M4a | M6 | M18 | M25 | Rc | R5 | R6 | U2id | R2; U7; W | EEe | WEf | |
| India | 2544g | 65 (63.1–66.8) | 6.3 (5.4–7.3) | 2.2 (1.7–2.9) | 0.6 (.4–1) | 2.9 (2.3–3.6) | 2.1 (1.6–2.8) | 0.9 (.6–1.4) | 11.4 (10.2–12.7) | 2.2 (1.7–2.8) | 1.3 (.9–1.8) | 5.3 (4.5–6.2) | 3.4 (2.7–4.2) | 10.5 (9.4–11.8) | 12.1 (10.9–13.4) |
| Iran | 436 | 5.3 (3.6–7.8) | 0 (0–.8) | 0.5 (.1–1.6) | 0 (0–.8) | 0 (0–.8) | 0.2 (.1–1.3) | 0 (0–.8) | 3.2 (1.9–5.3) | 0.9 (.4–2.3) | 0 (0–.8) | 0.2 (.1–1.3) | 13.3 (10.4–16.8) | 3.9 (2.5–6.2) | 90.8 (87.7–93.2) |
| Caste population | 1204 | 57.8 (54.9–60.5) | 5.1 (4–6.5) | 4.2 (3.2–5.5) | 1.1 (.6–1.8) | 3 (2.2–4.1) | 1.4 (.9–2.2) | 1.4 (.9–2.2) | 14.1 (12.3–16.2) | 3.3 (2.5–4.5) | 1.7 (1.1–2.5) | 7.8 (6.4–9.5) | 4.8 (3.7–6.2) | 4 (3–5.2) | 17.1 (15.1–19.3) |
| Tribal population | 1332 | 71.5 (69–73.8) | 7.5 (6.2–9) | 0.5 (.2–1) | 0.2 (0–.5) | 2.7 (2–3.7) | 2.7 (2–3.7) | 0.5 (.2–1) | 8.9 (7.5–10.6) | 1.2 (.7–1.9) | 0.9 (.5–1.6) | 3 (2.2–4.1) | 2 (1.4–2.9) | 16.5 (14.6–18.6) | 7.5 (6.2–9) |
| Brahmins & Kshatriyas | 313 | 60.1 (54.5–65.3) | 1.9 (.9–4.1) | 5.1 (3.2–8.1) | 1 (.3–2.8) | 0.6 (.2–2.3) | 1 (.3–2.8) | 2.2 (1.1–4.5) | 14.7 (11.2–19.1) | 4.5 (2.7–7.4) | 1.3 (.5–3.2) | 6.4 (4.2–9.7) | 6.1 (3.9–9.3) | 3.5 (2–6.2) | 17.6 (13.8–22.2) |
| Other Castesh | 517 | 61.5 (57.2–65.6) | 6.2 (4.4–8.6) | 4.6 (3.1–6.8) | 1.2 (.5–2.5) | 2.9 (1.8–4.7) | 2.1 (1.2–3.8) | 0.8 (.3–2) | 12.6 (10–15.7) | 2.3 (1.3–4) | 1.5 (.8–3) | 7.4 (5.4–9.9) | 3.9 (2.5–5.9) | 1.7 (.1–1.4) | 16.6 (1.2–3.8) |
| Indo-European | 963 | 57.5 (54.4–60.6) | 3.8 (2.8–5.3) | 4.5 (3.3–6) | 1.0 (.6–1.9) | 2.7 (1.9–3.9) | 1.1 (.6–2) | 1.6 (1–2.6) | 7.8 (6.3–9.7) | 2.5 (1.7–3.7) | 1.6 (1–2.6) | 7.1 (5.6–8.9) | 6.9 (5.4–8.6) | 4.0 (3–5.5) | 19.9 (17.5–22.6) |
| Dravidic | 1063 | 70.0 (67.2–72.7) | 10.8 (9.1–12.8) | 1.2 (.7–2.1) | 0.5 (.2–1.1) | 4.0 (2.9–5.3) | 1.4 (.9–2.3) | 0.5 (.2–1.1) | 8.9 (7.4–10.8) | 2.7 (1.9–3.9) | 1.4 (.9–2.3) | 5.9 (4.7–7.5) | 1.2 (.7–2.1) | 3.0 (2.1–4.2) | 8.8 (7.3–10.7) |
| Tibeto-Burman | 249 | 59.8 (53.6–65.7) | 0 (0–1.5) | 0.4 (.1–2.2) | 0 (0–1.5) | 1.6 (.7–4) | 1.2 (.4–3.5) | 1.2 (.4–3.5) | 0.8 (.2–2.9) | 1.2 (.4–3.5) | 0 (0–1.5) | 0.8 (.2–2.9) | 2.0 (.9–4.6) | 61.4 (55.3–67.3) | 3.2 (1.7–6.2) |
| Austro-Asiatic (Mundari) | 90 | 86.7 (78.1–92.2) | 2.2 (.7–7.7) | 0 (0–4) | 0 (0–4) | 0 (0–4) | 26.7 (18.6–36.7) | 0 (0–4) | 4.4 (1.8–10.9) | 0 (0–4) | 1.1 (.3–6) | 1.1 (.3–6) | 1.1 (.3–6) | 0 (0–4) | 1.1 (.3–6) |
| Indo-European i | 250 | 66.4 (60.3–72) | 8.0 (5.3–12) | 0.4 (.1–2.2) | 0.8 (.2–2.8) | 4.8 (2.8–8.2) | 1.6 (.6–4) | 0.4 (.1–2.2) | 8.8 (5.9–13) | 2.0 (.9–4.6) | 1.6 (.6–4) | 6.4 (4–10.1) | 4.0 (2.2–7.2) | 1.2 (.4–3.5) | 13.6 (9.9–18.4) |
| Indo-European j | 713 | 54.4 (50.7–58) | 2.4 (1.5–3.8) | 5.9 (4.4–7.9) | 1.1 (.6–2.2) | 2.0 (1.2–3.3) | 1.0 (.5–2) | 2.0 (1.2–3.3) | 7.4 (5.7–9.6) | 2.7 (1.7–4.1) | 1.5 (.9–2.7) | 7.3 (5.6–9.4) | 7.9 (6.1–10.1) | 5.0 (3.7–6.9) | 22.2 (19.3–25.4) |
| Andhra Pradesh | 245 | 65.3 (59.1–71.0) | 7.7 (5–11.8) | 2.8 (1.4–5.7) | 1.6 (.6–4.1) | 3.3 (1.7–6.4) | 4.1 (2.3–7.4) | 0 (0–1.5) | 18 (13.7–23.2) | 4.1 (2.3–7.4) | 1.2 (.4–3.5) | 5.3 (3.2–8.9) | 2.4 (1.1–5.2) | 1.2 (.4–3.5) | 8.9 (6–13.2) |
| Bangladesh | 30 | 66.67 (48.6–80.8) | 9.96 (3.61–25.7) | 0 (.08–11.2) | 0 (.08–11.2) | 3.29 (.78–16.6) | 3.29 (.78–16.6) | 0 (.08–11.2) | 6.67 (2.04–21.4) | 0 (.08–11.2) | 0 (.08–11.2) | 6.67 (2.04–21.4) | 3.29 (.78–16.6) | 9.96 (3.61–25.7) | 6.67 (2.04–21.4) |
| Gujarat | 57 | 40.4 (28.6–53.4) | 5.3 (1.9–14.4) | 7 (2.9–16.7) | 1.8 (.4–9.2) | 1.8 (.4–9.2) | 1.8 (.4–9.2) | 0 (0–6.2) | 17.5 (9.9–29.4) | 5.3 (1.9–14.4) | 1.8 (.4–9.2) | 1.8 (.4–9.2) | 17.5 (9.9–29.4) | 5.3 (1.9–14.4) | 33.3 (22.5–46.3) |
| Karnataka | 47h | 55.3 (41.2–68.6) | 2.1 (.5–11.1) | 2.1 (.5–11.1) | 0 (.1–7.4) | 0 (.1–7.4) | 0 (.1–7.4) | 0 (.1–7.4) | 19.1 (10.5–32.6) | 6.4 (2.3–17.2) | 0 (.1–7.4) | 10.6 (4.7–22.7) | 6.4 (2.3–17.2) | 0 (.1–7.4) | 14.9 (7.5–27.8) |
| Kashmir | 19 | 26.3 (11.9–49.1) | 0 (.1–16.8) | 0 (.1–16.8) | 5.3 (1.2–24.9) | 10.5 (3.2–31.7) | 0 (.1–16.8) | 0 (.1–16.8) | 10.5 (3.2–31.7) | 0 (.1–16.8) | 5.3 (1.2–24.9) | 10.5 (3.2–31.7) | 5.3 (1.2–24.9) | 21.1 (8.7–43.7) | 31.6 (15.4–54.3) |
| Kerala | 100 | 54 (44.2–63.5) | 4 (1.6–9.8) | 0 (0–3.6) | 0 (0–3.6) | 0 (0–3.6) | 0 (0–3.6) | 5 (2.2–11.2) | 25 (17.6–34.3) | 9 (4.9–16.2) | 1 (.2–5.4) | 7 (3.5–13.8) | 1 (.2–5.4) | 4 (1.6–9.8) | 11 (6.3–18.7) |
| Maharashtra | 117 | 65 (55.9–73) | 3.4 (1.4–8.5) | 11.1 (6.6–18.1) | 0.9 (.2–4.6) | 0 (0–3.1) | 0 (0–3.1) | 5.1 (2.4–10.7) | 6.8 (3.5–12.9) | 0.9 (.2–4.6) | 0.9 (.2–4.6) | 4.3 (1.9–9.6) | 4.3 (1.9–9.6) | 5.1 (2.4–10.7) | 23.1 (16.4–31.5) |
| Punjab | 150 | 36 (28.8–44) | 0.7 (.2–3.6) | 4.7 (2.3–9.3) | 1.3 (.4–4.7) | 0.7 (.2–3.6) | 0 (0–2.4) | 2.7 (1.1–6.6) | 11.3 (7.2–17.4) | 0.7 (.2–3.6) | 0.7 (.2–3.6) | 7.3 (4.2–12.7) | 14 (9.4–20.5) | 5.3 (2.8–10.2) | 42.7 (35–50.7) |
| Rajasthan | 36 | 69.4 (53–82) | 0 (.1–9.5) | 13.9 (6.2–28.8) | 0 (.1–9.5) | 0 (.1–9.5) | 5.6 (1.7–18.2) | 0 (.1–9.5) | 22.2 (11.8–38.2) | 5.6 (1.7–18.2) | 2.8 (.7–14.2) | 2.8 (.7–14.2) | 5.6 (1.7–18.2) | 0 (.1–9.5) | 2.8 (.7–14.2) |
| Sri Lanka | 132 | 57.6 (49–65.7) | 6.8 (3.7–12.5) | 1.5 (.5–5.3) | 1.5 (.5–5.3) | 1.5 (.5–5.3) | 0.8 (.2–4.1) | 0 (0–2.7) | 12.9 (8.2–19.7) | 0.8 (.2–4.1) | 3 (1.2–7.5) | 12.1 (7.6–18.8) | 2.3 (.8–6.5) | 2.3 (.8–6.5) | 15.2 (10–22.3) |
| Tamil Nadu | 51 | 64.7 (50.9–76.4) | 5.9 (2.1–15.9) | 3.9 (1.2–13.2) | 2 (.5–10.3) | 2 (.5–10.3) | 2 (.5–10.3) | 0 (0–6.8) | 11.8 (5.6–23.4) | 0 (0–6.8) | 3.9 (1.2–13.2) | 2 (.5–10.3) | 3.9 (1.2–13.2) | 23.5 (14–36.8) | 7.8 (3.2–18.5) |
| Uttar Pradesh | 98 | 52 (42.2–61.7) | 1 (.2–5.5) | 8.2 (4.2–15.3) | 1 (.2–5.5) | 0 (0–3.7) | 2 (.6–7.1) | 1 (.2–5.5) | 11.2 (6.4–19) | 3.1 (1.1–8.6) | 2 (.6–7.1) | 15.3 (9.5–23.8) | 3.1 (1.1–8.6) | 4.1 (1.7–10) | 13.3 (8–21.4) |
| West Bengal | 106 | 71.7 (62.5–79.4) | 3.8 (1.5–9.3) | 1.9 (.6–6.6) | 0 (0–3.4) | 5.7 (2.7–11.8) | 0 (0–3.4) | 0.9 (.2–5.1) | 12.3 (7.3–19.9) | 6.6 (3.3–13) | 2.8 (1–8) | 6.6 (3.3–13) | 0.9 (.2–5.1) | 0.9 (.2–5.1) | 7.5 (3.9–14.2) |
a a See Tables 8–13 (additional files 3–7, 9) for frequencies of other haplogroups including breakdown of R2, U7 and W
b Includes all haplogroup M sub-haplogroups
c Includes sub-haplogroups R2, R5, R6 and R10
d Composite of haplogroups U2a, U2b and U2c [27]
e Composite of haplogroups A, B, D, G, M7, M8 (including C and Z), M9 (including E), M10, N9 (including Y), R9 (including F)
f Composite of haplogroups HV, TJ, X, N1, N2 and U (except U2i)
g Caste and tribal populations plus 8 Siddi [49] who belong to neither groupings
h Datasets where caste affiliation was not determined (undefined/mixed caste population), Scheduled Caste populations (e.g. the Mukri, see text for explanation) and data from [3] (dataset of 27 haplogroup M samples) are excluded.
i Indo-European speakers of southern India (Tamil Nadu, Kerala, Karnataka, Andhra Pradesh and Sri Lanka)
j Indo-European speakers of northern India (States other than listed in the previous footnote)
k The sum of the presented caste populations sample sizes is different from that given in the third row of the current table because of the addition of the Bangladeshi (n = 30) and extraction of the Mukri (n = 42), Orissa (n = 2) and Bihar (n = 2) data.
Figure 4Spatial Autocorrelation Analyses Correlograms of different haplogroups and haplogroup groupings frequencies in South Asia. In the case of haplogroup M3a some datasets had to be excluded because lack of resolution (see footenote for Table 3. for detailes). There was no significant cline in the frequencies of haplogroup R2. In the case of haplogroups M6a and M6b only the potential cline along the Bay of Bengal is investigated. Therefor the number of distance classes is reduced to three.
Figure 2The spatial distribution of Indian-specific mtDNA haplogroups (U2, M4a, M18 and M25) and their sub-haplogroups. In the case of haplogroup M25 some datasets had to be excluded because the discrimination between M* and the respective subgroup was not possible on the basis of the HVS-I data alone (see footnote of Table 3). For other details see legend to Figure 1.
Indian-specific sub-clades of mtDNA haplogroups M and R.
| DIAGNOSTIC CODING REGION MARKERS | ANCESTRAL HVS-I MOTIF | COALESCENCE (years) | ρ/nδ2 a | COALESCENCE b (years) | ρ/nδ2 a | nc | Proportion (95% CR) | |
| M2 | 477G-1780-8502 | 16223–16319 | 70,600 ± 21,000 | 0.02 | 70,100 ± 20,700 | 0.04 | 166 | 6.1 (5.3–7.1) |
| M2a | 5252–8369 | 16223-16319-16270 | 48,300 ± 20,100 | 0.03 | 46,700 ± 22,800 | 0.06 | 79 | 2.9 (2.3–3.6) |
| M2b | 16223-16319-16274 | 54,800 ± 25,000 | 0.02 | 57,600 ± 22,300 | 0.06 | 87 | 3.2 (2.6–3.9) | |
| M3a | 4580 | 16126–16223 | 17,300 ± 7,400 | 0.10 | 17,300 ± 7,600 | 0.11 | 62 | 3.0f (2.4 – 3.8) |
| M4a | 6620–7859 | 16223–16311 | 19,200 ± 9,000 | 0.23 | 19,100 ± 9,000 | 0.25 | 21 | 0.8 (0.5–1.2) |
| M6 | 3537 | 16223-16231-16362 | 33,000 ± 13,900 | 0.04 | 30,000 ± 13,600 | 0.08 | 79 | 2.9 (2.3–3.6) |
| M6a | 16223-16231-16356-16362 | 19,100 ± 7,600 | 0.17 | 15,700 ± 8,100 d | 0.54 | 37 | 1.4 (1.0–1.9) | |
| M6b | 5585 | 16188-16223-16231-16362 | 6,000 ± 2,100 | 0.73 | 12,100 ± 4,500 | 0,82 | 37 | 1.4 (1.0–1.9) |
| M18 | 16223-16318T | 9,400 ± 3,200 | 0.31 | 17,100 ± 4,700 | 0.61 | 58 | 2.1 (1.7–2.7) | |
| M25 | 15928 | 16223–16304 | 19,400 ± 7,200 | 0.30 | 22,300 ± 8,600 | 0.32 | 25 | 1.0g (0.7 – 1.5) |
| R2e | 4216 | 16071 | 40,400 ± 14,300 | 0.45 | 52,500 ± 21,700 | 0.5 | 8 | 0.3 (0.2–0.6) |
| R5 | 8594 | 16266–16304 | 66,100 ± 22,000 | 0.04 | 69,800 ± 24,800 | 0.05 | 58 | 2.1 (1.7–2.7) |
| R6 | -12282 AluI | 16129–16362 | 30,000 ± 11,000 | 0.14 | 30,300 ± 11,600 | 0.19 | 35 | 1.3 (0.9–1.8) |
| Total | 2719 |
a Efficiency of the sample for coalescence calculation (Saillard et al. 2000)
b Calculated without the tribal populations
c In the Indian subcontinent (a total of 2719 mtDNAs)
d Excluding also the scheduled cast of Mukri
e R2 is spread also in the Near East, Central Asia and Volga basin
f Excluding the datasets where M3* / M3a discrimination was impossible: 246 subjects from Andhra Pradesh (Bamshad et al. 1998); 382 Pushtoon, Koragas, Naga, Yerava, Pardhi, Paniya Adi, Andh, Apatani and Soligas (Cordaux et al. 2003); 35 Kota and Kurumba (Roychoudhury et al. 2001)
g Excluding the datasets where M* / M25 discrimination was impossible: 246 subjects from Andhra Pradesh (Bamshad et al. 1998); 39 Thoti and 29 subjects from Bangladesh (Cordaux et al. 2003); 12 Muria (Roychoudhury et al. 2001)
Figure 1The spatial distribution of Indian-specific mtDNA haplogroups (M2, M6 and M3) and their sub-haplogroups. Maps of South and Southwest Asia depicting the spatial frequency distribution of various Indian-specific mtDNA haplogroups. For India, the tribal populations were excluded (see 'Methods' for explanation). The published (full reference in Table 6, see Additional file 1) and new data were averaged to the resolution of states in India, geographic regions in Iran and provinces in China and Thailand. Numbers adjacent to the data points indicate sample sizes. In the case of haplogroup M3a some datasets had to be excluded because the discrimination between M* and the respective subgroup was not possible on the basis of the HVS-I data alone (see footnote of Table 3).
Figure 3The spatial distribution of Indian-specific mtDNA haplogroups (R5 and R6) and West Eurasian-specific U7, W and R2 in South and southwest Asia. For other details see legend to Figure 1.
Figure 5Peopling of Eurasia. Map of Eurasia and northeastern Africa depicting the peopling of Eurasia as inferred from the extant mtDNA phylogeny. The bold black arrow indicates the possible "coastal" route of colonization of Eurasia by anatomically modern humans (ca. 60,000 – 80,000 ybp.). This "Southern Coastal Route" is suggested by the phylogeography of mtDNA haplogroup M, the virtual absence of which in the Near East and Southwest Asia undermines the likelihood of the initial colonization of Eurasia taking a route north around the Red Sea. Therefore, the initial split between West and East Eurasian mtDNAs is postulated between the Indus Valley and Southwest Asia. Spheres depict expansion zones where, after the initial (coastal) peopling of the continent, local branches of the mtDNA tree (haplogroups given in the spheres) arose (ca. 40,000 – 60,000 ybp), and from where they where further carried into the interior of the continent (thinner black arrows). Admixture between the expansion zones has been surprisingly limited ever since. We note that while there is no obvious need to introduce the "northern route" – from northeast Africa over Sinai to the Near East – to explain the initial colonization of Eurasia, the spread of some mtDNA and Y-chromosomal haplogroups implies that the "northern" passage might have been used in a later period [33, 34].
Figure 6Network of HVS-I haplotypes belonging to haplogroup M6. Circle areas are proportional to haplotype frequencies. Variant bases of the HVS-I are numbered as in (Anderson et al. 1981) minus 16,000 and shown along links between haplotypes. Character changes are specified only for transversions. Underlined samples are those in which the diagnostic coding region markers (3537 and 5585) were assayed by either RFLP analysis (M6: -3537 AluI; M6b: -5584 AluI) or direct sequencing. Sample codes are as in Table 6 (see Additional file 1). Coalescence estimates marked with an asterisk are calculated excluding tribal populations (see Materials and Methods for explanation). The coalescence estimate marked with two asterisks is calculated without the data on tribal and scheduled caste (the Mukri) populations (see text for details).
Figure 7A network of haplogroup M3a haplotypes. Underlined samples are those in which the diagnostic coding region marker (4580) of M3a was assayed by either RFLP analysis (-4577 NlaIII) or direct sequencing. For other details, see the legend to Figure 6.
Figure 8Network of HVS-I haplotypes belonging to haplogroup M4a. Underlined samples are those in which the diagnostic coding region marker (6620) of M4a was assayed by either RFLP analysis (+6618 MboI) or direct sequencing. For other details, see the legend to Figure 6.
Figure 9Network of HVS-I haplotypes belonging to haplogroup M18. For other details, see the legend to Figure 6
Figure 10Network of HVS-I haplotypes belonging to haplogroup M25. Underlined samples are those in which the diagnostic coding region marker (15928) of M25 was assayed by either RFLP analysis (-15925 MspI) or direct sequencing. For other details, see the legend to Figure 6.
Figure 11The segregation of West Eurasian, East Eurasian and South Asian mtDNA pools. Partial map of Eurasia illustrating the spatial frequency distribution of mtDNA haplogroups native to West Eurasia (panel A), South Asia (panel B) and East Eurasia (panel C). Data points represent states in India, geographic divisions in Iran and provinces in China and Thailand. Numbers adjacent to the data points indicate the applicable sample size. The distribution of West Eurasian-, South Asian- and East Eurasian-specific mtDNA haplogroups amongst the tribal populations of South and Southeast Asia are depicted as pie diagrams on panel D (colors as on panels A, B, and C, while the white color represents unclassified M* and R* mtDNAs). The data on tribal populations was not used for the isofrequency maps of panels A, B and C (see Materials and Methods for explanation).
Coalescence estimates and diversity values for mtDNA haplogroups U7, W and R2 in India, Central Asia, Near and Middle East.
| INDIA | NEAR AND MIDDLE EAST | CENTRAL ASIA | ||||
| HAPLO-GROUP | COALESCENCE (years) | DIVERSITY | COALESCENCE (years) | DIVERSITY | COALESCENCE (years) | DIVERSITY |
| U7 | 41,400 ± 15,800 | 0.941 | 41,200 ± 14,800 | 0.91 | 34,400 ± 15,500 | 0.941 |
| W | 37,900 ± 11,100 | 0.883 | 32,000 ± 8,700 | 0.934 | 27,400 ± 8,300 | 0.758 |
| R2 | 40,400 ± 14,300 | 0.923 | 36,100 ± 11,100 | 0.955 | 25,200 ± 13,300 | 0.889 |
Proportion of mtDNA haplotype sharing between population groups of South Asia
| Group 1 | Group 2 | |||||||
| pop. group | n | nb | 1/2d | pop. group | n | nb | 2/1e | nc |
| Random 1g | 1256 ± 215 | 617 ± 109 | .399 – .453 | Random 2g | 1426 ± 215 | 643 ± 114 | .396 – .447 | 132 (112 – 154) |
| Tribals | 1197 | 437 | .411 – .467 | Castes | 1485 | 806 | .324 – .372 | 107 (90 – 128) |
| Northern states | 1204 | 647 | .277 – .329 | Southern states | 1478 | 585 | .345 – .394 | 96 (80 – 116) |
| Western states | 1142 | 508 | .453 – .511 | Eastern states | 1540 | 735 | .331 – .379 | 107 (90 – 128) |
| Dravidians | 974 | 380 | .428 – 490 | Others | 1708 | 846 | .273 – .317 | 90 (74 – 109) |
| Indo-Europeans | 1322 | 686 | .328 – .379 | Others | 1360 | 556 | .393 – .446 | 106 (89 – 127) |
| AA tribals | 90 | 35 | .552 – .746 | Others | 2592 | 1110 | .014 – .025 | 9 (5 – 17) |
| TB tribals | 249 | 142 | .174 – .277 | Others | 2433 | 1018 | .091 – .115 | 24 (16 – 35) |
| Indo-Europeans | 1322 | 686 | .295 – .346 | Dravidians | 974 | 380 | .425 – .487 | 87 (71 – 106) |
| Total | 2682 | 1136 | 340 f | |||||
a continental Indians including Pakistan and Bangladesh (excluding Andaman & Nicobar)
b number of haplotypes
c number of shared haplotypes between group 1 and group 2 (95% credible regions; CR)
d CR of the proportion of those haplotypes in group 1 that occur also in group 2
e CR of the proportion of those haplotypes in group 2 that occur also in group 1
f 41% of the shared haplotypes are shared only between members of the same population
g ± indicate SD