| Literature DB >> 18680585 |
Gyaneshwer Chaubey1, Monika Karmin, Ene Metspalu, Mait Metspalu, Deepa Selvi-Rani, Vijay Kumar Singh, Jüri Parik, Anu Solnik, B Prathap Naidu, Ajay Kumar, Niharika Adarsh, Chandana Basu Mallick, Bhargav Trivedi, Swami Prakash, Ramesh Reddy, Parul Shukla, Sanjana Bhagat, Swati Verma, Samiksha Vasnik, Imran Khan, Anshu Barwa, Dipti Sahoo, Archana Sharma, Mamoon Rashid, Vishal Chandra, Alla G Reddy, Antonio Torroni, Robert A Foley, Kumarasamy Thangaraj, Lalji Singh, Toomas Kivisild, Richard Villems.
Abstract
BACKGROUND: Human genetic diversity observed in Indian subcontinent is second only to that of Africa. This implies an early settlement and demographic growth soon after the first 'Out-of-Africa' dispersal of anatomically modern humans in Late Pleistocene. In contrast to this perspective, linguistic diversity in India has been thought to derive from more recent population movements and episodes of contact. With the exception of Dravidian, which origin and relatedness to other language phyla is obscure, all the language families in India can be linked to language families spoken in different regions of Eurasia. Mitochondrial DNA and Y chromosome evidence has supported largely local evolution of the genetic lineages of the majority of Dravidian and Indo-European speaking populations, but there is no consensus yet on the question of whether the Munda (Austro-Asiatic) speaking populations originated in India or derive from a relatively recent migration from further East.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18680585 PMCID: PMC2529308 DOI: 10.1186/1471-2148-8-227
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Geographical, Linguistic and Haplogroup Affiliations of Completely Sequenced mtDNAs.
| Si No. | Sample code | Haplogroup | Population | Location | Lingustic affiliation |
|---|---|---|---|---|---|
| 1 | Kol77 | R5a1 | Koli | Gujarat | Indo-European |
| 2 | Ben46 | R5a1a | Bengal | West Bengal | Indo-European |
| 3 | Up41 | R5a1a | Middle caste | Uttar Pradesh | Indo-European |
| 4 | Kall43 | R5a2b | Kallar | Tamil Nadu | Dravidian |
| 5 | K35 | R5a2b | Kota | Tamil Nadu | Dravidian |
| 6 | Ori74 | R5a2b2 | Oraon | Orissa | Dravidian |
| 7 | Mo38 | R5a2b3 | Moor | Sri Lanka | Dravidian |
| 8 | Gu35 | R5a2b3 | Gujarat | Gujarat | Indo-European |
| 9 | Pn32 | R5a2b4 | Paniya | Kerala | Dravidian |
| 10 | Mal33 | R5a2b4 | Malayan | Kerala | Dravidian |
| 11 | Ko 5 | R6a1a | Koya | Andhra Pradesh | Dravidian |
| 12 | Ko31 | R6a1a | Koya | Andhra Pradesh | Dravidian |
| 13 | Lam43 | R7a1 | Lambadi | Andhra-Pradesh | Dravidian |
| 14 | As426 | R7a1 | Asur | Jharkhand | Austro-Asiatic |
| 15 | Mw1 | R7a1a | Mawasi | Chhattisgarh | Austro-Asiatic |
| 16 | Tor45 | R7a1a | Sindhi | Pakistan | Indo-European |
| 17 | Ho433 | R7a1b1 | Ho | Jharkhand | Austro-Asiatic |
| 18 | Ori7 | R7a1b1 | Oraon | Jharkhand | Dravidian |
| 19 | Ori37 | R7b1a | Oraon | Orissa | Dravidian |
| 20 | A474 | R7a1b2 | Oraon | Jharkhand | Dravidian |
| 21 | G39 | R7a1b2 | Santhal | Bihar | Austro-Asiatic |
| 22 | G19 | R7a1b2 | Kanwar | Madhya-Pradesh | Indo-European |
| 23 | KO18 | R7b | Koya | Andhra-Pradesh | Dravidian |
| 24 | KO55 | R7b1a | Koya | Andhra-Pradesh | Dravidian |
| 25 | G66 | R7b1a | Gond | Madhya-Pradesh | Dravidian |
| 26 | Ko74 | R8a | Koya | Andhra Pradesh | Dravidian |
| 27 | Lam10 | R8a1a1 | Lambadi | Andhra Pradesh | Dravidian |
| 28 | Ko30 | R8a1a2 | Koya | Andhra Pradesh | Dravidian |
| 29 | Ko37 | R8a1a2 | Koya | Andhra Pradesh | Dravidian |
| 30 | CoB41 | R8a1b | Konkanastha Brahmin | Maharashtra | Indo-European |
| 31 | CoB23 | R30 | Konkanastha Brahmin | Maharashtra | Indo-European |
| 32 | Sin49 | R30a | Sinhalese | Sri Lanka | Indo-European |
| 33 | Pun47 | R30b | Punjab | Punjab | Indo-European |
| 34 | Raj25 | R31a1 | Rajput | Rajasthan | Indo-European |
| 35 | Raj48 | R31a1 | Rajput | Rajasthan | Indo-European |
Figure 1The most parsimonious tree of haplogroup R7 complete mtDNA sequences observed in the Indian subcontinent. This tree was redrawn manually from the output of median joining/reduced network obtained using NETWORK program (version 4.1) [34]http://www.fluxus-engineering.com. The samples were selected through a preliminary sequence analysis of the control region in order to include the widest possible range of R7 variation, language and geographical groups. Coalescent times were calculated by a calibration method described elsewhere [32]. 16182C, 16183C and 16519 polymorphisms were omitted. Suffixes A, C, G, and T indicate transversions, recurrent mutations are underlined. Synonymous (s) and non-synonymous (ns) mutations are distinguished. DRA-Dravidian, AA-Austro-Asiatic, IE-Indo-European. The ethnic affiliation of the samples is as follows: Lam, Lambadi; As, Asur; Mw, Mawasi; Tor45, Pakistan; Ho, Ho; Ori&A, Oraon; G19, Kanwar; G39, Santhal; G66, Gond; KO, Koya. Two sequences, T35 (Thogataveera) and C35 (Brahmin), were taken from the literature [4].
Frequency of Autochthonous R Subgroups Among Different Language Groups of India.
| R5 | R6 | R7 | R8 | R30 | R31 | Total Samples | |
|---|---|---|---|---|---|---|---|
| 1.12% | 4.27% | 5.90% | 2.64% | 0.61% | 0.00% | 983 | |
| 3.62% | 1.70% | 0.58% | 1.61% | 2.63% | 0.85% | 2240 | |
| 3.65% | 1.69% | 1.37% | 1.64% | 2.15% | 0.32% | 2190 | |
| 1.74% | 0.00% | 0.00% | 0.58% | 0.58% | 0.00% | 172 |
Figure 2Principal component (PC) analysis of R5-8, R30 and R31 lineages in Indian populations. Munda group and a few Indo-European/Dravidian populations collected from Bihar, Jharkhand and Chhattisgarh states, predominantly cluster with haplogroup R7. Haplogroup frequencies were obtained from published sources [14] and our unpublished data.
Figure 3The reduced-median network of 152 mtDNAs belonging to haplogroup R7. Each sample represented on the diagram has been sequenced for the HVS-I region and genotyped for the coding region mutations that are indicated. Circle sizes are proportional to the number of mtDNAs with that haplotype. Recurrent mutations are underlined.
Coalescent times of hg R7 subclades estimated from HVS-I data.
| Clade | Number of Samples | Motif (Coding region) | Time (SD) | ||
|---|---|---|---|---|---|
| 152 | 1442-6248-7870-9051-9110-10289-13105-13830 | 0.796 | 0.31 | 16.064 (6.260) | |
| 47 | 1442-6248-7870-9051-9110-10289-13105-13830 | 0.234 | 0.102 | 4.723 (2.059) | |
| 29 | 1442-6248-7870-9051-9110-10289-13105-13830 | 0.793 | 0.306 | 16.005 (6.185) | |
| 76 | 1442-6248-7870-9051-9110-10289-13105-13830 | 1.145 | 0.536 | 23.101 (10.822) | |
| 107 | 10143-10915-13404-15346 | 0.389 | 0.102 | 7.848 (2.064) | |
| 37 | 10143-10915-13404-15346 | 0.514 | 0.151 | 10.363 (3.037) | |
| 24 | 10143-10915-13404-15346 | 0.5 | 0.24 | 10.090 (4.757) | |
| 45 | 1804-2282-8557-12432-14064-15942 | 0.797 | 0.292 | 16.052 (5.891) | |
| 39 | 1804-2282-8557-12432-14064-15942 | 0.744 | 0.268 | 15.006 (5.402) | |
| 86 | 12406-13674 | 0.337 | 0.115 | 6.805 (2.311) | |
| 47 | 12406-13674 | 0.234 | 0.102 | 4.723 (2.059) | |
| 23 | 12406-13674 | 0.522 | 0.246 | 10.529 (4.663) | |
| 16 | 12406-13674 | 0.375 | 0.153 | 7.568 (3.090) |
Figure 4The frequency distribution of R7a and R7b clades in Indian subcontinent. The upper panel (a, b) shows the spatial distribution (%) of these clades in Indian populations. Isofrequency maps were generated by using Surfer7 of Golden Software (Golden Software Inc., Golden, Colorado), following the Kriging procedure. These isofrequency maps illustrate the geographic spread of the respective mtDNA haplogroups. It should be cautioned, however, that these illustrative maps should not be used to predict the frequency of the clade in geographical areas with missing data. The lower panel (c, d) depicts the frequencies of R7a and R7b in different social and language groups. DRA-Dravidian, AA-Austro-Asiatic, IE-Indo-European.
Mantel correlation test of Autochthonous R Subgroups to assess the significance of correlations between gene and geography, or language.
| Haplogroup | Gene vs Geography | p | Gene vs Language | p |
|---|---|---|---|---|
| 0.1276 | 0.0475 | 0.1748 | 0.2 | |
| 0.2654 | 0.037 | 0.13248 | 0.19 | |
| 0.299 | 0.023 | 0.219 | 0.225 | |
| 0.211496 | 0.01753 | 0.23248 | 0.31 | |
| 0.189917 | 0.127 | 0.1348 | 0.28 | |
| 0.172 | 0.1873 | 0.141 | 0.25 |
Figure 5The frequency distribution of haplogroup R7 in different branches of the Austro-Asiatic language family of India[26].