| Literature DB >> 23209554 |
Niraj Rai1, Gyaneshwer Chaubey, Rakesh Tamang, Ajai Kumar Pathak, Vipin Kumar Singh, Monika Karmin, Manvendra Singh, Deepa Selvi Rani, Sharath Anugula, Brijesh Kumar Yadav, Ashish Singh, Ramkumar Srinivasagan, Anita Yadav, Manju Kashyap, Sapna Narvariya, Alla G Reddy, George van Driem, Peter A Underhill, Richard Villems, Toomas Kivisild, Lalji Singh, Kumarasamy Thangaraj.
Abstract
Linguistic and genetic studies on Roma populations inhabited in Europe have unequivocally traced these populations to the Indian subcontinent. However, the exact parental population group and time of the out-of-India dispersal have remained disputed. In the absence of archaeological records and with only scanty historical documentation of the Roma, comparative linguistic studies were the first to identify their Indian origin. Recently, molecular studies on the basis of disease-causing mutations and haploid DNA markers (i.e. mtDNA and Y-chromosome) supported the linguistic view. The presence of Indian-specific Y-chromosome haplogroup H1a1a-M82 and mtDNA haplogroups M5a1, M18 and M35b among Roma has corroborated that their South Asian origins and later admixture with Near Eastern and European populations. However, previous studies have left unanswered questions about the exact parental population groups in South Asia. Here we present a detailed phylogeographical study of Y-chromosomal haplogroup H1a1a-M82 in a data set of more than 10,000 global samples to discern a more precise ancestral source of European Romani populations. The phylogeographical patterns and diversity estimates indicate an early origin of this haplogroup in the Indian subcontinent and its further expansion to other regions. Tellingly, the short tandem repeat (STR) based network of H1a1a-M82 lineages displayed the closest connection of Romani haplotypes with the traditional scheduled caste and scheduled tribe population groups of northwestern India.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23209554 PMCID: PMC3509117 DOI: 10.1371/journal.pone.0048477
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The most parsimonious route of prehistoric expansion of Y-chromosomal haplogroup H1a1a-M82 and the recent out-of -India migration of European Roma ancestors.
Measure of Genetic Diversity and expansion ages of haplogroup H1a1a-M82 among different population groups.
| Group |
|
| HD | MPD | AMD | Variance | Age |
| Roma Portugal | 21 | 8 | 0.8286±0.0646 | 1.5190±0.9501 | 1.14 | 0.08 | 3.69±2.53 |
| Roma Serbia | 41 | 17 | 0.8744±0.0363 | 1.4183±0.8851 | 1.66 | 0.1 | 3.71±2.18 |
| Roma Croatia | 76 | 13 | 0.5607±0.0674 | 1.2758±0.8111 | 1.36 | 0.05 | 1.91±0.72 |
| Northwest India | 64 | 57 | 0.9950±0.0042 | 7.0164±3.3392 | 5.86 | 0.38 | 16.04±2.38 |
| West India | 13 | 13 | 1.0000±0.0302 | 6.6795±3.3706 | 5.23 | 0.45 | 17.65±3.97 |
| North India | 26 | 24 | 0.9908±0.0151 | 8.3108±3.9781 | 8.04 | 0.49 | 19.46±2.95 |
| East India | 14 | 13 | 0.9890±0.0314 | 8.0659±3.9862 | 9.14 | 0.48 | 20.79±3.84 |
| Northcentral India | 35 | 20 | 0.9092±0.0389 | 7.3126±3.5062 | 7.94 | 0.46 | 19.12±3.01 |
| Southcentral India | 43 | 36 | 0.9900±0.0078 | 8.7741±4.1281 | 8.84 | 0.59 | 24.94±3.09 |
| South India | 37 | 35 | 0.9970±0.0071 | 8.8769±4.1865 | 8.78 | 0.56 | 23.63±3.22 |
n = number of samples; h = number of haplotypes; HD = haplotype diversity; MPD = mean pairwise difference; AMD = average mutational distance from Roma modal haplotype.
The average number of pairwise differences are shown within (PiX along diagonal) and between (PiXY above diagonal) populations; pairwise (δμ)2 genetic distance values are depicted below the diagonal.
| RP | RS | RC | NWI | WI | NI | EI | NCI | SCI | SI | Afg | |
| Roma-Portugal (RP) |
| 1.73984 | 2.14662 | 5.52827 | 4.7326 | 7.39011 | 8.26871 | 7.02449 | 7.49723 | 7.65508 | 6.34921 |
| Roma-Serbia (RS) | 0.27117 |
| 2.42202 | 5.75191 | 4.88368 | 7.59568 | 8.40592 | 7.06899 | 7.64265 | 7.61437 | 6.61382 |
| Roma-Croatia (RC) | 0.7492 | 1.07497 |
| 6.03372 | 4.90891 | 7.92814 | 7.98872 | 7.22744 | 7.54376 | 7.98898 | 6.02851 |
| Northwest India (NWI) | 1.26057 | 1.53457 | 1.88764 |
| 7.0012 | 8.0631 | 8.66853 | 8.08304 | 8.45858 | 8.46917 | 7.52344 |
| West India (WI) | 0.63333 | 0.83479 | 0.93127 | 0.15327 |
| 8.24852 | 8.47802 | 7.8044 | 8.19678 | 8.35135 | 7.17949 |
| North India (NI) | 2.4752 | 2.73115 | 3.13486 | 0.39953 | 0.75339 |
| 9.12363 | 8.14945 | 8.85599 | 8.79002 | 8.19231 |
| East India (EI) | 3.47622 | 3.66381 | 3.31786 | 1.12738 | 1.10531 | 0.93527 |
| 8.62653 | 9.26578 | 8.94402 | 7.4881 |
| Northcentral India (NCI) | 2.60866 | 2.70354 | 2.93325 | 0.91855 | 0.80835 | 0.33776 | 0.93726 |
| 8.6 | 8.60541 | 7.72857 |
| Southcentral India (SCI) | 2.35066 | 2.54647 | 2.51882 | 0.56335 | 0.46999 | 0.31357 | 0.84577 | 0.55665 |
| 9.04274 | 8.18605 |
| South India (SI) | 2.45712 | 2.46679 | 2.91264 | 0.52255 | 0.57317 | 0.1962 | 0.47261 | 0.51066 | 0.21726 |
| 8.27928 |
| Afghani (Afg) | 2.92302 | 3.23801 | 2.72395 | 1.34859 | 1.17308 | 1.37026 | 0.78846 | 1.4056 | 1.13234 | 1.17417 |
|
Figure 2Phylogenetic network relating Y-STR haplotypes within haplogroup H1a1a -M82.
The network was constructed using a median joining with MP (maximum parsimony) algorithm as implemented in the Network 4.6 program. The size of the circles is proportional to the number of samples. The data used for comparison have been taken from the literature (See Text S1).
Y-chromosomal haplogroup H1a1a-M82 founder analysis for Roma.
|
| TMRCA (years) | SD (years) |
| 5 | 1110 | 785 |
| 65 | 1024 | 407 |
| 61 | 2081 | 873 |
|
|
|
Different founders were identified based on the Network analysis (Text S1). The age was estimated from the ρ statistic (the mean number of mutations from the assumed root of each and every founder), using a 25-year generation time and the TD statistic, assuming a mutation rate of 6.9×10−4 [11], based on variation at 15 common Y-STR loci.
Figure 3Phylogenetic network relating Y-STR haplotypes within haplogroup H1a1a -M82- of Roma with northwest Indian populations.
The network was constructed using a median joining with MP (maximum parsimony) algorithm as implemented in the Network 4.6 program. The size of the circles is proportional to the number of samples. SC = Scheduled Caste and ST = Scheduled Tribe.
Analysis of Molecular Variance (AMOVA) using Y-STRs between groups of populations categorized on the basis of geography.
| Group | Fst |
| Roma Vs. Northwest India | 0.34990 |
| Roma Vs. West India | 0.36792 |
| Roma Vs. North India | 0.55269 |
| Roma Vs. East India | 0.65300 |
| Roma Vs. Northcentral India | 0.52934 |
| Roma Vs. Southcentral India | 0.45002 |
| Roma Vs. South India | 0.48570 |
| Roma Vs. Afghani | 0.68314 |