Literature DB >> 21187967

The influence of natural barriers in shaping the genetic structure of Maharashtra populations.

Kumarasamy Thangaraj¹, B Prathap Naidu, Federica Crivellaro, Rakesh Tamang, Shashank Upadhyay, Varun Kumar Sharma, Alla G Reddy, S R Walimbe, Gyaneshwer Chaubey, Toomas Kivisild, Lalji Singh.

Abstract

BACKGROUND: The geographical position of Maharashtra state makes it rather essential to study the dispersal of modern humans in South Asia. Several hypotheses have been proposed to explain the cultural, linguistic and geographical affinity of the populations living in Maharashtra state with other South Asian populations. The genetic origin of populations living in this state is poorly understood and hitherto been described at low molecular resolution level. METHODOLOGY/PRINCIPAL
FINDINGS: To address this issue, we have analyzed the mitochondrial DNA (mtDNA) of 185 individuals and NRY (non-recombining region of Y chromosome) of 98 individuals belonging to two major tribal populations of Maharashtra, and compared their molecular variations with that of 54 South Asian contemporary populations of adjacent states. Inter and intra population comparisons reveal that the maternal gene pool of Maharashtra state populations is composed of mainly South Asian haplogroups with traces of east and west Eurasian haplogroups, while the paternal haplogroups comprise the South Asian as well as signature of near eastern specific haplogroup J2a.
CONCLUSIONS/SIGNIFICANCE: Our analysis suggests that Indian populations, including Maharashtra state, are largely derived from Paleolithic ancient settlers; however, a more recent (∼10 Ky older) detectable paternal gene flow from west Asia is well reflected in the present study. These findings reveal movement of populations to Maharashtra through the western coast rather than mainland where Western Ghats-Vindhya Mountains and Narmada-Tapti rivers might have acted as a natural barrier. Comparing the Maharastrian populations with other South Asian populations reveals that they have a closer affinity with the South Indian than with the Central Indian populations.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
DNA, Mitochondrial

Year: 2010 PMID： 21187967 PMCID： PMC3004917 DOI： 10.1371/journal.pone.0015283

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

The Indian subcontinent is legendary for the cultural, linguistic and genetic diversity of its inhabitants. The contemporary populations of India offer a platform to study the wide range of disciplines viz. demography, history, linguistics and genetics. Genetic diversity in India can be understood as a result of long term large effective population size, a number of dispersal events and its unique social structure. Maharashtra is the western most state of India inhabited by several caste and tribal populations. It is politically bordered by Gujarat, Madhya Pradesh, Chhattisgarh, Karnataka and Andhra Pradesh in the northwest, northeast, east, south and southeast, respectively. The Arabian Sea makes up Maharashtra's west coast. There are three mountain ranges in this state viz. Sahyadri in west, Satpuda in north and Gondwan in the east. More importantly, the Western Ghats mountain ranges of India is considered as the most densely populated global biodiversity hotspot which has a mosaic of natural, semi-natural and agroecosystems in close proximity to one another [1]. In India, the Maharashtra state ranks second in population and third in area. It constitutes 9.33 percent of India's population (Census 2001). The tribal populations accounts to about 9.3 percent of the total population of the state (Census 2001). There are 47 scheduled tribal population groups in the state and the majority of them are inhabitants of these geographically difficult topography [2]. The major tribal populations of Sahyadri range are the Mahadeo Koli, Thakur, Katkari, Warli, Malhar Koli and Kokana group. Among Satpuda ranges, Bhil, Pawara, Korku and Tadvi are the major groups. The Madia, Gond, Pardhan, Halbi Otkar and Andha are found in the Gondwan range. These tribal groups differ from each other in various aspects, for instance their different cultural practices, marriage pattern and socioeconomic categories. The origin and migration of these groups are uncertain. As majority of these tribal groups living in the remote forest areas remain isolated from each other thus, minimizes the chances of gene flow among them. Mitochondrial DNA (mtDNA) evidence has been argued to support the model according to which populations of South Asians can trace their origin back to the Out-of-Africa (OoA) dispersal along the southern coastal route approximately 60 KYA [3]–[5] that is arguably in line with archaeological evidence [6]. Based on similarities reported in engraved pieces found at Blombos, South Africa, to those of Patne in India, and cresentic blade and microblade forms reported in India and Sri Lanka to Africa strongly indicated a direct connection between early human colonists in Asia and their ancestors in Africa [6]. However, the recent archaeological research has raised question about this dispersal and hypothesized alternative route [7], [8]. There is no archaeological evidence concerning the Paleolithic settlement of Western Ghats region by early modern human. It has been suggested that the early human populations in South Asia avoided the Western Ghats region because of high rainfall and thick vegetation [9]. Alternatively, it was proposed that such highly rich vegetation zone might have attracted early human because of ease of resources and the lack of corresponding evidence were explained due to various reasons (e.g., inadequate surveys or thick vegetation not allowing proper surveys, lithics being transported after use instead of being discarded) [10]. Previous genetic studies on South Asian populations have identified their genepool as a composite of lineages that have evolved locally since the OoA dispersal and those that have been introduced by a number of more recent dispersal events [4], [11]–[18]. The analyses of mtDNA, Y chromosome, and autosomal genes have agreed on that the contribution of western Eurasian gene flow to India is more pronounced in the modern populations than that from East and Central Asia whereas, the relative proportions of imported vs locally differentiated genes appear to be dependent on the particular locus and populations being examined. The geographical location of Maharashtra state makes it quite interesting to study the dispersal of modern humans in South Asia. Besides harboring such an important geographical position in Indian subcontinent, the origin and migration of several populations living in various regions in this state of India and their affinity with other contemporary South Asian population has not been explored, so far, at the high resolution level. The available genetic source of this region is poorly known due to less sample sizes and low resolution [19], [20]. Moreover, this state works as a bridge among central, northern and southern parts of India and might give some clues for the peopling of Indian subcontinent, placing it on the way of coastal route migration [3], [5]. Therefore, to shed more light on the origin of the Maharashtra population, we first analyzed the control-regions and partial coding-region sequence variations of mtDNA and 20 Y-SNP and 17 Y-STR markers of Y chromosome in two tribal populations (Mahadeo-Koli and Thakur) inhabited in westernmost coastal region of India and compared the results with published sources from contemporary populations [4], [11]–[14], [19]–[30]. Our results not only help to further understand the phylogenetic position of Maharashtra state in South Asia but also provide deeper insights into the origin of Western Ghat populations.

Materials and Methods

Sampling

About 8–10 ml of blood was collected from 185 healthy unrelated individuals belonging to two tribal populations (Mahadeo Koli n = 95, Thakur n = 90) residing in Thane district of Maharashtra (Fig. S1). This project has been approved by the Institutional Ethical Committee (IEC) of Centre for Cellular and Molecular Biology (CCMB) and the informed written consent was obtained from all the participants. DNA was extracted from whole blood by using the standard protocol [31].

mtDNA typing

Polymerase Chain Reactions were carried out with 10 ng DNA in a 10 ul reaction volume with 1U of Taq DNA polymerase enzyme. Cycling conditions used were 94°C for 5 min, 35 cycles at 95°C for 30 s, 58°C for 30 s, and 72°C for 2 min, then 72°C for 7 min. Sequencing of the PCR products were directly carried out by using Big Dye™ Terminator cycle sequencing Kit (Applied Biosystems, Foster City USA) in 3730 DNA Analyzer, following manufacture's protocol. To minimize errors, both strands were double-sequenced. The individual mtDNA sequences were compared against the rCRS [32] using AutoAssembler-Ver 2.1 (Applied Biosystems, Foster City USA). The sequencing data corresponding to nucleotide positions 15927–16510 and 16520–400 that includes HVS-I and HVS-II of the control region were sequenced in 94 Mahadeo Koli and 90 Thakur samples. Further, 19 segments of coding regions (nps.14890–15430,14120–14950,13370–14150,11970–12720,11330–12080,10000–10780,8630–9390,7950–8710,7170–8040,5880–6650,5280–6030,4520–5330,3810–4620,3190–3870,2510–3300,1870–2670,1240–1980,630–1370) were sequenced for 184 samples. All the newly generated sequences has been deposited in the GenBank (accession number HQ427694–HQ427878).

Y-chromosomal typing

Twenty Y chromosome biallelic polymorphic markers M89, YAP, M216, M130, M9, M45, M74, M52, M304, M172, M410, M67, M11, M20, M27, M175, M95, M173, M17, and M124 were typed. The PCR cycles were set-up with an initial denaturation of 5 min at 95°C, followed by 30–35 cycles of 30 sec. at 94°C, 30 sec at the primer-specific annealing temperature (52–60°C), and 45 sec. at 72°C, and final extension of 7 min at 72°C. Length variation at 17 Y-STR loci: DYS456, DYS3891, DYS390, DYS389II, DYS458, DYS19, DYS385a/b, DYS393, DYS391, DYS439, DYS635, DYS392, DYS437, DYS438, DYS448 and Y GATA H4 were genotyped using AmpFℓSTR® Y-filer™ PCR amplification Kit (Applied Biosystems). Multiplex polymerase chain reaction was carried out with 0.5ng template with 1 U AmpliTaq Gold DNA polymerase enzyme (Applied Biosystems, Foster City, CA) with total reaction volume of 6.25 µl which includes 10mM Tris-HCl (PH 8.3), 50mM KCl, 1.5mM Mgcl2, 250 µm dNTPs, 3.0 µm of each primer. The conditions for the polymerase chain reaction are (1) 95°C for 10 min, (2) 28 cycles; 94°C for 1 min, 55°C for 1 min, 72°C for 1 min, (3) 60°C for 45 min, and (4) 25°C hold. ABI 3730 DNA Analyser (Applied Biosystems, Foster City, CA) and GeneMapper V4.0 software program (Applied Biosystems, Foster City, CA) was used for the analysis of raw data.

Phylogenetic and Statistical analysis

Principal component analysis (PCA) was performed using POPSTR, kindly provided by H. Harpending. Median-joining and reduced median joining networks were reconstructed with NETWORK program (version 4.1) (www.fluxus-engineering.com). Reduced median and median-joining procedures were applied sequentially. Coalescence time has been calculated between nucleotide positions 16090–16365 (HVS-I) considering one transition equals to 20,180 years [33]. Standard deviation of the rho estimate (σ) was calculated as in Saillard et al. [34]. The diversity indices including AMOVA (analysis of molecular variance) based Fst analysis were calculated using ARLEQUIN 3.01 software [35]. The age of a Ychromosomal haplogroup have been obtained by the TD statistic, assuming mutation rate of 6.9×10−4 [36]. Haplogroups carrying less than 13 samples has not been included in age calculation. Out of 17 loci obtained, two DYS385 loci were excluded from the current analyses because they could not be distinguished using the typing method employed. Thus, all the analysis linked with Y-STR data were carried out with 15 loci. Haplogroup isofrequency maps were generated by using Surfer 8 of Golden Software (Golden Software Inc., Golden, Colorado), following the Kriging procedure. The present data from two tribes (Mahadeo Koli and Thakur) of central India were compared with previously published datasets [4], [11]–[14], [19]–[28], [30].

Results and Discussion

In order to understand the genetic relationship of populations of Maharashtra in the context of rest of the populations of India, we determined the summary genetic distances based on mtDNA and Y-chromosomal haplogroup frequencies, and subjected the resulting genetic distance matrices to principal component analysis and median joining network analysis (Fig. 1 and Fig. S2). The mtDNA PCA plot (Fig. 1a) did not show any clear-cut geographic or linguistic clustering in the data, while PC1 in the Y chromosomal plot (Fig. 1b) illustrates a distinct east to west clinal pattern. The Maharashtra populations cluster together and remain closer to South Indian and Gujarati populations than Central Indian populations (Fig. 1b). Within the Maharashtra populations we can see two clear-cut geographical clustering among the populations living on both sides of Western Ghat (Fig. 1b and Table 1). The most likely explanation of the above observations is the complex geographical structuring of this region. Populations living in Central India and in Maharashtra state are well separated by Narmada and Tapti rivers as well as Satpuda range of hills and populations living at the coastal region are equally separated from mainland populations by Sahyadri mountains (a range of Western Ghats), thus, restricting the easy population movement in either direction, while the long coastal region facilitates the gene flow from Karnataka and Gujarat states (Table 1). Therefore, geography is the main factor shaped the genetic composition of Maharashtra populations. Austroasiatic populations are the exceptions, who unanimously show their paternal affinity according to their linguistic division (Table 2). This is due to their highly frequent paternal haplogroup (hg) O2a [13], [15], [30].

Figure 1

Principal component Analysis.

Principal component (PC) scatter plot of mtDNA (a) and Y chromosomal (b) showing the genetic relation among population of Maharashtra and contemporary populations.

Table 1

The Analysis of Molecular Variance (AMOVA) results.

	Fst
	mtDNA	Y Chromosome
Maharashtra (east) Vs Madhya Pradesh	0.0537	0.2761
Maharashtra (east) Vs Andhra Pradesh	0.0411	0.1469
Maharashtra (east) Vs Karnataka	0.0492	0.0898
Maharashtra (east) Vs Gujarat	0.0501	0.1998
Maharashtra (west) Vs Madhya Pradesh	0.0794	0.4859
Maharashtra (west) Vs Andhra Pradesh	0.0567	0.3771
Maharashtra (west) Vs Karnataka	0.0293	0.0419
Maharashtra (west) Vs Gujarat	0.0173	0.0211

Table 2

The frequencies of different Y chromosomal haplogroups in Maharashtra populations, n is number of samples and * is the position relative to Western Ghat.

Population	Position*	Language	Social Status	n	F*(M89)	H(M69)	J(xJ2)	J2(M172)	L(M11)	O2a(M95)	O3(M122)	P*(M45)	R*(M207)	R1a1(M17)	R2(M124)	Reference
Maratha	East	Indo-European	Caste	53	0	0.39	0	0.15	0.06	0	0	0	0	0.19	0.06	Trivedi et al. 2008
Dhangar	East	Indo-European	Caste	33	0	0.38	0	0.06	0.06	0	0	0	0	0.31	0.18	Trivedi et al. 2008
Pawara	East	Indo-European	Tribe	16	0	0.75	0	0.06	0.06	0	0	0	0	0.06	0.06	Trivedi et al. 2008
Katkari	East	Indo-European	Tribe	19	0	0.63	0	0.05	0.05	0	0	0.05	0.05	0.16	0	Trivedi et al. 2008
Madia-Gond	East	Dravidian	Tribe	14	0	0.57	0	0	0.29	0	0	0.07	0	0	0.07	Trivedi et al. 2008
Maratha	East	Indo-European	Caste	20	0.05	0.35	0	0.15	0.15	0	0	0	0	0.10	0.20	Sengupta et al. 2006
Naba-Baudh	East	Indo-European	Religious group	14	0	0.43	0	0.14	0.07	0	0	0	0	0.21	0	Sengupta et al. 2006
Korku	East	Austroasiatic	Tribe	59	0.07	0.08	0	0	0	0.81	0.02	0.02	0	0.00	0	Kumar et al. 2007
Desasth Brahmin	West	Indo-European	Caste	35	0.05	0.13	0	0.18	0.11	0	0	0	0	0.37	0.16	Trivedi et al. 2008
Chitpavana Brahmin	West	Indo-European	Caste	39	0	0.23	0	0.17	0.13	0	0	0	0	0.27	0.20	Trivedi et al. 2008
Konkan Brahmins	West	Indo-European	Caste	23	0	0.08	0	0.16	0.04	0	0	0	0	0.48	0.20	Sengupta et al. 2006
Mahadeo-Koli	West	Indo-European	Tribe	50	0.24	0.18	0.04	0.14	0.02	0	0	0	0	0.26	0.12	Present Study
Thakur	West	Indo-European	Tribe	48	0.13	0.08	0	0.27	0.06	0	0	0	0	0.29	0.04	Present Study

Principal component Analysis.

Principal component (PC) scatter plot of mtDNA (a) and Y chromosomal (b) showing the genetic relation among population of Maharashtra and contemporary populations. The Network based analysis of mtDNA haplogroups in currently studied Mahadeo-Koli and Thakur populations identified several South Asian and a few West Eurasian specific sub-clades in the background of mtDNA macrohaplogroup M and N(R) (Fig. 2 and Table S1). The haplotype diversity in Thakur population was least among Maharashtra populations (Table 3). In macrohaplogroup M background haplogroups (hgps); M2, M3, M4 and M18 are shared by both Mahadeo-Koli and Thakur populations, while hgps M5, M6, M30, M33, M35, M37, M39, M40, M44 and M52 are exclusively present in Mahadeo-Koli. Two hgps of macrohaplogroup N (N5 and W) are present in Mahadeo-Koli while most of the macrohaplogroup R and U clades are shared by both of the populations, except hgps R6 and HV3 (Fig. 2).

Figure 2

The reduced-median network of 185 mtDNA subjects belonging to Mahadeo-Koli and Thakur populations.

Table 3

mtDNA diversity indices calculated in Maharashtra and neighbouring state populations.

Population	State	n	HD (SD)	MPD (SD)	ND (SD)
Andh	Andhra Pradesh	66	0.981 (0.005)	5.190 (2.544)	0.013 (0.007)
Brahmin	Andhra Pradesh	40	0.995 (0.007)	5.394 (2.655)	0.014 (0.007)
Chenchu	Andhra Pradesh	110	0.925 (0.011)	5.740 (2.770)	0.015 (0.008)
Erukala	Andhra Pradesh	27	0.966 (0.022)	5.325 (2.652)	0.014 (0.007)
Kapu	Andhra Pradesh	52	0.993 (0.005)	5.812 (2.824)	0.015 (0.008)
Koya	Andhra Pradesh	81	0.941 (0.016)	6.363 (3.047)	0.016 (0.008)
Lambadi	Andhra Pradesh	86	0.985 (0.005)	5.858 (2.826)	0.015 (0.008)
Naikpod	Andhra Pradesh	92	0.976 (0.005)	5.198 (2.538)	0.014 (0.007)
Thoti	Andhra Pradesh	39	0.906 (0.030)	5.519 (2.711)	0.015 (0.007)
Baiga	Chhattishgarh	26	0.910 (0.230)	5.592 (2.834)	0.014 (0.006)
Birhor	Chhattishgarh	20	0.891 (0.002)	5.363 (2.149)	0.015 (0.007)
Gujarati	Gujarat	57	0.996 (0.004)	6.223 (2.999)	0.017 (0.008)
Lobana	Gujarat	61	0.913 (0.033)	6.268 (3.127)	0.017 (0.011)
Koragas	Karnataka	51	0.988 (0.066)	5.895 (2.754)	0.015 (0.008)
BettuKurumba	Karnataka	79	0.899 (0.078)	6.912 (3.397)	0.014 (0.007)
Kanwar	Madhya Pradesh	19	0.965 (0.036)	5.556 (2.792)	0.014 (0.008)
Mushar	Madhya Pradesh	46	0.852 (0.041)	6.719 (3.227)	0.017 (0.009)
Satnami	Madhya Pradesh	18	0.974 (0.025)	5.301 (2.684)	0.014 (0.007)
Bhar	Madhya Pradesh	22	0.922 (0.045)	4.208 (2.171)	0.011 (0.006)
Harijan	Madhya Pradesh	20	0.990 (0.019)	5.326 (2.683)	0.014 (0.007)
Mahadeo-Koli	Maharashtra	95	0.990 (0.003)	6.454 (3.082)	0.017 (0.009)
Thakur	Maharashtra	90	0.903 (0.020)	6.353 (3.039)	0.016 (0.008)
Maratha	Maharashtra	30	0.995 (0.010)	6.749 (3.273)	0.017 (0.009)
Chitpawana Brahmin	Maharashtra	20	0.979 (0.026)	6.389 (3.159)	0.017 (0.009)
Desastha Brahmin	Maharashtra	19	1.000 (0.017)	5.895 (2.944)	0.015 (0.008)
Dhangar	Maharashtra	19	1.000 (0.017)	6.327 (3.139)	0.016 (0.009)
Konkan Brahmin	Maharashtra	58	0.985 (0.008)	6.059 (2.927)	0.016 (0.008)

HD = Haplotype Diversity.

MPD = Mean number of pairwise difference.

ND = Nucleotide Diversity.

The reduced-median network of 185 mtDNA subjects belonging to Mahadeo-Koli and Thakur populations.

This network was redrawn manually from the output of median joining/reduced network obtained using NETWORK program (version 4.1) (www.fluxus-engineering.com). Each sample represented on the diagram has been sequenced for the HVS-I region and genotyped for the coding region mutations that are indicated. Circle sizes are proportional to the number of mtDNAs with that haplotype. Recurrent mutations are underlined. Coalescent times were calculated by a calibration method described elsewhere [33]. 16182C, 16183C and 16519 polymorphisms were omitted. Suffixes A, C, G, and T indicate transversions. HD = Haplotype Diversity. MPD = Mean number of pairwise difference. ND = Nucleotide Diversity. The high resolution analysis at haplogroup and sub-haplogroup level identified a monophylacity of previously classified M2a and M2b subclades with a single coding as well as control region substitution and named as M2a'b (Fig. 2). This finding recognizes a sub-branch M2c and refines the defining mutations for haplogroup M2 [37], [39]. It is now defined by four coding and single control region mutations (Fig. 2). Similarly the finding of several branches in the background of haplogroups M4, R5, R8 and R30 has improved the resolution of mtDNA phylogeny of this region. It is noteworthy that although Thakur and Mahadeo-Koli live in a close proximity, there is no haplotype sharing among them, except single haplotype share in West Eurasian haplogroup U4 (Fig. 2). This suggests a high level of strict endogamy in these two populations regardless both being at the same social level and exchange the rituals and other traditional occupations with one another and maintain their unique identity. It is consistent with the previous observation on South Asian populations [15]. The coalescent age of different mtDNA lineages are calculated in the studied populations (Fig. 2). Majority of the sub-clades have a coalescent time ranging from 10–30 KYA (Fig. 2). The Y chromosome analysis identified nine major haplogroups in Maharashtra populations (Table 2), of which South Asian specific haplogroup H is most frequent in caste and tribal populations. Second most frequent haplogroup is hg R1a present in caste as well as tribal populations. Some of the studies considered hg H as a tribal and hg R1a as caste specific marker previously [12], [24], [25]. In contrast to them, the present study supports the occurrence of these haplogroups in both caste and tribal populations of India [11], [15]. The discrepancy of frequency distribution of these haplogroups in caste and tribal populations can be explained by their different population sizes where evolutionary forces act in a different way and diverse social customs that involve practicing endogamy at different levels [18]. Near Eastern specific hg J2 is also significantly present in both of the studied populations (Table 2 and Table S2). This haplogroup thought to be associated with the intrusion from Near East during Neolithic agricultural expansion [14]. Further dissection of this hg revealed most of the samples to be derived for marker M410 (hg. J2a). The further genotyping of M410 derived samples remained ancestral to M67 marker (hg. J2a4). The worldwide phylogeographic distribution of hg J2a suggests its entry in Indian subcontinent through northwestern corridor and an abrupt drop further south due to Western Ghat mountain ranges (Fig. 3). The rooted Y-STR network of different Y chromosomal haplogroups provided a diverse haplotype distribution in Maharashtra populations (Fig. 4).

Figure 3

The spatial distribution (%) of M410(J2a) clade in worldwide populations.

Isofrequency maps were generated by using Surfer8 of Golden Software (Golden Software Inc., Golden, Colorado), following the Kriging procedure. Data is used from present study and previously published literatures [14], [23], [26]. The dots represent sampling locations.

Figure 4

Rooted phylogenetic network of Maharashtra populations.

The spatial distribution (%) of M410(J2a) clade in worldwide populations.

Rooted phylogenetic network of Maharashtra populations.

(a) Rooted phylogenetic network of Maharashtra populations relating Y-STR haplotypes within macrohaplogroup CF (P143) and its associated phylogeny: The network was constructed using a median-joining algorithm as implemented in the Network 4.5.0.2 program. The size of the circles is proportional to the number of samples. (b) Rooted phylogenetic network of Maharashtra populations relating Y-STR haplotypes within macrohaplogroup K(M9) and M45(P) with their associated phylogeny. The network was constructed using a median-joining algorithm as implemented in the Network 4.5.0.2 program. The size of the circles is proportional to the number of samples. By using the Y-STR data from both of the populations, we have calculated the variance and coalescent ages for different haplogroups (Table 4). The age of microsatellites variation in all of the major haplogroups ranges from 7–35 KYA (Table 4). The South Asian specific haplogroups F*, H1a and R2 show pre-Neolithic, while hg L1 shows Neolithic expansion time. The age of haplogroup R1a ranges from 10–17 KYA which is consistent with previous large scale study on this haplogroup [38]. The network analysis of R1a with other Indian populations failed to provide any regional or linguistic clustering (Fig. S2).

Table 4

The coalescent age and variance of different Y chromosomal haplogroups observed in studied populations.

Haplogroup	Sample size	Age (Kya)	Variance
F*	18	26,3±4,8	0.66
H1a	13	17,65±3,9	0.45
J2a	15	15,3±7,6	0.36
R1a	27	14,1±3,2	0.34

In conclusion, our results on Maharashtra populations are consistent with other Indian populations suggest that the tribal as well as caste populations of Indian subcontinent practice a strict endogamy even though they live in a close proximity and share the ritual and social customs. The mtDNA results dissected and increased the clarity of South Asian mtDNA phylogeny. The colonization of western part of Western Ghat is facilitated mainly through migration of populations via western coast rather than mainland where Western Ghat-Vindhya mountains and Narmada-Tapti rivers worked as a natural barrier. Our data is in congruent with the other observations that Indian populations including Maharashtra state are largely derived from Paleolithic ancient settlers, however, a more recent (∼10 Ky older) detectable paternal gene flow from west Asia is well reflected in present genetic study. The sampling location of Mahadeo Koli and Thakar populations. (TIF) Click here for additional data file. Unrooted phylogenetic network of haplogroup R1a Y-STR haplotypes among different Indian populations showing the haplotype sharing of Thakur and Mahadeo-Koli. The network was constructed using a median-joining algorithm as implemented in the Network 4.5.0.2 program. The size of the circles is proportional to the number of samples. (TIF) Click here for additional data file. The complete mtDNA data from Mahadeo-Koli and Thakur populations. (XLS) Click here for additional data file. The Y-SNP and Y-STR complete data from Mahadeo-Koli and Thakur populations. (XLS) Click here for additional data file.

32 in total

1. Y-chromosome and mitochondrial DNA polymorphisms in Indian populations.

Authors: K Thangaraj; G V Ramana; L Singh
Journal: Electrophoresis Date: 1999-06 Impact factor: 3.535

2. mtDNA variation among Greenland Eskimos: the edge of the Beringian expansion.

Authors: J Saillard; P Forster; N Lynnerup; H J Bandelt; S Nørby
Journal: Am J Hum Genet Date: 2000-08-02 Impact factor: 11.025

3. The effective mutation rate at Y chromosome short tandem repeats, with application to human population-divergence time.

Authors: Lev A Zhivotovsky; Peter A Underhill; Cengiz Cinnioğlu; Manfred Kayser; Bharti Morar; Toomas Kivisild; Rosaria Scozzari; Fulvio Cruciani; Giovanni Destro-Bisol; Gabriella Spedini; Geoffrey K Chambers; Rene J Herrera; Kiau Kiun Yong; David Gresham; Ivailo Tournev; Marcus W Feldman; Luba Kalaydjieva
Journal: Am J Hum Genet Date: 2003-12-19 Impact factor: 11.025

4. The genetic heritage of the earliest settlers persists both in Indian tribal and caste populations.

Authors: T Kivisild; S Rootsi; M Metspalu; S Mastana; K Kaldma; J Parik; E Metspalu; M Adojaan; H-V Tolk; V Stepanov; M Gölge; E Usanga; S S Papiha; C Cinnioğlu; R King; L Cavalli-Sforza; P A Underhill; R Villems
Journal: Am J Hum Genet Date: 2003-01-20 Impact factor: 11.025

5. Mitochondrial DNA diversity in tribal and caste groups of Maharashtra (India) and its implication on their genetic origins.

Authors: M M Baig; A A Khan; K M Kulkarni
Journal: Ann Hum Genet Date: 2004-09 Impact factor: 1.670

6. Out of Africa: new hypotheses and evidence for the dispersal of Homo sapiens along the Indian Ocean rim.

Authors: Michael D Petraglia; Michael Haslam; Dorian Q Fuller; Nicole Boivin; Chris Clarkson
Journal: Ann Hum Biol Date: 2010-06 Impact factor: 1.533

7. Population genetic structure in Indian Austroasiatic speakers: the role of landscape barriers and sex-specific admixture.

Authors: Gyaneshwer Chaubey; Mait Metspalu; Ying Choi; Reedik Mägi; Irene Gallego Romero; Pedro Soares; Mannis van Oven; Doron M Behar; Siiri Rootsi; Georgi Hudjashov; Chandana Basu Mallick; Monika Karmin; Mari Nelis; Jüri Parik; Alla Goverdhana Reddy; Ene Metspalu; George van Driem; Yali Xue; Chris Tyler-Smith; Kumarasamy Thangaraj; Lalji Singh; Maido Remm; Martin B Richards; Marta Mirazon Lahr; Manfred Kayser; Richard Villems; Toomas Kivisild
Journal: Mol Biol Evol Date: 2010-10-26 Impact factor: 16.240

8. Origin and evolution of Native American mtDNA variation: a reappraisal.

Authors: P Forster; R Harding; A Torroni; H J Bandelt
Journal: Am J Hum Genet Date: 1996-10 Impact factor: 11.025

9. Human population in the biodiversity hotspots.

Authors: R P Cincotta; J Wisnewski; R Engelman
Journal: Nature Date: 2000-04-27 Impact factor: 49.962

10. Genetic affinities among the lower castes and tribal groups of India: inference from Y chromosome and mitochondrial DNA.

Authors: Ismail Thanseem; Kumarasamy Thangaraj; Gyaneshwer Chaubey; Vijay Kumar Singh; Lakkakula V K S Bhaskar; B Mohan Reddy; Alla G Reddy; Lalji Singh
Journal: BMC Genet Date: 2006-08-07 Impact factor: 2.797

17 in total

1. The paternal ancestry of Uttarakhand does not imitate the classical caste system of India.

Authors: Neetu Negi; Rakesh Tamang; Veena Pande; Amrita Sharma; Anish Shah; Alla G Reddy; Satti Vishnupriya; Lalji Singh; Gyaneshwer Chaubey; Kumarasamy Thangaraj
Journal: J Hum Genet Date: 2015-10-29 Impact factor: 3.172

2. Indian Siddis: African descendants with Indian admixture.

Authors: Anish M Shah; Rakesh Tamang; Priya Moorjani; Deepa Selvi Rani; Periyasamy Govindaraj; Gururaj Kulkarni; Tanmoy Bhattacharya; Mohammed S Mustak; L V K S Bhaskar; Alla G Reddy; Dharmendra Gadhvi; Pramod B Gai; Gyaneshwer Chaubey; Nick Patterson; David Reich; Chris Tyler-Smith; Lalji Singh; Kumarasamy Thangaraj
Journal: Am J Hum Genet Date: 2011-07-07 Impact factor: 11.025

3. The Genetic Ancestry of Modern Indus Valley Populations from Northwest India.

Authors: Ajai K Pathak; Anurag Kadian; Alena Kushniarevich; Francesco Montinaro; Mayukh Mondal; Linda Ongaro; Manvendra Singh; Pramod Kumar; Niraj Rai; Jüri Parik; Ene Metspalu; Siiri Rootsi; Luca Pagani; Toomas Kivisild; Mait Metspalu; Gyaneshwer Chaubey; Richard Villems
Journal: Am J Hum Genet Date: 2018-12-06 Impact factor: 11.025

Review 4. Complex genetic origin of Indian populations and its implications.

Authors: Rakesh Tamang; Lalji Singh; Kumarasamy Thangaraj
Journal: J Biosci Date: 2012-11 Impact factor: 1.826

5. Genomic view on the peopling of India.

Authors: Rakesh Tamang; Kumarasamy Thangaraj
Journal: Investig Genet Date: 2012-10-01

6. Genetic affinities of the central Indian tribal populations.

Authors: Gunjan Sharma; Rakesh Tamang; Ruchira Chaudhary; Vipin Kumar Singh; Anish M Shah; Sharath Anugula; Deepa Selvi Rani; Alla G Reddy; Muthukrishnan Eaaswarkhanth; Gyaneshwer Chaubey; Lalji Singh; Kumarasamy Thangaraj
Journal: PLoS One Date: 2012-02-29 Impact factor: 3.240

7. Afghanistan's ethnic groups share a Y-chromosomal heritage structured by historical events.

Authors: Marc Haber; Daniel E Platt; Maziar Ashrafian Bonab; Sonia C Youhanna; David F Soria-Hernanz; Begoña Martínez-Cruz; Bouchra Douaihy; Michella Ghassibe-Sabbagh; Hoshang Rafatpanah; Mohsen Ghanbari; John Whale; Oleg Balanovsky; R Spencer Wells; David Comas; Chris Tyler-Smith; Pierre A Zalloua
Journal: PLoS One Date: 2012-03-28 Impact factor: 3.240

8. Genetic Affinity of the Bhil, Kol and Gond Mentioned in Epic Ramayana.

Authors: Gyaneshwer Chaubey; Anurag Kadian; Saroj Bala; Vadlamudi Raghavendra Rao
Journal: PLoS One Date: 2015-06-10 Impact factor: 3.240

9. The influence of habitats on female mobility in Central and Western Africa inferred from human mitochondrial variation.

Authors: Valeria Montano; Veronica Marcari; Mariano Pavanello; Okorie Anyaele; David Comas; Giovanni Destro-Bisol; Chiara Batini
Journal: BMC Evol Biol Date: 2013-01-29 Impact factor: 3.260

10. Patrilineal perspective on the Austronesian diffusion in Mainland Southeast Asia.

Authors: Jun-Dong He; Min-Sheng Peng; Huy Ho Quang; Khoa Pham Dang; An Vu Trieu; Shi-Fang Wu; Jie-Qiong Jin; Robert W Murphy; Yong-Gang Yao; Ya-Ping Zhang
Journal: PLoS One Date: 2012-05-07 Impact factor: 3.240