Literature DB >> 32692490

Exploring European ancestry among the Kalash population: a mitogenomic perspective.

Zia Ur Rahman1,2,3, Yu-Chun Li4, Jiao-Yang Tian1, Qing-Peng Kong5.   

Abstract

With a population of around 4 000 individuals, the Kalash people have been living in the Hindu-Kush mountain valleys of present-day northern Pakistan for centuries. Due to their mysterious origin and fairer European complexion, the genetic history of this ethnic group has been investigated previously using different markers. To date, however, the maternal genetic architecture has not been systematically dissected based on high-resolution complete mitochondrial genomes (mitogenomes), making their maternal genetic history, especially their genetic connection with Europeans from a matrilineal perspective, unclear. To unravel this issue, we analyzed mitogenome data of 34 Kalash samples together with 6 075 individuals from across Eurasia. Our results indicated exclusive western Eurasian origin of the Kalash people, represented by eight haplogroups. Among these haplogroups, J2b1a7a and R0a5a (accounting for ~50% of the Kalash gene pool) displayed in situ differentiations in the Kalash and could be traced to the Mediterranean region. Age estimations suggested these haplogroups arose in the Kalash population ~2.26 and 3.01 thousand years ago (kya), a time frame consistent with the invasion of Alexander III of Macedon to the region. One possible explanation for the maternal genetic contribution from Europeans to the Kalash people would be the involvement of women in foreign campaigns of ancient Greek warfare, followed by a founder effect. Our study thus sheds important light on the genetic origin of the Kalash community of Pakistan.

Entities:  

Keywords:  Founder effect; Kalash; Mediterranean; Mitochondrial genome

Mesh:

Substances:

Year:  2020        PMID: 32692490      PMCID: PMC7475010          DOI: 10.24272/j.issn.2095-8137.2020.052

Source DB:  PubMed          Journal:  Zool Res        ISSN: 2095-8137


DEAR EDITOR,

With a population of around 4 000 individuals, the Kalash people have been living in the Hindu-Kush mountain valleys of present-day northern Pakistan for centuries. Due to their mysterious origin and fairer European complexion, the genetic history of this ethnic group has been investigated previously using different markers. To date, however, the maternal genetic architecture has not been systematically dissected based on high-resolution complete mitochondrial genomes (mitogenomes), making their maternal genetic history, especially their genetic connection with Europeans from a matrilineal perspective, unclear. To unravel this issue, we analyzed mitogenome data of 34 Kalash samples together with 6 075 individuals from across Eurasia. Our results indicated exclusive western Eurasian origin of the Kalash people, represented by eight haplogroups. Among these haplogroups, J2b1a7a and R0a5a (accounting for ~50% of the Kalash gene pool) displayed in situ differentiations in the Kalash and could be traced to the Mediterranean region. Age estimations suggested these haplogroups arose in the Kalash population ~2.26 and 3.01 thousand years ago (kya), a time frame consistent with the invasion of Alexander III of Macedon to the region. One possible explanation for the maternal genetic contribution from Europeans to the Kalash people would be the involvement of women in foreign campaigns of ancient Greek warfare, followed by a founder effect. Our study thus sheds important light on the genetic origin of the Kalash community of Pakistan. The Kalash or Kalasha people are an ancient Indo-European speaking indigenous group with unique culture and traditions, living restrictively in the Hindu-Kush mountain range of present-day northern Pakistan. The enigmatic origin of the Kalash and interestingly their distinct European complexion, e.g., lighter skin tone and blue eyes, in addition to certain customs and beliefs have so far reinforced their claim to be Greek descents following the invasion of Alexander III of Macedon to the region (Cacopardo, 2011). In the past several decades, various genetic studies have been carried out to investigate the genetic structure and history of the Kalash people, in particular their genetic connection with western Eurasians. For example, several studies have indicated that this ethnic group originated from either the Middle East or Europe, followed by a population bottleneck (Qamar et al., 2002; Rosenberg et al., 2002). It is also widely concerned whether the Kalash were genetically isolated for more than 10 kya (Ayub et al., 2015) or received genetic admixture from western Eurasia during 990 and 210 BCE (Hellenthal et al., 2014). Moreover, the possible genetic connection between Greeks and the Kalash remains controversial (Cacopardo, 2011; Firasat et al., 2007; Mansoor et al., 2004; Qamar et al., 2002). Many previous genetic studies have been based on nuclear genome or Y chromosome data, while the maternal genetic structure of the Kalash had only been dissected based on mitochondrial DNA (mtDNA) restricted fragment length polymorphism (RFLP) and control region variations (Quintana-Murci et al., 2004), thus greatly limiting our understanding of the maternal genetic landscape of this ethnic group. Therefore, whether there is a substantial maternal genetic contribution from Europeans to the Kalash, and when this genetic contact was established, remain unclear. To provide more insight into the genetic history of the Kalash from a matrilineal perspective, we collected and analyzed available complete mitochondrial genome (mitogenome) data of 34 Kalash individuals (25 from the CEPH Human Genome Diversity Project (HGDP) panel (Cann et al., 2002) and nine from this work), as well as 6 075 individuals sampled from Europe and Asia (Figure 1A; Supplementary Table S1). As showed in our results, a total of eight mtDNA haplogroups were identified in the Kalash, including R0a, U4a1, J2b1a, U2e1h, H2a1a, U4b1a4, T2a1a, and U2e2a1, all of which exclusively arise from the Eurasian macro haplogroup R, an observation in agreement with previous study (Quintana-Murci et al., 2004). Comparison of the maternal composition between Kalash and other Eurasian populations (Supplementary Table S1) showed that most of the identified haplogroups in the Kalash were substantially shared with neighboring Dardic group (Kho), as well as being ubiquitous in other western Eurasians (Figure 1B), indicating a western origination of this ethnic group. This is consistent with previous studies that were based on both uniparental markers and whole-genome data (Hellenthal et al., 2014; Qamar et al., 2002; Quintana-Murci et al., 2004). Phylogeographic analysis based on all available complete mitogenomes retrieved from the online platform MitoTool (http://mitotool.kiz.ac.cn/) (Fan & Yao, 2011) as well as from published literature further suggested that most haplogroups identified in Kalash, like R0a, U2e1h, U4a1, H2a1a, T2a1a, and U2e2a1, had sub-branches (e.g., R0a5a, U2e1h1, U4a1f, H2a1a3, etc.) distributed restrictively in northern Pakistan and shared by the Kalash and other Indo-European-speaking populations in the area (Supplementary Figure S1; Supplementary Table S2). Interestingly, the Kalash individuals distributed sporadically in the terminal positions of the sub-branches, strongly suggesting traces of recent gene flow from other groups into the Kalash (Supplementary Figure S1). Moreover, these haplogroups also showed prevalence in the Mediterranean region (e.g., U2e2a1, J2b1a1, and R0a) or in Eurasian Steppe (e.g., H2a1a, T2a1a, U2e1h, U4a1, and U4b1a4), thus possibly reached the Hindu-Kush region in different periods and further introgressed into the Kalash by recent gene flow.
Figure 1

Sample locations, distribution of haplogroups identified in Kalash people, and phylogeographic structure of haplogroup J2b1a

A: Geographic locations of populations with complete mitogenome sequences in Pakistan and surrounding countries are shown in the inset. Number of mitogenomes available from each region is proportional to color intensity in respective regions defined in figure legends (comprising 6 075 complete mitogenome sequences; see Supplementary Table S1). B: Schematic tree showing eight west Eurasian haplogroups identified in Kalash and their frequency in other local and west Eurasian populations. C: Phylogeographic reconstruction using median-joining network for haplogroup J2b1a from complete mitogenomes (comprising 106 complete mitogenome sequences belonging to haplogroup J2b1a and five ancient mitogenomes belonging to J2b1; see Supplementary Table S3). Each circle represents one individual sample, unless represented by a number in the circle. Dotted line shows case in which J2b1a7a and J2b1a7b emanate from root of J2b1a independently, with position 16274 (in italics) serving as a parallel mutation on both branches. Mutated positions are shown on branches with different colors for each type of mutation, as seen in legend. Specific clade shared between Sardinians and Kalash is enclosed in red circle; Kalash and Pashtun samples are shown in italics on different branches of node. Geographic affiliations of samples are shown in different colors, as defined in legends. Red circles represent ancient mitogenomes included in network construction. R or Y indicate heteroplasmic states. @ represents reverse mutation, < represents parallel mutation on branches. Different from the above lineages in which the Kalash samples distributed sporadically in different branches, haplogroup J2b1a had a sub-branch (defined by a non-synonymous transition at position 11204 and tentatively named as J2b1a7a) occupied by six Kalash and two Pashtun individuals, a neighboring group previously shown to have had a limited European connection based on Y chromosome study (Firasat et al., 2007). Further phylogeographic analysis showed that the root types of J2b1a7a were predominantly found in Kalash, whereas a Pashtun individual positioned in one terminal branch, indicating an in-situ differentiation of this lineage in the region and further spread into the Pashtuns. Importantly, J2b1a7a shared substitution 16274 with its sister haplogroup (defined by substitutions 15319 and 16213 and tentatively named as J2b1a7b) from Europe (nine Sardinians) (Figure 1C; Supplementary Table S3), indicating a close genetic connection between the Kalash and Europeans. Together with the relatively high proportion of J2b1a7a in the Kalash samples (17.6%), this haplogroup sheds important light on the European ancestry of this ethnic group. Moreover, considering that the shared position 16274 between the Kalash and Sardinians is hypervariable, it is also probable that the two lineages J2b1a7a and J2b1a7b were derived from the root of J2b1a independently, with 16274 serving as a parallel mutation on both branches. We therefore turned our attention to the ancestral node, J2b1a. Coincidently, the majority (74%) of J2b1a samples, as well as its ancient root type J2b1, were found in Europe, especially in Sardinia (Figure 1C; Supplementary Table S3). This evidence therefore implies an origination of J2b1a in Europe (probably around the Mediterranean region), in agreement with previous study (Pala et al., 2012). Additional support comes from the observation of haplogroup J2b1a in bones of ancient Europeans (Figure 1C, Supplementary Table S3). Further age estimations using mitogenome rate (Soares et al., 2009) revealed that the major haplogroup J2b1a can be traced back to 10.59±1.28 kya, a timeframe within the Neolithization and Bronze Age processes in the Mediterranean region (Marcus et al., 2020), with the Kalash branch (J2b1a7a) 2.26±1.44 kya reflecting a recent split from its European counterpart, followed by independent differentiation in the Hindu-Kush region. Similarly, haplogroup R0a5a, with root types found around the Mediterranean region and a coalescent age of ~3.01±1.5 kya in the Kalash, would also have been introduced into the Kalash gene pool during these recent times. Taken together, about ~50% of the Kalash maternal genetic components were derived from haplogroups J2b1a7a and R0a5a, thus documenting recent genetic introgression (likely from the ancestors of modern Sardinians) to the Kalash, around the time when migration to Sardinia was active from the northern and eastern Mediterranean regions (starting ~1 000 BCE) (Fernandes et al., 2020). Interestingly, this genetic connection echoes well with the close genetic affinity found between Sardinians and Kalash from studies based on eye-color informative single nucleotide polymorphisms (SNPs) (Walsh et al., 2011), thus probably underlying the similarities in physical features, e.g., lighter complexion of Kalash and Europeans. Moreover, given that the age of J2b1a7a fell within the Macedonian advancement towards northern Pakistan (327 BCE) (Olivieri et al., 2019), and the existence of J2b1c, J2b1a1, and J2b1a3 (sister and sub-type lineages of J2b1a) in ancient and modern Greeks (Lazaridis et al., 2017; Pala et al., 2012), including evidence of eastern Mediterranean immigrants in South Asia (Harney et al., 2019), it is also probable that this genetic connection was mediated by the Greeks. In fact, according to historical records, limited females participated in foreign campaigns of ancient Greek warfare (Loman, 2004), making it likely that the females also took part in this occupation, thus contributing to the Kalash gene pool. This scenario is further supported by evidence of human mobility towards mainland Greece and islands like Sardinia, especially from the Mediterranean, via both sea and land routes during the Mesolithic and even more recent times (Demand, 2012; Fernandes et al., 2020; Marcus et al., 2020). However, the absence of J2b1a in other regions that had been occupied by Alexander’s ancient empire (especially Greece), as well as its prevalence in Sardinian and Kalash people, should not be ignored. One probable explanation would be limited female migration along with Alexander’s siege into other regions, or genetic dilution by later demographic events. Additionally, genetic isolation, followed by bottlenecks in both Sardinians (Di Gaetano et al., 2014) and Kalash (Ayub et al., 2015), further played likely roles in the increase of this lineage in these two regions. Moreover, the limited number of reported mitogenome sequences available from Greece so far could also result in this observation. More studies will be carried out to explain whether this maternal genetic connection between the Kalash and Sardinians was mediated by Greek expansion. In summary, our analysis observed a genetic ancestry from Europe (probably around the Mediterranean) within the Kalash people from about 3.01±1.5 and 2.26±1.4 kya. This recent genetic contribution from Europe, as revealed in this study, accounts for a significant proportion (~50%) of the Kalash, thus playing an important role in the formation of the maternal gene pool of this ethnic group. Thus, our study sheds important light on the genetic history of the Kalash people of northern Pakistan.

DATA AVAILABILITY

The mitogenome sequences of nine Kalash individuals were obtained from our unpublished dataset (GenBank accession Nos. MN595835-MN595843). Additionally, the 11 mitogenome sequences from northern Pakistan were retrieved from our unpublished work (GenBank accession Nos. MN595685, MN595706, MN595718, MN595749, MN595751, MN595765, MN595769, MN595807, MN595818, MN595820, and MN595890). Supplementary data to this article can be found online. Click here for additional data file.

COMPETING INTERESTS

The authors declare that they have no competing interests.

AUTHORS’ CONTRIBUTIONS

Q.P.K., Y.C.L., and Z.U.R. designed the research; Z.U.R. collected samples; Z.U.R. and J.Y.T. collected and analyzed the data; Z.U.R., Y.C.L., and Q.P.K. wrote the paper. All authors read and approved the final version of the manuscript.
  17 in total

1.  Y-chromosomal DNA variation in Pakistan.

Authors:  Raheel Qamar; Qasim Ayub; Aisha Mohyuddin; Agnar Helgason; Kehkashan Mazhar; Atika Mansoor; Tatiana Zerjal; Chris Tyler-Smith; S Qasim Mehdi
Journal:  Am J Hum Genet       Date:  2002-03-15       Impact factor: 11.025

2.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

3.  IrisPlex: a sensitive DNA tool for accurate prediction of blue and brown eye colour in the absence of ancestry information.

Authors:  Susan Walsh; Fan Liu; Kaye N Ballantyne; Mannis van Oven; Oscar Lao; Manfred Kayser
Journal:  Forensic Sci Int Genet       Date:  2010-03-27       Impact factor: 4.882

4.  Mitochondrial DNA signals of late glacial recolonization of Europe from near eastern refugia.

Authors:  Maria Pala; Anna Olivieri; Alessandro Achilli; Matteo Accetturo; Ene Metspalu; Maere Reidla; Erika Tamm; Monika Karmin; Tuuli Reisberg; Baharak Hooshiar Kashani; Ugo A Perego; Valeria Carossa; Francesca Gandini; Joana B Pereira; Pedro Soares; Norman Angerhofer; Sergei Rychkov; Nadia Al-Zahery; Valerio Carelli; Mohammad Hossein Sanati; Massoud Houshmand; Jiři Hatina; Vincent Macaulay; Luísa Pereira; Scott R Woodward; William Davies; Clive Gamble; Douglas Baird; Ornella Semino; Richard Villems; Antonio Torroni; Martin B Richards
Journal:  Am J Hum Genet       Date:  2012-05-04       Impact factor: 11.025

5.  MitoTool: a web server for the analysis and retrieval of human mitochondrial DNA sequence variations.

Authors:  Long Fan; Yong-Gang Yao
Journal:  Mitochondrion       Date:  2010-10-07       Impact factor: 4.160

6.  Correcting for purifying selection: an improved human mitochondrial molecular clock.

Authors:  Pedro Soares; Luca Ermini; Noel Thomson; Maru Mormina; Teresa Rito; Arne Röhl; Antonio Salas; Stephen Oppenheimer; Vincent Macaulay; Martin B Richards
Journal:  Am J Hum Genet       Date:  2009-06-04       Impact factor: 11.025

7.  A genetic atlas of human admixture history.

Authors:  Daniel Falush; Simon Myers; Garrett Hellenthal; George B J Busby; Gavin Band; James F Wilson; Cristian Capelli
Journal:  Science       Date:  2014-02-14       Impact factor: 47.728

8.  Genetic origins of the Minoans and Mycenaeans.

Authors:  Iosif Lazaridis; Alissa Mittnik; Nick Patterson; Swapan Mallick; Nadin Rohland; Saskia Pfrengle; Anja Furtwängler; Alexander Peltzer; Cosimo Posth; Andonis Vasilakis; P J P McGeorge; Eleni Konsolaki-Yannopoulou; George Korres; Holley Martlew; Manolis Michalodimitrakis; Mehmet Özsait; Nesrin Özsait; Anastasia Papathanasiou; Michael Richards; Songül Alpaslan Roodenberg; Yannis Tzedakis; Robert Arnott; Daniel M Fernandes; Jeffery R Hughey; Dimitra M Lotakis; Patrick A Navas; Yannis Maniatis; John A Stamatoyannopoulos; Kristin Stewardson; Philipp Stockhammer; Ron Pinhasi; David Reich; Johannes Krause; George Stamatoyannopoulos
Journal:  Nature       Date:  2017-08-02       Impact factor: 49.962

9.  Ancient DNA from the skeletons of Roopkund Lake reveals Mediterranean migrants in India.

Authors:  Éadaoin Harney; Ayushi Nayak; Nick Patterson; Pramod Joglekar; Veena Mushrif-Tripathy; Swapan Mallick; Nadin Rohland; Jakob Sedig; Nicole Adamski; Rebecca Bernardos; Nasreen Broomandkhoshbacht; Brendan J Culleton; Matthew Ferry; Thomas K Harper; Megan Michel; Jonas Oppenheimer; Kristin Stewardson; Zhao Zhang; Maanwendra Singh Bartwal; Sachin Kumar; Subhash Chandra Diyundi; Patrick Roberts; Nicole Boivin; Douglas J Kennett; Kumarasamy Thangaraj; David Reich; Niraj Rai
Journal:  Nat Commun       Date:  2019-08-20       Impact factor: 14.919

10.  Genetic history from the Middle Neolithic to present on the Mediterranean island of Sardinia.

Authors:  Joseph H Marcus; Cosimo Posth; Harald Ringbauer; Luca Lai; Robin Skeates; Carlo Sidore; Jessica Beckett; Anja Furtwängler; Anna Olivieri; Charleston W K Chiang; Hussein Al-Asadi; Kushal Dey; Tyler A Joseph; Chi-Chun Liu; Clio Der Sarkissian; Rita Radzevičiūtė; Megan Michel; Maria Giuseppina Gradoli; Patrizia Marongiu; Salvatore Rubino; Vittorio Mazzarello; Daniela Rovina; Alessandra La Fragola; Rita Maria Serra; Pasquale Bandiera; Raffaella Bianucci; Elisa Pompianu; Clizia Murgia; Michele Guirguis; Rosana Pla Orquin; Noreen Tuross; Peter van Dommelen; Wolfgang Haak; David Reich; David Schlessinger; Francesco Cucca; Johannes Krause; John Novembre
Journal:  Nat Commun       Date:  2020-02-24       Impact factor: 14.919

View more
  2 in total

1.  Complete mitogenomes document substantial genetic contribution from the Eurasian Steppe into northern Pakistani Indo-Iranian speakers.

Authors:  Zia Ur Rahman; Jiao-Yang Tian; Zong-Liang Gao; Hao-Tian Wang; Wang-Xiao Xia; Bin-Yu Yang; Li-Qin Yang; Yu-Chun Li; Qing-Peng Kong
Journal:  Eur J Hum Genet       Date:  2021-02-26       Impact factor: 5.351

2.  Current pool of ultimate collection of mitochondrial DNA from remnants of Kalash.

Authors:  Muhammad Hassan Siddiqi; Allah Rakha; Khushbukhat Khan; Tanveer Akhtar
Journal:  Mitochondrial DNA B Resour       Date:  2021-07-15       Impact factor: 0.658

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.