| Literature DB >> 30846778 |
Kai Tätte1,2, Luca Pagani3,4, Ajai K Pathak5,3, Sulev Kõks6,7, Binh Ho Duy8, Xuan Dung Ho9, Gazi Nurun Nahar Sultana10, Mohd Istiaq Sharif10, Md Asaduzzaman10, Doron M Behar3, Yarin Hadid11, Richard Villems5,3, Gyaneshwer Chaubey3,12, Toomas Kivisild3,13, Mait Metspalu14.
Abstract
Surrounded by speakers of Indo-European, Dravidian and Tibeto-Burman languages, around 11 million Munda (a branch of Austroasiatic language family) speakers live in the densely populated and genetically diverse South Asia. Their genetic makeup holds components characteristic of South Asians as well as Southeast Asians. The admixture time between these components has been previously estimated on the basis of archaeology, linguistics and uniparental markers. Using genome-wide genotype data of 102 Munda speakers and contextual data from South and Southeast Asia, we retrieved admixture dates between 2000-3800 years ago for different populations of Munda. The best modern proxies for the source populations for the admixture with proportions 0.29/0.71 are Lao people from Laos and Dravidian speakers from Kerala in India. The South Asian population(s), with whom the incoming Southeast Asians intermixed, had a smaller proportion of West Eurasian genetic component than contemporary proxies. Somewhat surprisingly Malaysian Peninsular tribes rather than the geographically closer Austroasiatic languages speakers like Vietnamese and Cambodians show highest sharing of IBD segments with the Munda. In addition, we affirmed that the grouping of the Munda speakers into North and South Munda based on linguistics is in concordance with genome-wide data.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30846778 PMCID: PMC6405872 DOI: 10.1038/s41598-019-40399-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The distribution of genetic components (K = 13) based on the global ADMIXTURE analysis (Supplementary Figs S1, S2, S3) for a subset of populations on a map of South and Southeast Asia. The circular legend in the bottom left corner shows the ancestral components corresponding to the colours on pie charts. The sector sizes correspond to population median.
Figure 2The plotted average counts of IBD segments up to 1 cM (short) and over 1 cM (long) shared with the Munda speakers. The points are coloured based on linguistics and geography according to the legend on the right.
Figure 3Admixture times as evaluated by ALDER. We let ALDER pair up populations from Southeast Asia and South Asia as several populations from either area were good proxies for the admixture event based on Refined IBD and f3 analyses. For accuracy, North Munda speaking Santhal, Ho, Korwa and Birhor were addressed separately as admixed populations; due to a small sample size South Munda speakers were treated as one population. Reference population pair was chosen based on LD decay curve amplitude. Standard errors are estimated by jackknifing on chromosomes. Generation length is 30 years[47]. For all the pairs, see Supplementary Table S7.
Figure 4A branch from a FineSTRUCTURE tree where all the Munda samples used in this analysis are situated on. Samples are coloured as follows: North Munda speakers – blue, South Munda speakers – red, samples from other populations than Munda – black.