| Literature DB >> 31036896 |
Choongwon Jeong1,2,3, Oleg Balanovsky4,5, Elena Lukianova4, Nurzhibek Kahbatkyzy6,7, Pavel Flegontov8,9, Valery Zaporozhchenko4,5, Alexander Immel10, Chuan-Chao Wang10,11, Olzhas Ixan6, Elmira Khussainova6, Bakhytzhan Bekmanov6,7, Victor Zaibert12, Maria Lavryashina13, Elvira Pocheshkhova14, Yuldash Yusupov15, Anastasiya Agdzhoyan4,5, Sergey Koshel16, Andrei Bukin17, Pagbajabyn Nymadawa18, Shahlo Turdikulova19, Dilbar Dalimova19, Mikhail Churnosov20, Roza Skhalyakho5, Denis Daragan5, Yuri Bogunov4,5, Anna Bogunova5, Alexandr Shtrunov5, Nadezhda Dubova21, Maxat Zhabagin22,23, Levon Yepiskoposyan24, Vladimir Churakov25, Nikolay Pislegin25, Larissa Damba26, Ludmila Saroyants27, Khadizhat Dibirova4,5, Lubov Atramentova28, Olga Utevska28, Eldar Idrisov29, Evgeniya Kamenshchikova5, Irina Evseeva30, Mait Metspalu31, Alan K Outram32, Martine Robbeets33, Leyla Djansugurova6,7, Elena Balanovska5, Stephan Schiffels10, Wolfgang Haak10, David Reich34,35, Johannes Krause36.
Abstract
The indigenous populations of inner Eurasia-a huge geographic region covering the central Eurasian steppe and the northern Eurasian taiga and tundra-harbour tremendous diversity in their genes, cultures and languages. In this study, we report novel genome-wide data for 763 individuals from Armenia, Georgia, Kazakhstan, Moldova, Mongolia, Russia, Tajikistan, Ukraine and Uzbekistan. We furthermore report additional damage-reduced genome-wide data of two previously published individuals from the Eneolithic Botai culture in Kazakhstan (~5,400 BP). We find that present-day inner Eurasian populations are structured into three distinct admixture clines stretching between various western and eastern Eurasian ancestries, mirroring geography. The Botai and more recent ancient genomes from Siberia show a decrease in contributions from so-called 'ancient North Eurasian' ancestry over time, which is detectable only in the northern-most 'forest-tundra' cline. The intermediate 'steppe-forest' cline descends from the Late Bronze Age steppe ancestries, while the 'southern steppe' cline further to the south shows a strong West/South Asian influence. Ancient genomes suggest a northward spread of the southern steppe cline in Central Asia during the first millennium BC. Finally, the genetic structure of Caucasus populations highlights a role of the Caucasus Mountains as a barrier to gene flow and suggests a post-Neolithic gene flow into North Caucasus populations from the steppe.Entities:
Mesh:
Year: 2019 PMID: 31036896 PMCID: PMC6542712 DOI: 10.1038/s41559-019-0878-2
Source DB: PubMed Journal: Nat Ecol Evol ISSN: 2397-334X Impact factor: 15.460
Fig. 1Geographic locations of the Eneolithic Botai site (red triangle), 65 groups including newly sampled individuals (filled diamonds) and nearby groups with published data (filled squares).
Mean latitude and longitude values across all individuals under each group label were used. Two zoom-in plots for the Caucasus (blue) and the Altai-Sayan (magenta) regions are presented in the lower left corner. A list of new groups, their three-letter codes, and the number of new individuals (in parenthesis) are provided at the bottom. Present-day populations are color-coded based on the language family for Figs. 1-3, following key codes listed in Fig. 2. Corresponding information for the previously published groups is provided in Supplementary Table 2. The map is overlayed with ecoregional information, divided into 14 biomes, downloaded from https://ecoregions2017.appspot.com/ (credited to Ecoregions 2017 © Resolve). The main inner Eurasian map is on the Albers equal area projection and was produced using the spTransform function in the R package rgdal v1.2-5.
Sequencing statistics and radiocarbon dates of two Eneolithic Botai individuals analyzed in this study.
For Botai individuals we produced additional data, we provide corresponding individual ID from a previous publication23 (“Published ID”), radiocarbon date, the number of total reads sequenced, mean autosomal coverage for the 1240K target sites, the number of SNPs covered at least once for the 1240K and HumanOrigins panels, uniparental haplogroup and contamination estimates.
| ID | Published ID | Genetic Sex | Uncal. | Cal. | # of reads sequenced | Mean autosomal coverage | # of SNPs covered | MT / Y haplogroup | MT.cont | X.cont |
|---|---|---|---|---|---|---|---|---|---|---|
| TU45 | BOT14 | M | 4620 ± 80 | 3632-3100 | 84,170,835 | 0.827x | 169,053 | K1b2 / R1b1a1 | 0.02 | 0.0122 |
| BKZ001 | BOT2016 | F | 4660 ± 25 | 3517-3367 | 69,678,735 | 2.420x | 825,332 | Z1 / NA | 0.01 | NA |
The uncalibrated date of TU45 was published in Levine (1999) under the ID OxA-431670.
The calibrated 14C dates are calculated based on uncalibrated dates, by the OxCal v4.3.2 program71 using the INTCAL13 atmospheric curve72.
The number of SNPs in the 1240K panel (out of 1,233,013) or autosomal SNPs in the HumanOrigins array (out of 581,230; within the parenthesis) covered at least by one read. Only transversion SNPs are considered for the non-UDG libraries (both of the TU45 libraries, one of two BKZ001 libraries).
The contamination rate of mitochondrial reads estimated by the Schmutzi program (95% confidence interval in parentheses)
The nuclear contamination rate for the male (TU45) estimated based on X chromosome data by ANGSD software (standard error in parentheses)
Fig. 2The genetic structure of inner Eurasian populations.
(a) The first two PCs of 2,077 Eurasian individuals separate western and eastern Eurasians (PC1) and Northeast and Southeast Asians (PC2). Most inner Eurasians are located between western and eastern Eurasians on PC1. Ancient individuals (color-filled shapes) are projected onto PCs calculated based on contemporary individuals. Present-day individuals are marked by grey dots, with their per-group mean coordinates marked by three-letter codes listed in Supplementary Table 2. Individuals are colored by their language family. (b) ADMIXTURE results for a chosen set of ancient and present-day groups (K = 14). The top row shows ancient inner Eurasians and representative present-day eastern Eurasians. The following three rows show forest-tundra, steppe-forest and southern steppe cline populations. Most inner Eurasians are modeled as a mixture of components primarily found in eastern or western Eurasians. Results for the full set of individuals are provided in Supplementary Fig. 3.
Fig. 3Correlation of longitude and ancestry proportion across inner Eurasian populations.
Across inner Eurasian populations, mean longitudinal coordinates (x-axis) and mean eastern Eurasian ancestry proportions (y-axis) are strongly correlated. Eastern Eurasian ancestry proportions are estimated from ADMIXTURE results with K=14 by summing up six components maximized in Surui, Chipewyan, Itelmen, Nganasan, Atayal and early Neolithic Russian Far East individuals (“Devil’s Gate”), respectively (Supplementary Fig. 3). The yellow curve shows a probit regression fit following the model in Sedghifar et al.69. Three groups (Kalmyks, Dungans, Nogai2) are marked with grey square due to their substantial deviation from the curve as well as their historically known migration history.
Fig. 4Characterization of the western and eastern Eurasian source ancestries in inner Eurasian populations.
(a) Admixture f values are compared for different eastern Eurasian references (Mixe, Nganasan, Ulchi; left) or western Eurasian ones (Srubnaya, Iran_ChL; right). For each target group, darker shades mark more negative f values. (b) Weights of donor populations in two sources characterizing the main admixture signal (“date 1 PC 1”) in the GLOBETROTTER analysis. We merged 167 donor populations into 12 groups, as listed on the top right side. Target populations are split into five groups: Aleuts, the forest-tundra cline populations, the steppe-forest cline populations, the southern steppe cline populations and the rest of four populations (“others”), from the top to bottom.
Fig. 5qpAdm-based admixture models for the forest-tundra and steppe-forest cline populations.
For the forest-tundra population to the west of the Urals, Nganasan+Srubnaya+WHG+LBK_EN or its submodel provides a good fit, while additional ANE-related contribution (AG3) is required for those to the east of the Urals (Enets, Selkups, Kets, and Mansi). For the steppe-forest populations, Srubnaya+Ulchi, Srubnaya+Ulchi+AG3, or Srubnaya+Nganasan provides a good fit. 5 cM jackknifing standard errors are marked by the horizontal bar. Models with p-value between 0.01 and 0.05 are marked by grey color and those with p-value < 0.01 are marked by grey color and italic font. Details of the model information are presented in Supplementary Tables 5 and 8.