| Literature DB >> 30479341 |
Thiseas C Lamnidis1, Kerttu Majander1,2,3, Choongwon Jeong1,4, Elina Salmela1,3, Anna Wessman5, Vyacheslav Moiseyev6, Valery Khartanovich6, Oleg Balanovsky7,8,9, Matthias Ongyerth10, Antje Weihmann10, Antti Sajantila11, Janet Kelso10, Svante Pääbo10, Päivi Onkamo12,13, Wolfgang Haak14, Johannes Krause1, Stephan Schiffels15.
Abstract
European population history has been shaped by migrations of people, and their subsequent admixture. Recently, ancient DNA has brought new insights into European migration events linked to the advent of agriculture, and possibly to the spread of Indo-European languages. However, little is known about the ancient population history of north-eastern Europe, in particular about populations speaking Uralic languages, such as Finns and Saami. Here we analyse ancient genomic data from 11 individuals from Finland and north-western Russia. We show that the genetic makeup of northern Europe was shaped by migrations from Siberia that began at least 3500 years ago. This Siberian ancestry was subsequently admixed into many modern populations in the region, particularly into populations speaking Uralic languages today. Additionally, we show that ancestors of modern Saami inhabited a larger territory during the Iron Age, which adds to the historical and linguistic information about the population history of Finland.Entities:
Mesh:
Year: 2018 PMID: 30479341 PMCID: PMC6258758 DOI: 10.1038/s41467-018-07483-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Sample information
| Individual ID | Site/ location | Date | Population label | Genetic Sex | # SNPs overlap with Human Origins | Avg. coverage on target (1240 K MapQ ≥ 30) | Nuclear contamination | mtDNA contamination | mtDNA, Y haplotypes |
|---|---|---|---|---|---|---|---|---|---|
| BOO001 | Bolshoy Oleni Ostrov, Murmansk, Russia | 3473 ± 87 calBP | Bolshoy | F | 347,709 | 1.22 | N/A | 0.001 (0.000–0.007) | U4a1 |
| BOO002 | Bolshoy | M | 403,994 | 0.81 | 0.004 ± 0.003 | 0.000 (0.000–0.002) | Z1a1a, N1c1a1a | ||
| BOO003 | Bolshoy | F | 300,598 | 1.09 | N/A | 0.016 (0.010–0.024) | T2d1b1 | ||
| BOO004 | Bolshoy | M | 342,582 | 2.92 | 0.002 ± 0.001 | 0.003 (0.000–0.059) | C4b, N1c1a1a | ||
| BOO005 | Bolshoy | F | 347,042 | 2.88 | N/A | 0.005 (0.001–0.026) | U5a1d | ||
| BOO006 | Bolshoy | F | 390,835 | 1.08 | N/A | 0.008 (0.004–0.014) | D4e4 | ||
| CHV001 | Chalmny- Varre, Murmansk, Russia | 18–19th cent CE | Chalmny Varre | F | 426,702 | 1.42 | N/A | 0.001 (0.000–0.003) | U5b1b1a3 |
| CHV002 | Chalmny Varre | M | 215,228 | 0.47 | 0.016 ± 0.011 | 0.000 (0.000–0.003) | V7a1, I2a1 | ||
| JK1963 | Levänluhta, Isokyrö, Finland | 300–800 CE | N/A | f | 10,492 | 0.02 | N/A | 0.004 (0.001–0.013) | N/A |
| JK1967 | N/A | f | 5267 | 0.01 | N/A | 0.000 (0.000–0.009) | N/A | ||
| JK1968 | Levänluhta | F | 207,076 | 0.47 | N/A | 0.004 (0.001–0.065) | U5a1a1a’b’n | ||
| JK1970 | Levänluhta | F | 194,764 | 0.44 | N/A | 0.023 (0.005–0.129) | U5a1a1 | ||
| JK2065 | Levänluhta_B | F | 133,224 | 0.29 | N/A | 0.028 (0.012–0.064) | K1a4a1b | ||
| JK2066 | N/A | ? | 4045 | 0.01 | N/A | 0.002 (0.001–0.007) | N/A | ||
| JK2067 | N/A | ? | 2356 | 0.01 | N/A | 0.010 (0.002–0.032) | N/A | ||
| Saami001 | Finland | Modern | Modern Saami | M | 593,094 | 6.32 | N/A | N/A | U5b1b1a1, I1a1b3a1 |
Summary information for all individuals for which we report genomic data in this study. A radiocarbon date is given for the Bolshoy samples, as described in Supplementary Text 1. Other dates are context-based, as described in Supplementary Text 1. Population labels for individuals of low coverage are shown as N/A. Genetic sex is determined as described in Methods, and lowercase letters denote probable genetic sex for low-coverage individuals. Question marks denote undefined genetic sex due to low coverage. Mitochondrial contamination estimates are given as posterior mode and 95% posterior intervals in brackets, as provided by ContamMix[70]
Fig. 1Location and age of archaeological sites used in this study. The location of other sites relevant to this study is also shown. Markers used correspond to the ones used in Fig. 2. Map generated with QGIS 2.18.19 (http://www.qgis.org/) using the Natural Earth country boundary dataset (http://www.naturalearthdata.com) for the basemap. Source data are provided as a Source Data file
Fig. 2PCA and ADMIXTURE analysis. a PCA plot of 113 Modern Eurasian populations, with individuals from this study and other relevant ancient genomes projected on the principal components, using the “shrinkmode: YES” option. Uralic-speaking populations are highlighted in dark purple. PCA of Europe can be found in Supplementary Figure 3. b Plot of ADMIXTURE (K = 11) results containing worldwide populations. Ancient individuals from this study (black box) are represented by thicker bars and shown in bold. For a figure of ADMIXTURE results over multiple K values see Supplementary Figure 4a. Source data are provided as a Source Data file
Fig. 3Calculated f4 (Mbuti, Nganasan; Lithuanian, Test). Modern populations are sorted by f4 magnitude; ancient populations are sorted through time. Populations are coloured based on language family, with Ancient individuals with unknown language shown in blue. “AllSaami” refers to a grouping including the 2 individuals from the SGDP (Saami(SGDP)) and the high-coverage modern Saami shotgun genome in this study (Modern Saami). Individuals from this study are indicated by labels in bold. Error bars represent 3 standard errors, to indicate significant difference from 0. Source data are provided as a Source Data file
Fig. 4Mixture proportions from five sources estimated using qpAdm. Sources used were Nganasan, WHG, EHG, Yamnaya and LBK (see Methods/Supplementary Data 4). P-values (chi-square) for each model are shown in square brackets next to the test population. Results from the least complex model for each test population/individual are shown. “AllSaami” corresponds to a population consisting of the two genomes from the SGDP and the genome from this study (ModernSaami). Error bars represent one standard error and are plotted to the right of their associated mixture proportion. Populations containing individuals from this study are shown in bold. Source data are provided as a Source Data file
Fig. 5Dating the introduction of Siberian ancestry using ALDER. a ALDER-inferred admixture dates (filled circles) for different populations, using Nganasan and Lithuanian as sources. Available dates for ancient populations are shown in white diamonds. Error bars represent one standard error provided by ALDER and include the uncertainty surrounding the dating of ancient population samples, calculated using standard propagation. Populations containing individuals from this study are shown in bold. b LD decay curve for Bolshoy, using Nganasan and EHG as sources. The fitted trendline considers a minimum distance of 1 cM. A full set of LD decay plots can be found in Supplementary Figure 5. Source data are provided as a Source Data file