| Literature DB >> 34411538 |
Toomas Kivisild1, Lehti Saag2, Ruoyun Hui3, Simone Andrea Biagini4, Vasili Pankratov5, Eugenia D'Atanasio6, Luca Pagani7, Lauri Saag5, Siiri Rootsi5, Reedik Mägi5, Ene Metspalu5, Heiki Valk8, Martin Malve8, Kadri Irdt5, Tuuli Reisberg9, Anu Solnik9, Christiana L Scheib10, Daniel N Seidman11, Amy L Williams11, Kristiina Tambets5, Mait Metspalu5.
Abstract
The Finnish population is a unique example of a genetic isolate affected by a recent founder event. Previous studies have suggested that the ancestors of Finnic-speaking Finns and Estonians reached the circum-Baltic region by the 1st millennium BC. However, high linguistic similarity points to a more recent split of their languages. To study genetic connectedness between Finns and Estonians directly, we first assessed the efficacy of imputation of low-coverage ancient genomes by sequencing a medieval Estonian genome to high depth (23×) and evaluated the performance of its down-sampled replicas. We find that ancient genomes imputed from >0.1× coverage can be reliably used in principal-component analyses without projection. By searching for long shared allele intervals (LSAIs; similar to identity-by-descent segments) in unphased data for >143,000 present-day Estonians, 99 Finns, and 14 imputed ancient genomes from Estonia, we find unexpectedly high levels of individual connectedness between Estonians and Finns for the last eight centuries in contrast to their clear differentiation by allele frequencies. High levels of sharing of these segments between Estonians and Finns predate the demographic expansion and late settlement process of Finland. One plausible source of this extensive sharing is the 8th-10th centuries AD migration event from North Estonia to Finland that has been proposed to explain uniquely shared linguistic features between the Finnish language and the northern dialect of Estonian and shared Christianity-related loanwords from Slavic. These results suggest that LSAI detection provides a computationally tractable way to detect fine-scale structure in large cohorts.Entities:
Keywords: Estonian Biobank; Finnish bottleneck; ancient DNA; community structure; identity by descent; imputation; long shared allele intervals; medieval Estonian genomes; population structure
Mesh:
Substances:
Year: 2021 PMID: 34411538 PMCID: PMC8456179 DOI: 10.1016/j.ajhg.2021.07.012
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025