| Literature DB >> 34002043 |
Morten Mattingsdal1,2, S Sunna Ebenesersdóttir3,4, Kristjan H S Moore3, Ole A Andreassen5,6, Thomas F Hansen7,8, Thomas Werge8,9,10, Ingrid Kockum11, Tomas Olsson11, Lars Alfredsson12, Agnar Helgason3,4, Kári Stefánsson3,13, Eivind Hovig14,15.
Abstract
The aim of the present study was to describe the genetic structure of the Norwegian population using genotypes from 6369 unrelated individuals with detailed information about places of residence. Using standard single marker- and haplotype-based approaches, we report evidence of two regions with distinctive patterns of genetic variation, one in the far northeast, and another in the south of Norway, as indicated by fixation indices, haplotype sharing, homozygosity, and effective population size. We detect and quantify a component of Uralic Sami ancestry that is enriched in the North. On a finer scale, we find that rates of migration have been affected by topography like mountain ridges. In the broader Scandinavian context, we detect elevated relatedness between the mid- and northern border areas towards Sweden. The main finding of this study is that despite Norway's long maritime history and as a former Danish territory, the region closest to mainland Europe in the south appears to have been an isolated region in Norway, highlighting the open sea as a barrier to gene flow into Norway.Entities:
Mesh:
Year: 2021 PMID: 34002043 PMCID: PMC8560852 DOI: 10.1038/s41431-021-00899-6
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Summary statistics per county.
| County | Abb | Median sum of ROH | Mean sum of IBD | Ne | Pop per km2 | Pop | Ne/pop | ||
|---|---|---|---|---|---|---|---|---|---|
| Østfold | OF | 388 | 200 | 5.5 | 5.7 | 396,000 | 56 | 221,386 | 1.79 |
| Akershus | AK | 1132 | 200 | 5 | 5.2 | 919,000 | 70 | 324,390 | 2.83 |
| Oslo | OS | 913 | 200 | 4.9 | 4.7 | 579,000 | 1127 | 481,548 | 1.20 |
| Hedmark | HE | 325 | 200 | 8 | 8.4 | 93,600 | 6 | 179,204 | 0.52 |
| Oppland | OP | 294 | 200 | 7.5 | 8.1 | 89,100 | 7 | 172,479 | 0.52 |
| Buskerud | BU | 388 | 200 | 5.6 | 7 | 204,000 | 14 | 198,852 | 1.03 |
| Vestfold | VE | 417 | 200 | 6 | 6.1 | 115,000 | 81 | 175,402 | 0.66 |
| Telemark | TE | 240 | 200 | 6.7 | 9.4 | 91,400 | 11 | 156,778 | 0.58 |
| Aust-Agder | AA | 152 | 152 | 8.2 | 10.2 | 118,000 | 9 | 80,839 | 1.46 |
| Vest-Agder | VA | 252 | 200 | 12 | 13.5 | 44,100 | 18 | 124,171 | 0.36 |
| Rogaland | RO | 225 | 200 | 8.4 | 14.2 | 27,600 | 31 | 268,682 | 0.10 |
| Hordaland | HO | 52 | 52 | 8.1 | 7.1 | 55,500 | 25 | 260,492 | 0.21 |
| Sogn og Fjordane | SF | 22 | 22 | 10.5 | 14.8 | 12,000 | 5 | 100,933 | 0.12 |
| Møre og Romsdal | MR | 187 | 187 | 7.8 | 9.6 | 270,000 | 15 | 223,709 | 1.21 |
| Sør-Trøndelag | ST | 1011 | 200 | 6.7 | 8.7 | 187,000 | 13 | 234,022 | 0.80 |
| Nord-Trøndelag | NT | 187 | 187 | 8.3 | 9.2 | 116,000 | 5 | 117,998 | 0.98 |
| Nordland | NO | 100 | 100 | 6.6 | 8 | 57,400 | 6 | 240,951 | 0.24 |
| Troms | TR | 54 | 54 | 8.8 | 11.5 | 25,600 | 5 | 136,805 | 0.19 |
| Finnmark | FI | 30 | 30 | 27 | 52.2 | 2600 | 2 | 39,757 | 0.07 |
| All | 6369 | 2984 | 6.8 | – | – | 12 | 3,888,305 | – |
N = the number of samples passing quality control. N* = the final number of random samples per county included in the final analysis, with max 200. Mean ROH = mean sum of Runs-of-Homozygosity in cM. Mean IBD = Mean within-county IBD sharing in cM. Ne = estimate of effective population size at g = 5 ago. Pop. size and pop. per km2 = census population size in 1970.
Fig. 1The genetic structure of Norway demonstrated by PCA and Fst values.
a, b PCA plots of LD pruned SNPs (102,023) color-coded by county. PC1 captures the Sami component and PC2 a southern component of distinctive drift. c Hierarchical clustering of Reich’s Fst values, using squared dissimilarities (ward.D2) presented as a phylogram. d Color-coded map of the counties in Norway.
Fig. 2Visual representation and hierarchical clustering of the mean cumulative sum of haplotype sharing (IBD > 3 cM) within and between counties in Norway, in centiMorgans (cM).
Overall, there is an increased relatedness within the counties (diagonal), and pronounced relatedness between counties form squares.
Fig. 3Contour plot of the cumulative sum of homozygous segments (cM) on the log10 scale detected by Beagle, extrapolated by spatial regression (Krig/fields).
The black dots represent jittered coordinates of zip codes, using 2984 individuals (max 200 per county). The ten most populous cities (>50,000 inhabitants) are marked with white squares. A continuous belt of elevated homozygosity stretches along with the interior, towards the southwestern coast.