| Literature DB >> 31481615 |
Edmund Gilbert1,2, Seamus O'Reilly3, Michael Merrigan3, Darren McGettigan3, Veronique Vitart4, Peter K Joshi5, David W Clark5, Harry Campbell5, Caroline Hayward4, Susan M Ring6,7, Jean Golding8, Stephanie Goodfellow9, Pau Navarro4, Shona M Kerr4, Carmen Amador4, Archie Campbell10, Chris S Haley4,11, David J Porteous10, Gianpiero L Cavalleri1,2, James F Wilson12,5.
Abstract
Britain and Ireland are known to show population genetic structure; however, large swathes of Scotland, in particular, have yet to be described. Delineating the structure and ancestry of these populations will allow variant discovery efforts to focus efficiently on areas not represented in existing cohorts. Thus, we assembled genotype data for 2,554 individuals from across the entire archipelago with geographically restricted ancestry, and performed population structure analyses and comparisons to ancient DNA. Extensive geographic structuring is revealed, from broad scales such as a NE to SW divide in mainland Scotland, through to the finest scale observed to date: across 3 km in the Northern Isles. Many genetic boundaries are consistent with Dark Age kingdoms of Gaels, Picts, Britons, and Norse. Populations in the Hebrides, the Highlands, Argyll, Donegal, and the Isle of Man show characteristics of isolation. We document a pole of Norwegian ancestry in the north of the archipelago (reaching 23 to 28% in Shetland) which complements previously described poles of Germanic ancestry in the east, and "Celtic" to the west. This modern genetic structure suggests a northwestern British or Irish source population for the ancient Gaels that contributed to the founding of Iceland. As rarer variants, often with larger effect sizes, become the focus of complex trait genetics, more diverse rural cohorts may be required to optimize discoveries in British and Irish populations and their considerable global diaspora.Entities:
Keywords: fine-scale structure; history; migration; population genetics
Mesh:
Substances:
Year: 2019 PMID: 31481615 PMCID: PMC6754546 DOI: 10.1073/pnas.1904761116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.A comprehensive description of population structure across Britain and Ireland. (A) The dendrogram with each branch as one k = 65 final cluster identified by fineSTRUCTURE. Some clusters have been merged, resulting in k = 43 clusters. Cluster membership count is shown in parentheses. Merged clusters are indicated throughout by the same colored shape and merge label. (B) The average geographic position of 2,429 British and Irish individuals’ ancestors’ birthplaces, with cluster membership shown. The ancestry information available for some individuals’ ancestral place of birth are regional in resolution; therefore, a random jitter was introduced to these individuals’ positions. The exit of the Firth of Forth is indicated by the red arrow. The administrative boundaries of mainland Britain and Ireland were sourced from the R package, rworldxtra. (C) The genetic space positions of 2,544 British and Irish individuals as calculated in t-distributed stochastic neighbor embedding t-SNE analysis of the coancestry matrix obtained by ChromoPainter. Shown plotted are the first and second t-SNE dimensions. All plots were generated in R with the ggplots2 package.
Fig. 2.EEMS of Britain and Ireland. Shown are the posterior mean migration rates of 10 independent EEMS analytical runs (m, on a log10 scale). The image was produced using R and the rEEMSplots package, and the administrative boundaries and outline were sourced from the R package rworldxtra. Many strong barriers to migration can be seen in orange shades in Scotland, Wales, and Ireland.
Fig. 3.Norwegian Ancestry across Britain and Ireland. (A) Ancestry proportions of a supervised ADMIXTURE analysis, using 625 English, 130 Welsh, and 893 Norwegians as references and select fineSTRUCTURE clusters as test individuals. (B) The mean total genome-wide proportion of haplotype contributions of reference British and Scandinavian populations to tested British and Irish fineSTRUCTURE clusters. (C) The mean genome-wide proportions of haplotype contributions of reference British and Scandinavian reference clusters (y axis) that contribute ≥5% of the total proportion to any one individual target cluster (x axis). Error bars indicate the 95% confidence intervals calculated from the 200 sampled iterations of the SOURCEFIND analysis.
Fig. 4.Shared drift between ancient Gaels and modern British and Irish populations. The D statistics of ancient predominantly Gaelic (n = 7) and predominantly Norse (n = 10) Icelanders, comparing affinity to either modern British or Irish genetic regions and modern Scandinavia. A negative D statistic indicates more shared drift with Britain or Ireland, and a positive value indicates more shared drift with modern Scandinavia. Shown are the estimates with SEs (Left), with estimates with a |Z| > 3 shown as filled circles and |Z| < 3 shown as hollow circles. We also show the values of the ancient Gaelic Icelander D-statistic estimates for each modern British or Irish region mapped to the general geographic position of that cluster(s). This was plotted in the statistical computing language R (39) and the packages ggplots2 and rworldxtra.