| Literature DB >> 27708455 |
Jens Kandt1, James A Cheshire1, Paul A Longley1.
Abstract
Following the increasing availability of DNA-sequenced data, the genetic structure of populations can now be inferred and studied in unprecedented detail. Across social science, this innovation is shaping new bio-social research agendas, attracting substantial investment in the collection of genetic, biological and social data for large population samples. Yet genetic samples are special because the precise populations that they represent are uncertain and ill-defined. Unlike most social surveys, a genetic sample's representativeness of the population cannot be established by conventional procedures of statistical inference, and the implications for population-wide generalisations about bio-social phenomena are little understood. In this paper, we seek to address these problems by linking surname data to a censored and geographically uneven sample of DNA scans, collected for the People of the British Isles study. Based on a combination of global and local spatial correspondence measures, we identify eight regions in Great Britain that are most likely to represent the geography of genetic structure of Great Britain's long-settled population. We discuss the implications of this regionalisation for bio-social investigations. We conclude that, as the often highly selective collection of DNA and biomarkers becomes a more common practice, geography is crucial to understanding variation in genetic information within diverse populations.Entities:
Keywords: Great Britain; cluster analysis; geodemographics; population genetics; regional geography; surnames
Year: 2016 PMID: 27708455 PMCID: PMC5032893 DOI: 10.1111/tran.12131
Source DB: PubMed Journal: Trans Inst Br Geogr ISSN: 0020-2754
Figure 1The ‘POBI’ sample locations (left) and population density of ‘rural’ parishes in 1881 (right)
The reference population after application of the four inclusion criteria
| Population | Surnames | Parishes | |
|---|---|---|---|
| Total | 29 912 298 | 518 153 | 7203 |
| With local surname | 27 213 993 | 61 286 | 7203 |
| With local surname and rural | 18 692 871 | 57 511 | 6848 |
aPopulation registers are not complete. bSome parishes have been grouped to meet a minimum population threshold of 750 people.
Figure 2Dendrogram of 1881 rural parish groups of Great Britain classified by isonymy (left) and Adjusted Rand Index values of agreement between l genetic clusters and k isonymy groups (right). Each line represents a genetic cluster solution with l ϵ {2..53}
Figure 3Isonymy groups (left) and the geographic distribution of genetic clusters (right). (a) k = 3 isonymy groups and l = 3 genetic clusters with R = .605. (b) k = 7 isonymy groups and l = 11 genetic clusters with R = .284. (c) k = 20 isonymy groups and l = 15 genetic clusters with R = .158
Figure 4The distinctiveness (left), dominance (middle) and regional integrity (right) indices for each combination with high global correspondence. (a) k = 3 isonymy groups and l = 3 genetic clusters. (b) k = 7 isonymy groups and l = 11 genetic clusters. (c) k = 20 isonymy groups and l = 15 genetic clusters
Figure 5The synthesised regions of population structure in Great Britain (left) and their corresponding regional integrities (right)