| Literature DB >> 28754995 |
Tiroyamodimo Tau1, Anthony Wally2, Thokozile Patricia Fanie2, Goitseone Lorato Ngono2, Sununguko Wata Mpoloka3, Sean Davison1, María Eugenia D'Amato4.
Abstract
Population structure was investigated in 990 Botswana individuals according to ethno-linguistics, Bantu and Khoisan, and geography (the nine administrative districts) using the Identifiler autosomal microsatellite markers. Genetic diversity and forensic parameters were calculated for the overall population, and according to ethno-linguistics and geography. The overall combined power of exclusion (CPE) was 0.9999965412 and the combined match probability 6,28 × 10-19. CPE was highest for the Khoisan Tuu ethnolinguistic group and the Northeast District at 0.9999582029 and 0.9999922652 respectively. CMP ranged from 6.28 × 10-19 (Khoisan Tuu) to 1,02 × 10-18 (Northwest district). Using pairwise genetic distances (FST), analysis of molecular variance (AMOVA), factorial correspondence analysis (FCA), and the unsupervised Bayesian clustering method found in STRUCTURE and TESS, ethno-linguistics were found to have a greater influence on population structure than geography. FCA showed clustering between Bantu and Khoisan, and within the Bantu. This Bantu sub-structuring was not seen with STRUCTURE and TESS, which detected clustering only between Bantu and Khoisan. The patterns of population structure revealed highlight the need for regional reference databases that include ethno-linguistic and geographic location information. These markers have important potential for bio-anthropological studies as well as for forensic applications.Entities:
Mesh:
Year: 2017 PMID: 28754995 PMCID: PMC5533702 DOI: 10.1038/s41598-017-06365-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Botswana Administrative Districts and sampling locations made with R 3.2.4[79] (http://www.R-project.org) package ggplot2 (https://cran.r-project.org/web/packages/ggmap/citation.html)[80] and package ggmap (https://cran.r-project.org/web/packages/ggplot2/citation.html)[81].
Hierarchical and non-hierarchical analysis of molecular variance (AMOVA) between the different Botswana population groups according to ethno-linguistic (A) and geographic (B) heterogeneity.
| Groups | Source of variation | Variation (%) |
|
|
| |
|---|---|---|---|---|---|---|
|
| ||||||
|
| ||||||
| Test 1 (Non-hierarchical) | (1) Central-K Bantu(2) Bantu: Central-R Bantu(3) Bantu: Central-S Bantu(4) Khoe-Kwadi Khoisan(5) Kx’a Khoisan(6) Tuu Khoisan | Among populations | 3.30 | |||
| Within populations | 96.70 | 0.03301* | ||||
| Test 2 (Hierarchical) | (1) Bantu: Central-K + Central-R + Central-S (2) Khoisan: Khoe-Kwadi + Kx’a + Tuu | Among groups | 3.37 | 0.03375 | ||
| Among populations within groups | 1.00 | 0.01033* | ||||
| Within populations | 95.63 | 0.04373* | ||||
|
| ||||||
|
| ||||||
| Test 3 (Non-hierarchical) | (1) Central(2) Ghanzi(3) Kgalagadi(4) Kgatleng(5) Kweneng(6) North-east(7) North-west(8) Southern(9) South-east | Among populations | 1.54 | |||
| Within populations | 98.46 | 0.01544* | ||||
| Test 4 (Hierarchical) | (1) North: North-west + Ghanzi + Central + North-east (2) South: Kgalagadi + Kweneng + Kgatleng + Southern + South-east | Among groups | 0.12 | 0.00119 | ||
| Among populations within groups | 1.49 | 0.01488* | ||||
| Within populations | 98.39 | 0.01606* | ||||
*P < 0.001.
Figure 2(a) Factorial correspondence analysis (FCA) of the language subgroups of the Bantu (Central-K, -R and -S) and Khoisan (Tuu, Kx’a, and Khoe-Kwadi) speaking people of Botswana. (b) Factorial correspondence analysis (FCA) of the nine administrative districts of Botswana.
Figure 3STRUCTURE analysis of Botswana individuals with Identifiler assuming K = 2 to 6. Colours represent the inferred ancestry from K ancestral populations and vertical black lines indicate the nine administrative districts.
Figure 4Geographical representation of the admixture coefficients through spatial kriging with low (cool colours) to high (hot colours) representing mean (TESS) admixture proportions. Individuals are classed from the non-admixture analysis in TESS with different colour and/or shapes representing different clusters. The green represents the Bantu and the pink represents the Khoisan. Map created using R 3.0.3[72] (http://www.R-project.org) package spatial 7.3.971[73] and maps 2.3.972[74].
Summary results of assignment of individuals to Bantu and Khoisan ethno-linguistic population groups with WHICHRUN (A) and STRUCTURE (B).
| Population group | Assigned to | ||||
|---|---|---|---|---|---|
| Bantu | Khoisan | Not assigned (%) | Correctly assigned (%) | Error rate (%) | |
|
| |||||
| Bantu | 523 | 111 | 15.2 | 69.9 | 14.8 |
| Khoisan | 43 | 189 | 4.1 | 78.1 | 17.7 |
|
| |||||
| Bantu | 213 | 28 | 68.1 | 28.2 | 3.7 |
| Khoisan | 10 | 143 | 35.7 | 60.1 | 4.2 |
The tables show the count of individuals assigned to each ethno-linguistic population group and the proportion of not assigned (admixed), correctly assigned, and wrongly assigned individuals.
Figure 5Log likelihood ratios of assignment for all 990 Botswana individuals to Bantu and Khoisan ethno-linguistic groups. The cut off range for Bantu was ≥0.477 (log103) and for Khoisan it was <−0.477. The self-declared Bantu individuals are indicated in blue and the Khoisan in maroon. The samples that fall within the purple dashes represent admixed individuals.
Informativeness of loci for the inference of ancestry I .
| Locus |
|
|---|---|
|
| 0.154 |
|
| 0.130 |
|
| 0.111 |
|
| 0.100 |
|
| 0.093 |
|
| 0.091 |
|
| 0.075 |
|
| 0.073 |
|
| 0.067 |
|
| 0.059 |
|
| 0.053 |
|
| 0.045 |
|
| 0.044 |
|
| 0.044 |
|
| 0.036 |
|
| 0.078 |
|
| 0.033 |