| Literature DB >> 33828095 |
Dhriti Sengupta1, Ananyo Choudhury1, Cesar Fortes-Lima2, Shaun Aron1, Gavin Whitelaw3,4, Koen Bostoen5, Hilde Gunnink5, Natalia Chousou-Polydouri6, Peter Delius7, Stephen Tollman8, F Xavier Gómez-Olivé8, Shane Norris9, Felistas Mashinya10, Marianne Alberts10, Scott Hazelhurst1,11, Carina M Schlebusch2,12,13, Michèle Ramsay14,15.
Abstract
South Eastern Bantu-speaking (SEB) groups constitute more than 80% of the population in South Africa. Despite clear linguistic and geographic diversity, the genetic differences between these groups have not been systematically investigated. Based on genome-wide data of over 5000 individuals, representing eight major SEB groups, we provide strong evidence for fine-scale population structure that broadly aligns with geographic distribution and is also congruent with linguistic phylogeny (separation of Nguni, Sotho-Tswana and Tsonga speakers). Although differential Khoe-San admixture plays a key role, the structure persists after Khoe-San ancestry-masking. The timing of admixture, levels of sex-biased gene flow and population size dynamics also highlight differences in the demographic histories of individual groups. The comparisons with five Iron Age farmer genomes further support genetic continuity over ~400 years in certain regions of the country. Simulated trait genome-wide association studies further show that the observed population structure could have major implications for biomedical genomics research in South Africa.Entities:
Mesh:
Year: 2021 PMID: 33828095 DOI: 10.1038/s41467-021-22207-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919