| Literature DB >> 32697300 |
Thijessen Naidoo1,2,3,4, Jingzi Xu1, Mário Vicente1, Helena Malmström1,5, Himla Soodyall6,7,8, Mattias Jakobsson1,3,5, Carina M Schlebusch1,3,5.
Abstract
Although the human Y chromosome has effectively shown utility in uncovering facets of human evolution and population histories, the ascertainment bias present in early Y-chromosome variant data sets limited the accuracy of diversity and TMRCA estimates obtained from them. The advent of next-generation sequencing, however, has removed this bias and allowed for the discovery of thousands of new variants for use in improving the Y-chromosome phylogeny and computing estimates that are more accurate. Here, we describe the high-coverage sequencing of the whole Y chromosome in a data set of 19 male Khoe-San individuals in comparison with existing whole Y-chromosome sequence data. Due to the increased resolution, we potentially resolve the source of haplogroup B-P70 in the Khoe-San, and reconcile recently published haplogroup A-M51 data with the most recent version of the ISOGG Y-chromosome phylogeny. Our results also improve the positioning of tentatively placed new branches of the ISOGG Y-chromosome phylogeny. The distribution of major Y-chromosome haplogroups in the Khoe-San and other African groups coincide with the emerging picture of African demographic history; with E-M2 linked to the agriculturalist Bantu expansion, E-M35 linked to pastoralist eastern African migrations, B-M112 linked to earlier east-south gene flow, A-M14 linked to shared ancestry with central African rainforest hunter-gatherers, and A-M51 potentially unique to the Khoe-San.Entities:
Keywords: Khoe-San; Y chromosome; haplogroups; next-generation sequencing; southern Africa
Mesh:
Year: 2020 PMID: 32697300 PMCID: PMC7375190 DOI: 10.1093/gbe/evaa098
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Population Information and Y-Chromosome Haplogroups of Individuals Used in the Study
| Ref. | Sample Code | Population | AMY-Tree Haplogroup | AMY-Tree Marker | ISOGG Haplogroup |
|---|---|---|---|---|---|
| 1 | KSP103 | Ju|’hoansi | A2a1b1 | A-M114 | A1b1a1a2a |
| 1 | KSP154 | !Xun | A2a1b1 | A-M114 | A1b1a1a2a |
| 1 | KSP116 | Ju|’hoansi | A2a1b2 | A-P262 | A1b1a1a2b |
| 1 | KSP139 | Nama | A3b1a | A-P71 | A1b1b2a1a |
| 1 | KSP124 | Nama | A3b1a | A-P71 | A1b1b2a1a1 |
| 1 | KSP140 | Nama | A3b1a | A-P71 | A1b1b2a1a1 |
| 1 | KSP105 | Ju|’hoansi | A3b1c | A-V306 | A1b1b2a1b |
| 1 | KSP106 | Ju|’hoansi | A3b1c | A-V306 | A1b1b2a1b |
| 1 | KSP146 | !Xun | A3b1b | A-V37 | A1b1b2a2 |
| 1 | KSP150 | !Xun | A3b1b | A-V37 | A1b1b2a2 |
| 4 | PygmyBaka1 | Baka | B2b1a1b | B-M8035 | B2b1a1c2a1∼ |
| 4 | PygmyBaka3 | Baka | B2b1a1b | B-M8035 | B2b1a1c2a2∼ |
| 4 | Hadza2 | Hadza | B2b1a2 | B-M7583 | B2b1a2a1∼ |
| 4 | Hadza5 | Hadza | B2b1a2 | B-M7583 | B2b1a2a2∼ |
| 4 | Sandawe5 | Sandawe | B2b1a2 | B-M7583 | B2b1a2b1∼ |
| 4 | Hadza3 | Hadza | B2b1a2 | B-M7583 | B2b1a2b2a∼ |
| 1 | KSP111 | Ju|’hoansi | B2b1a2 | B-M7583 | B2b1a2b2b∼ |
| 1 | KSP155 | !Xun | B2b1a2 | B-M7583 | B2b1a2b2b∼ |
| 4 | Sandawe4 | Sandawe | B2b1b* | B-M7104 | B2b1b∼ |
| 4 | Hadza4 | Hadza | B2b1b* | B-M7104 | B2b1b1b∼ |
| 2 | NA19429 | Luhya | E1b1a1a1a | E-M58 | E1b1a1a1a1a3a∼ |
| 4 | Hadza1 | Hadza | E1b1a1a1f1a1c* | E-P116 | E1b1a1a1a1c1a1a3a1a1a2b∼ |
| 4 | PygmyBakola1 | Bakola | E1b1a1a1f1a1c* | E-P116 | E1b1a1a1a1c1a1a3a1a1b∼ |
| 1 | KSP063 | Karretjie | E1b1a1a1f1a1d | E-CTS8030 | E1b1a1a1a1c1a1a3a1c1 |
| 2 | NA19443 | Luhya | E1b1a1a1f1a1* | E-U174 | E1b1a1a1a1c1a1a3a1c1a1a∼ |
| 2 | NA19428 | Luhya | E1b1a1a1f1a1d | E-CTS8030 | E1b1a1a1a1c1a1a3a1d1b1a1∼ |
| 4 | PygmyBaka2 | Baka | E1b1a1a1f1a1d | E-CTS8030 | E1b1a1a1a1c1a1a3a1g∼ |
| 2 | NA18504 | Yoruba | E1b1a1a1f1a1* | E-U174 | E1b1a1a1a1c1a1a3c1b1a1∼ |
| 2 | NA18507 | Yoruba | E1b1a1a1f1a1* | E-U174 | E1b1a1a1a1c1a1a3d3∼ |
| 3 | NA19834 | Afr. American | E1b1a1a1f1a1* | E-U174 | E1b1a1a1a1c1a1a3d6a |
| 1 | KSP067 | Karretjie | E1b1a1a1f1a1* | E-U174 | E1b1a1a1a1c1a1a3e1∼ |
| 2 | NA18501 | Yoruba | E1b1a1a1g1* | E-U209 | E1b1a1a1a2a1a1b∼ |
| 2 | NA18498 | Yoruba | E1b1a1a1g1* | E-U209 | E1b1a1a1a2a1a2a1∼ |
| 1 | KSP092 | |Gui and ‖Gana | E1b1a1a1g1a1 | E-U181 | E1b1a1a1a2a1a3b1a1 |
| 1 | KSP096 | |Gui and ‖Gana | E1b1a1a1g1a1 | E-U181 | E1b1a1a1a2a1a3b1a1 |
| 2 | NA18871 | Yoruba | E1b1a1a1g1a* | E-U290 | E1b1a1a1a2a1a3b1a10a∼ |
| 3 | NA19703 | Afr. American | E1b1a1a1g1a1 | E-U181 | E1b1a1a1a2a1a3b1a1b∼ |
| 1 | KSP069 | Karretjie | E1b1a1a1g1a2 | E-Z1725 | E1b1a1a1a2a1a3b1a2a2 |
| 2 | NA19397 | Luhya | E1b1a1a1g1a2 | E-Z1725 | E1b1a1a1a2a1a3b1a2a2a1a∼ |
| 3 | NA19700 | Afr. American | E1b1a1a1g1a* | E-U290 | E1b1a1a1a2a1a3b1a5∼ |
| 4 | Sandawe1 | Sandawe | E1b1a1a1g1* | E-U209 | E1b1a1a1a2a1a3b1d1c1a1b∼ |
| 4 | PygmyBedzan1 | Bedzan | E1b1a1a1g1* | E-U209 | E1b1a1a1a2a1a3b1d1d∼ |
| 3 | NA21732 | Maasai | E1b1b1a3c | E-AM00003 | E1b1b1a1b2a3 |
| 3 | NA21737 | Maasai | E1b1b1a3c | E-AM00003 | E1b1b1a1b2a3 |
| 1 | KSP137 | Nama | E1b1b1d* | E-M293 | E1b1b1b2b2a1 |
| 1 | KSP152 | !Xun | E1b1b1d* | E-M293 | E1b1b1b2b2a1a1c∼ |
| 4 | Sandawe2 | Sandawe | E1b1b1d* | E-M293 | E1b1b1b2b2a1a1c∼ |
| 4 | Sandawe3 | Sandawe | E1b1b1d* | E-M293 | E1b1b1b2b2a1a1d∼ |
1, this study; 2, Auton et al. (2015); 3, Drmanac et al. (2010); 4, Lachance et al. (2012).
*designates a Y chromosome paragroup
Fig. 1.Y-chromosome phylogeny of the 48 individuals in the data set. Populations and haplogroups are shown on the right of the tree. Refer to supplementary table S1, Supplementary Material online, for additional information for each branch (numbered) in the phylogeny, including numbers of variants per branch and the lists of variants defining each branch. Branch lengths in the figure are representative of topology, and do not reflect TMRCA estimates nor the number of variants per branch.
TMRCA Estimates of Y-Chromosome Haplogroups Found in the Study
| TMRCA Estimate (ka) |
| ||||
|---|---|---|---|---|---|
| Mutation rate (mutations/bp/year) | 0.74×10−9 | 0.87×10−9 | 0.82×10−9a | ||
| Haplogroup | Median | 95% HPD | Median | 95% HPD | Median |
| A1b | 169,018 | 133,711–216,880 | 146,952 | 116,496–185,147 | 193,000 |
| A1b1a1a2 | 12,507 | 7,829–18,812 | 10,745 | 7,119–15,375 | 33,000 |
| A1b1b2a | 47,966 | 33,936–66,832 | 40,981 | 29,632–54,679 | 64,000 |
| A1b1b2a1 | 35,572 | 23,600–52,078 | 30,342 | 20,532–41,834 | |
| A1b1b2a1a | 6,933 | 4,685–9,705 | 5,917 | 3,998–8,207 | |
| A1b1b2a1b | 8,267 | 4,620–13,098 | 7,045 | 4,018–10,887 | |
| A1b1b2a2 | 1,669 | 763–2,996 | 1,410 | 655–2,415 | |
| B2b1 | 50,073 | 40,106–61,847 | 42,908 | 34,338–52,537 | 62,000 |
| B2b1a | 46,948 | 37,249–58,643 | 40,260 | 32,103–49,891 | |
| B2b1a1c2a | 143 | 2–524 | 120 | 2–368 | |
| B2b1a2 | 34,798 | 26,491–44,885 | 29,731 | 22,676–37,405 | |
| B2b1a2a | 734 | 245–1,502 | 622 | 217–1,240 | |
| B2b1a2b | 28,981 | 20,801–38,357 | 24,766 | 18,283–31,989 | |
| B2b1a2b2 | 25,557 | 17,459–34,967 | 21,819 | 15,397–28,878 | |
| B2b1b | 44,411 | 33,525–57,120 | 38,086 | 28,982–48,180 | |
| E1b1a1a1a1c1a1a3 | 14,332 | 11,338–18,158 | 12,253 | 10,133–14,921 | 22,000 |
| E1b1a1a1a2a1a | 9,358 | 7,418–11,582 | 8,078 | 6,512–9,769 | 15,000 |
| E1b1b | 30,063 | 19,813–42,157 | 25,468 | 17,560–35,183 | 21,000 |
| E1b1b1b2b2a1 | 7,376 | 5,088–10,121 | 6,268 | 4,390–8,472 | |
Mutation rate from Poznik et al. (2013).