| Literature DB >> 34294811 |
Adam B Rohrlach1,2, Luka Papac3, Ainash Childebayeva3, Maïté Rivollat3,4, Vanessa Villalba-Mouco3,5, Gunnar U Neumann3, Sandra Penske3, Eirini Skourtanioti3, Marieke van de Loosdrecht3, Murat Akar6, Kamen Boyadzhiev7, Yavor Boyadzhiev7, Marie-France Deguilloux4, Miroslav Dobeš8, Yilmaz S Erdal9, Michal Ernée8, Marcella Frangipane10, Mirosław Furmanek11, Susanne Friederich12, Emmanuel Ghesquière13,14, Agata Hałuszko11,15, Svend Hansen16, Mario Küßner17, Marcello Mannino18, Rana Özbal19, Sabine Reinhold16, Stéphane Rottier4, Domingo Carlos Salazar-García20,21,22, Jorge Soler Diaz23, Philipp W Stockhammer3,24, Consuelo Roca de Togores Muñoz23, K Aslihan Yener25, Cosimo Posth3,26, Johannes Krause3, Alexander Herbig3, Wolfgang Haak27,28.
Abstract
Uniparentally-inherited markers on mitochondrial DNA (mtDNA) and the non-recombining regions of the Y chromosome (NRY), have been used for the past 30 years to investigate the history of humans from a maternal and paternal perspective. Researchers have preferred mtDNA due to its abundance in the cells, and comparatively high substitution rate. Conversely, the NRY is less susceptible to back mutations and saturation, and is potentially more informative than mtDNA owing to its longer sequence length. However, due to comparatively poor NRY coverage via shotgun sequencing, and the relatively low and biased representation of Y-chromosome variants on capture assays such as the 1240 k, ancient DNA studies often fail to utilize the unique perspective that the NRY can yield. Here we introduce a new DNA enrichment assay, coined YMCA (Y-mappable capture assay), that targets the "mappable" regions of the NRY. We show that compared to low-coverage shotgun sequencing and 1240 k capture, YMCA significantly improves the mean coverage and number of sites covered on the NRY, increasing the number of Y-haplogroup informative SNPs, and allowing for the identification of previously undiscovered variants. To illustrate the power of YMCA, we show that the analysis of ancient Y-chromosome lineages can help to resolve Y-chromosomal haplogroups. As a case study, we focus on H2, a haplogroup associated with a critical event in European human history: the Neolithic transition. By disentangling the evolutionary history of this haplogroup, we further elucidate the two separate paths by which early farmers expanded from Anatolia and the Near East to western Europe.Entities:
Year: 2021 PMID: 34294811 PMCID: PMC8298398 DOI: 10.1038/s41598-021-94491-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The number of ISOGG SNPs covered per five million quality-filtered mapped reads (y-axis) for the same libraries (x-axis) for shotgun, 1240 k and YMCA sequencing (colours).
Figure 2The percentage of SNPs (y-axis) covered (up to three branches downstream) for four Y-chromosome haplogroups (x-axis) associated with ancient populations. Colours indicate assay SNPs targeted for 1240 k (green), 1240 k with a 120 bp window (blue) and our YMCA (orange). The dashed black line indicates at least half of the SNPs are represented, and the total number of targetable SNPs is given below each group.
Figure 3(a) Phylogenetic relationships (no branch length units) and (b) H2 sample locations (map created using ggmap, Kahle and Wickham 2013). Shapes and colours indicate the two major clades inferred from the phylogenetic tree. Symbol shading indicates early to late Neolithic (solid) or post-Neolithic (transparent). Black dots indicate all non-H2 Neolithic individuals from Freeman2020 to indicate H2 sampling prevalence[53]. Stars in haplogroup assignments in (a) indicate a lack of resolution to assign samples (not used in the ML tree) to downstream sub-haplogroups.
Figure 4Estimated time since most recent common ancestor (TMRCA) (y-axis) for each H2 subgroup with each other (facets), calibrated by the split time of ~ 163 kya of haplogroup A0 with all other Y haplogroups. All pairwise calculations are filtered to exclude individuals from the same sampling site. The dashed line indicates the mean estimate, and error bars indicate 95% confidence intervals for individual observations.