Rocío Gómez1, Miguel G Vilar2,3, Marco Antonio Meraz-Ríos4, David Véliz5,6, Gerardo Zúñiga7, Esther Alhelí Hernández-Tobías8, Maria Del Pilar Figueroa-Corona4, Amanda C Owings2, Jill B Gaieski2, Theodore G Schurr2. 1. Departamento de Toxicología, CINVESTAV-IPN, Mexico City 07360, Mexico. 2. Department of Anthropology, University of Pennsylvania, Philadelphia, PA 19104-6398, USA. 3. National Geographic Society, Washington, DC 20005, USA. 4. Departmento de Biomedicina Molecular, CINVESTAV-IPN, Mexico City 07360, Mexico. 5. Departamento de Ciencias Ecológicas, Instituto de Ecología y Biodiversidad, Facultad de Ciencias, Universidad de Chile, Santiago 7800003, Chile. 6. Núcleo Milenio de Ecología y Manejo Sustentable de Islas Oceánicas, Departamento de Biología Marina, Universidad Católica del Norte, Coquimbo 1781421, Chile. 7. Departamento de Zoología, Laboratorio de Variación Biológica y Evolución, Escuela Nacional de Ciencias Biológicas, Instituto Politécnico Nacional, Mexico City 11340, Mexico. 8. Universidad Autónoma de Nuevo León, Facultad de Salud Pública y Nutrición, Monterrey, Nuevo León 64460, Mexico.
Abstract
Native Mexican populations are crucial for understanding the genetic ancestry of Aztec descendants and coexisting ethnolinguistic groups in the Valley of Mexico and elucidating the population dynamics of the prehistoric colonization of the Americas. Mesoamerican societies were multicultural in nature and also experienced significant admixture during Spanish colonization of the region. Despite these facts, Native Mexican Y chromosome diversity has been greatly understudied. To further elucidate their genetic history, we conducted a high-resolution Y chromosome analysis with Chichimecas, Nahuas, Otomies, Popolocas, Tepehuas, and Totonacas using 19 Y-short tandem repeat and 21 single nucleotide polymorphism loci. We detected enormous paternal genetic diversity in these groups, with haplogroups Q-MEH2, Q-M3, Q-Z768, Q-L663, Q-Z780, and Q-PV3 being identified. These data affirmed the southward colonization of the Americas via Beringia and connected Native Mexicans with indigenous populations from South-Central Siberia and Canada. They also suggested that multiple population dispersals gave rise to Y chromosome diversity in these populations.
Native Mexican populations are crucial for understanding the genetic ancestry of Aztec descendants and coexisting ethnolinguistic groups in the Valley of Mexico and elucidating the population dynamics of the prehistoric colonization of the Americas. Mesoamerican societies were multicultural in nature and also experienced significant admixture during Spanish colonization of the region. Despite these facts, Native Mexican Y chromosome diversity has been greatly understudied. To further elucidate their genetic history, we conducted a high-resolution Y chromosome analysis with Chichimecas, Nahuas, Otomies, Popolocas, Tepehuas, and Totonacas using 19 Y-short tandem repeat and 21 single nucleotide polymorphism loci. We detected enormous paternal genetic diversity in these groups, with haplogroups Q-MEH2, Q-M3, Q-Z768, Q-L663, Q-Z780, and Q-PV3 being identified. These data affirmed the southward colonization of the Americas via Beringia and connected Native Mexicans with indigenous populations from South-Central Siberia and Canada. They also suggested that multiple population dispersals gave rise to Y chromosome diversity in these populations.
A geographic area known for its abundant resources, the Valley of Mexico (VM) was the homeland for great ancient civilizations such as Teotihuacan, Tula, and the Aztec Empire. Before the arrival of the Aztecs, the VM was occupied by Otomies (Hñähñús, as they self-identify), Toltecs (the first Nahuatl speakers to reside there), Chichimecas (Éza'r, as they self-identify, singular Úza’), Totonacas, Tepehuas, and Popolocas, among other ethnic groups (Lastra, 2006; Matos-Moctezuma, 2012; Nichols and Rodríguez-Alegría, 2017; Soustelle, 1993). These populations experienced different migratory displacements before settling in the VM, and intermarrying with their neighbors, thereby diversifying their gene pools (Instituto Nacional de los Pueblos Indígenas, 2019; Matos-Moctezuma, 2012). The Aztecs, an amalgam of different peoples including the Mexicas (seven Nahuatl-speaking migrant tribes from Aztlan), Tepanecas (from Tlacopan and Tacuba), and Alcohuas (from Texcoco), also intermarried with resident populations to establish political and military alliances (Berdan et al., 1996). Through this strategy, the Aztecs obtained supreme power in Mesoamerica and, in the process, created a culturally, linguistically, and ethnically diverse empire (Nichols and Rodríguez-Alegría, 2017).Considered to be contemporary descendants of the Aztecs, Nahuas are the largest indigenous group of Mexico. Since the 13th century CE, they have occupied the VM, a region where more than 30% of Amerindian populations currently live (Instituto Nacional de los Pueblos Indígenas, 2019). These populations are comprised by more than 7 million people who speak languages from different linguistic families, with many of them remaining geographically isolated and deeply rooted to their pre-Hispanic past (Instituto Nacional de Estadística, Geografía e Informática, 2019). Despite their present-day population sizes and sustained cultural practices, these populations lost a significant proportion of their indigenous genetic lineages during the last five centuries of European colonization and Mexican state rule (Sandoval et al., 2012). Thus high-resolution genetic analysis with single nucleotide polymorphism (SNP) and short tandem repeat (STR) loci located in the non-recombining region of the Y chromosome (NRY) is crucial for reconstructing the genetic history of Aztec descendant populations.Haplogroups C and Q are the main paternal lineages found in Native Americans, being derived from ancestral lineages in Native Siberian populations (Schurr and Sherry, 2004). Haplogroup Q is the most prevalent and ancient founding lineage in the New World, having diverged into at least four different sublineages over the past 20,000 years, including Q-MEH2 (Q1a), Q-M346 (Q1b), Q-L54 (Q1b1a), and Q-M3 (Q1b1a1a) (Dulik et al., 2012a, 2012b; Malyarchuk et al., 2011). Although all four sublineages have been observed in the Americas, the latter two are more common in Central and South America (Battaglia et al., 2013; Dulik et al., 2012a, 2012b; Malyarchuk et al., 2011; Regueiro et al., 2013; Schurr and Sherry, 2004). By contrast, haplogroup C3 occurs at low frequencies in indigenous North American populations (Malyarchuk et al., 2011; Pinotti et al., 2019; Roewer et al., 2013), excepting a different C3 sublineage observed in South American populations (Jota et al., 2016; Roewer et al., 2013). It derives from an ancient East Asian lineage that has expanded throughout northeast Siberia over the past several millennia (Dulik et al., 2012a, 2012b; Jota et al., 2016; Malyarchuk et al., 2010, 2012; Wei et al., 2017).The multi-ethnic nature of contemporary Native Mexicans provides a unique opportunity to explore their genetic history, which is crucial for expanding our knowledge about the initial colonization of the New World. The genetic features of these groups have been briefly explored and consequently remain unclear (Battaglia et al., 2013; Dulik et al., 2012a, 2012b; Mata-Miguez et al., 2012; Regueiro et al., 2013; Roewer et al., 2013; Sandoval et al., 2012; Vargas-Alarcon et al., 2007). Hence, in this study, we characterized Y chromosome diversity in populations speaking languages belonging to three linguistic families (Oto-Manguean, Totonaca-Tepehua, and Uto-Aztecan) that coexisted during the Aztec empire and presently occupy the VM in an effort to better understand the formation of these ethnolinguistic groups.Our results reveal a complex array of paternal lineages that appear to have arrived or expanded in the VM at different times. Comparison of Native Mexican data with those from populations of Asia and North, Central, and South America further reveals connections between Aztlan descendants and South-Central Siberians and indigenous populations from both Canada and South America. These findings affirm the southward colonization route for ancestral Native Americans and support the view that Mesoamerica played a crucial role in the diversification of indigenous Y chromosomes in the Americas.
Results
Frequency and diversity of Native Mexican paternal lineages
We traced the patrilineal ancestry of 231 unrelated men belonging to Oto-Manguean-, Totonaca-Tepehua-, and Uto-Aztecan-speaking populations (Table S1). Based on the analysis of 19 STR and 21 SNP loci, 175 (75.8%) male participants exhibited Amerindian Y chromosomes and 56 (24.2%) had non-native Y chromosomes (Table S2). Our study populations also exhibited a remarkably high proportion of Native American paternal lineages relative to Mestizo populations subjected to similar analyses (Gonzalez-Sobrino et al., 2016; Lopez-Ramirez et al., 2020; Santana et al., 2014). All chromosomes were assigned to paternal lineages or haplogroups based on variants at these loci.Of the indigenous haplogroups seen in Native Mexicans, Q-M3 was the most prominent (79.4%), followed by Q-L54 (18.3%) and then Q-MEH2 (2.3%) (Figure 1). SNP genotyping further revealed Q-M3 to contain subhaplogroup Q-Z768 (12.9%), which was present in Éza'r, Hñähñús, Nahuas, and Tepehuas, and subhaplogroup Q-L663 (0.8%), which appeared in only Hñähñús from Hidalgo (HGO), with the remainder (86.3%) being defined by only the Q-M3 SNP. Haplogroup Q-L54 was primarily composed by subhaplogroup Q-Z780 (92.8%), which appeared in all populations except for the Popolocas, and subhaplogroup Q-PV3 (7.1%), which occurred only in the Nahuas from San Luis Potosi (Nahua-SLP) and Tepehuas (Figure 1; Tables S1 and S2). No individuals had Y chromosomes belonging to haplogroup C3.
Figure 1
A phylogenetic tree of haplogroup Q
The haplogroup nomenclature follows the Y-chromosome Consortium (2002), Karafet et al., (2008) and International Society of Genetic Genealogy (2019–2020) recommendations. Note: The number of individuals belonging to each haplogroup in Native Mexican populations is shown on the right side in the table. The blue, green, and purple colors show the haplogroups found in the present study; the yellow color indicates the Q-F1096 sublineage within Q-MEH2. The dotted line indicates continuity of haplotypes within a specific lineage that cannot be fully represented here due to space constraints.
A phylogenetic tree of haplogroup QThe haplogroup nomenclature follows the Y-chromosome Consortium (2002), Karafet et al., (2008) and International Society of Genetic Genealogy (2019–2020) recommendations. Note: The number of individuals belonging to each haplogroup in Native Mexican populations is shown on the right side in the table. The blue, green, and purple colors show the haplogroups found in the present study; the yellow color indicates the Q-F1096 sublineage within Q-MEH2. The dotted line indicates continuity of haplotypes within a specific lineage that cannot be fully represented here due to space constraints.Overall, Y-STR haplotype diversity in Native Mexican populations was enormous, with 146 different indigenous haplotypes being observed among 175 unrelated men. Oto-Manguean- (h = 0.995), and Totonaca-Tepehua- (h = 0.994) speaking populations exhibited the greatest diversity of Y-STR haplotypes, whereas Uto-Aztecan speakers (Aztec descendants) were slightly less diverse (h = 0.989) (Table 1). Most interestingly, only 1.37% of the 146 Y-STR haplotypes were shared between populations. This sharing occurred mainly within Oto-Manguean (Éza'r and Hñähñús) groups and between Oto-Manguean- (Hñähñús) and Uto-Aztecan- (Nahuas) speaking groups (Table S3).
Table 1
Diversity parameters for NRY haplogroups in Native Mexican populations
Population
N
H
Haplotype diversity
MPD
ρ ± σ
Vp
Q1a (Q-MEH2)
Nahuas
4
2
1 ± 0.500
0
9.060 ± 1.45
0
Q1b1a1a (Q-M3)
Éza'r
6
6
1.000 ± 0.096
8.600 ± 3.633
12.667 ± 2.444
0.533
Hñähñús
30
26
0.988 ± 0.013
8.110 ± 3.872
12.069 ± 1.915
0.561
Nahuas
47
38
0.982 ± 0.012
7.273 ± 3.467
9.953 ± 1.678
0.464
Popolocas
8
7
0.964 ± 0.077
6.250 ± 3.322
9.334 ± 2.149
0.504
Tepehuas
11
10
0.982 ± 0.046
7.891 ± 3.987
17.154 ± 3.018
0.548
Totonacas
15
14
0.990 ± 0.028
7.590 ± 3.753
12.500 ± 2.019
0.581
Overall
117
101
0.996 ± 0.002
7.850 ± 3.679
21.025 ± 3.501
0.550
Q1b1a1a1e1a∗ (Q-Z768)
Éza'r
1
1
1.000
0
8.000 ± 0.500
0
Hñähñús
12
8
0.924 ± 0.057
5.015 ± 2.621
15.769 ± 3.395
0.306
Nahuas
5
4
0.900 ± 0.161
5.900 ± 3.391
12.833 ± 2.769
0.304
Tepehuas
2
2
1.000 ± 0.500
1.000 ± 1.000
7.667 ± 2.236
0.029
Overall
20
15
0.968 ± 0.025
5.326 ± 2.683
15.286 ± 3.357
0.640
Q1b1a1a1e1b1a∗ (Q-L633)
Hñähñús
2
2
1
0
5.333 ± 0.943
0
Q1b1a2 (Q-Z777/Z780)
Éza'r
3
3
1.000 ± 0.272
8.000 ± 5.127
13.250 ± 1.300
0.471
Hñähñús
9
8
0.972 ± 0.064
9.028 ± 4.593
12.400 ± 2.069
0.716
Nahuas
6
6
1.000 ± 0.096
10.267 ± 5.472
12.143 ± 1.895
0.671
Tepehuas
6
4
0.800 ± 0.172
5.534 ± 3.097
19.571 ± 3.717
0.358
Totonacas
6
5
0.934 ± 0.122
2.534 ± 1.578
17.286 ± 3.761
0.212
Overall
30
26
0.988 ± 0.013
8.689 ± 4.127
26.000 ± 4.220
0.635
Q1b1a3a1f (Q-PV3)
Nahuas
1
1
1.000 ± 0.272
0
12.000 ± 1.225
0
Tepehuas
1
1
1.000 ± 0.272
0
9.500 ± 0.500
0
Overall
2
2
1.000 ± 0.500
6.000 ± 4.583
14.300 ± 2.963
0
N, number of samples; h, number of different haplotypes found; MPD, mean pairwise differences; ρ, average distance root haplotypes; σ, standard deviation for ρ; Vp, intrapopulation genetic variance.
Diversity parameters for NRY haplogroups in Native Mexican populationsN, number of samples; h, number of different haplotypes found; MPD, mean pairwise differences; ρ, average distance root haplotypes; σ, standard deviation for ρ; Vp, intrapopulation genetic variance.
Statistical analysis of NRY diversity in Native Mexicans
Analysis of molecular variance
To delineate the structure of the study populations, we conducted an analysis of molecular variance (AMOVA) using different criteria (Table 2). Overall, the differences within groups regardless of the categories into which they were placed accounted for the vast majority of the diversity observed in them. Based on geographic location, these ethnic groups were categorized using two criteria. In the geography-1 criterion, variation among groups was insignificant, whereas a marginally significant difference among groups (p = 0.085) was found using the geography-2 criterion. On the basis of their linguistic affiliation, the AMOVA did not reveal any significant differences between populations.
Table 2
Analysis of molecular variance using Y-STR haplotypes from Native Mexicans
Groups
Percentage of variation
p value
Geography-1
Among groups
0.41
0.291
Between populations within group
5.85
≤0.0001
Within group
93.74
≤0.0001
Geography-2
Among groups
2
0.085
Between populations within group
5.08
≤0.0001
Within group
92.91
≤0.0001
Language
Among groups
−1.36
0.403
Between populations within group
7.16
≤0.0001
Within group
94.20
≤0.0001
Criteria definitions: Geography-1: Center (Hidalgo: Hñähñús, Nahuas, Tepehuas; Morelos: Nahuas; Puebla: Popolocas, Totonacas); West (Guanajuato: Hñähñús, Éza'r; Querétaro: Hñähñús); South-East (Veracruz, Totonacas); North-East (San Luis Potosí, Nahuas). Geography-2: North-Central (Guanajuato: Hñähñús, Éza'r; Querétaro: Hñähñús; San Luis Potosí: Nahuas); Eastern (Hidalgo: Hñähñús, Nahuas, Tepehuas; Puebla: Popolocas, Totonacas; Veracruz: Totonacas); South-Central (Morelos: Nahuas). Language: Oto-Manguean: Éza'r, Hñähñús, and Popolocas; Toto-Zoquean: Tepehuas and Totonacas; Uto-Aztecan: Nahuas.
Analysis of molecular variance using Y-STR haplotypes from Native MexicansCriteria definitions: Geography-1: Center (Hidalgo: Hñähñús, Nahuas, Tepehuas; Morelos: Nahuas; Puebla: Popolocas, Totonacas); West (Guanajuato: Hñähñús, Éza'r; Querétaro: Hñähñús); South-East (Veracruz, Totonacas); North-East (San Luis Potosí, Nahuas). Geography-2: North-Central (Guanajuato: Hñähñús, Éza'r; Querétaro: Hñähñús; San Luis Potosí: Nahuas); Eastern (Hidalgo: Hñähñús, Nahuas, Tepehuas; Puebla: Popolocas, Totonacas; Veracruz: Totonacas); South-Central (Morelos: Nahuas). Language: Oto-Manguean: Éza'r, Hñähñús, and Popolocas; Toto-Zoquean: Tepehuas and Totonacas; Uto-Aztecan: Nahuas.To determine the influence of ethnicity on the pattern of genetic variation, an AMOVA was conducted with data from the Hñähñús, Nahuas, and Totonacas (Table S4). Hñähñú and Totonaca populations did not show any significant genetic differences among themselves. By contrast, Nahua populations exhibited small but statistically significant differences among themselves (p = 0.009).
Population differentiation
To further explore the pattern of patrilineal diversity in Native Mexican populations, we analyzed the Y chromosome data with an exact test of population differentiation. This test was conducting using R values estimated from 17 loci Y-STR haplotypes, with p values being adjusted for false discovery rates (Benjamin and Hochberg, 1995). The populations were categorized by geography, language family, or ethnicity for these tests, as done for the AMOVA. The resulting R estimates for Native Mexican populations were then visualized in a multidimensional scaling (MDS) plot.Using the geography-1 criterion, we observed a clear separation between the South-East populations (Totonacas-Veracruz) and those from the other geographic regions (Figure S1A). However, using the geography-2 criterion, there was no clear genetic separation between the South-Central region with the North-Central and Eastern populations (Figure S1B). When grouping Native Mexican populations by the language criterion, the Totonaca-Tepehua populations were clearly distinguished from both the Oto-Manguean and Uto-Aztecan populations, which themselves were genetically similar (Figure 2A; Table S5).
Figure 2
Multidimensional scale plot of R values estimated from 17 Y-STRs haplotypes among Central Valley of Mexico Native Americans belonging to haplogroup Q
(A) With linguistic affiliation as criterion.
(B) With ethnicity as criterion. The dotted circles enclose populations that share no significant genetic distances.
Note: HGO, Hidalgo; GTO, Guanajuato; MOR, Morelos; PUE, Puebla; QRO, Querétaro; SLP, San Luis Potosí; VER, Veracruz. Oto-Manguean: Éza'r (Chichimecas); Hñähñús (from HGO, GTO, and QRO) and Popolocas; Totonacas-Tepehuas: Tepehuas and Tononacas from PUE and VER; Uto-Aztecans: Nahuas from HGO, MOR, and SLP. All p values were adjusted with the method of false discovery rates in R-software.
Multidimensional scale plot of R values estimated from 17 Y-STRs haplotypes among Central Valley of Mexico Native Americans belonging to haplogroup Q(A) With linguistic affiliation as criterion.(B) With ethnicity as criterion. The dotted circles enclose populations that share no significant genetic distances.Note: HGO, Hidalgo; GTO, Guanajuato; MOR, Morelos; PUE, Puebla; QRO, Querétaro; SLP, San Luis Potosí; VER, Veracruz. Oto-Manguean: Éza'r (Chichimecas); Hñähñús (from HGO, GTO, and QRO) and Popolocas; Totonacas-Tepehuas: Tepehuas and Tononacas from PUE and VER; Uto-Aztecans: Nahuas from HGO, MOR, and SLP. All p values were adjusted with the method of false discovery rates in R-software.Using the ethnicity criterion, we observed a set of overlapping relationships among the Native Mexican populations. There were differences between Nahuas-HGO and Nahua-SLP versus Nahuas-MOR, whereas the Hñähñú showed affinities with all groups except the Éza'r and Popolocas. Interestingly, the Tepehuas were genetically similar to all other groups. By contrast, the Éza'r showed genetic affinities with only Nahuas-MOR, Nahuas-SLP, and Tepehuas. The Popoloca were the most distant group from the rest of the populations, although showing affinities with Nahuas-SLP and Nahuas-MOR (Figure 2B; Table S6).
Non-native haplogroups in Native Mexicans
Non-indigenous Y chromosomes were observed within Native Mexican populations, and primarily those living in Hidalgo state, reflecting differing degrees of admixture in these ethnic groups. The extent of non-native admixture varied in the HGO populations: Hñähñús (62.3%), Nahuas (15.9%), and Tepehuas (11.6%) (Tables S2 and S7). The non-native lineages belonged mainly to West Eurasian haplogroups deriving from North-Central Europe, the Mediterranean region, and North Africa, with R1b representing over half of them (50.8%) (Table S8). These lineages also showed a high diversity of Y-STR haplotypes, with relatively few being shared in Native Mexicans populations.In general, these results revealed the impact of the influx of European men into indigenous communities. The geographically diverse origins of the Spanish conquerors were mirrored in the diversity of the most frequent haplogroups found within Native Mexicans.The detailed analysis of R1b haplotypes showed them to have the closest genetic relationship to those from the regions of Andalucía, Castilla La Mancha, Cataluña, and Lower Navarre (France) (Figure 3A; Tables S2, S7, and S8) (Martinez-Cadenas et al., 2016; Martinez-Gonzalez et al., 2012). Notably, the R1b haplotypes in the Nahua populations were distinct from those in the central cluster, which included the Mestizo, Hñähñús, and Tepehua populations, suggesting a specific source for them. Inside the Hñähñús from Hidalgo, several R1b haplotypes were present (Table S1; Figure 3B). Thus the Hñähñús from Ixmiquilpan were more related with Spaniards from Madrid and Ciudad Real, whereas Hñähñús from Pañhé and Tutotepec were closely related to French individuals (Martinez-Cadenas et al., 2016; Martinez-Gonzalez et al., 2012; Myres et al., 2011). By contrast, the Hñähñús from Zimapán showed similarities to the Jaén from the south-central region of Spain.
Figure 3
Multidimensional scale plot of R values
(A) Estimated from 14 Y-STRs haplotypes belonging to haplogroup R1b.
(B) Estimated from 7 Y-STRs and comparing the Hñähñú populations from Hidalgo state by sampling zone.
Note: The comparative data were taken from Barrot et al. (2007); Martinez-Cadenas et al. (2016); Martinez-Gonzalez et al. (2012); Myres et al. (2011); and Santana et al. (2014). Although Hñähñú is the name by which the Otomi now self-identify, we use the older name provided by the original authors (Barrot et al., 2007). The red diamonds show the data from the present study.
Multidimensional scale plot of R values(A) Estimated from 14 Y-STRs haplotypes belonging to haplogroup R1b.(B) Estimated from 7 Y-STRs and comparing the Hñähñú populations from Hidalgo state by sampling zone.Note: The comparative data were taken from Barrot et al. (2007); Martinez-Cadenas et al. (2016); Martinez-Gonzalez et al. (2012); Myres et al. (2011); and Santana et al. (2014). Although Hñähñú is the name by which the Otomi now self-identify, we use the older name provided by the original authors (Barrot et al., 2007). The red diamonds show the data from the present study.Regarding other paternal lineages, the haplotypes from haplogroups E, I, and J in Native Mexicans showed similarities to those from Aragon, Castilla y Leon, Extremadura, Madrid, and the Valencian communities of Spain (Tables S2, S7, and S8; Figure S2) (Martinez-Cadenas et al., 2016; Martinez-Gonzalez et al., 2012). It is noteworthy that the Native Mexican populations living fairly close to urban settlements exhibited the highest frequency and widest diversity of non-native haplogroups (i.e., E1b1b, G, I, J, L, R1b, and T). In this regard, some of the sampling locations for the Hñähñú were located 11 to 20 km from urban settlements, a location that facilitates intermarriage with Mestizos. In fact, Hñähñú communities close to Ixmiquilpan-HGO had close to 30% non-native lineages, whereas the Hñähñú community located closest to the Queretaro border had a moderate frequency of haplogroup R1a haplotypes (18%)
Ancestral connections with Asian and other Native American populations
The remarkable Y-STR diversity described above was mirrored in the networks for haplogroups Q-MEH2, Q-L54, and Q-M3. The data from these three lineages supported a connection between present-day Native Mexicans and ancient founder populations that gave rise to modern Native Americans. However, each haplogroup showed subtle differences in its phylogenetic structure.The Q-MEH2 network contained haplotypes from Northeast Asians, Native North Americans, and the Nahuas-HGO (Figure 4). At least 16 mutations separated the haplotypes in Nahuas-HGO from those in the other populations, suggesting a significant divergence between them. Interestingly, the Tlingit haplotype was closer to those of Northeast Asians than to that of the Nahua-HGO. This finding suggests that the ancestral Q-MEH2 haplotype in the Nahua-HGO arrived much earlier in the Americas than the haplotypes present in more northerly populations. However, in being based on only seven Y-STR loci, this network likely does not fully represent the pattern of diversity in this haplogroup.
Figure 4
A median-joining network of Q1a (Q-MEH2) lineage using 7 Y-STRs
Comparative data from Regueiro et al. (2013) and Schurr et al. (2012). Note: The smallest circles represent one individual. NE-Siberia, Northeast Siberia.
A median-joining network of Q1a (Q-MEH2) lineage using 7 Y-STRsComparative data from Regueiro et al. (2013) and Schurr et al. (2012). Note: The smallest circles represent one individual. NE-Siberia, Northeast Siberia.By contrast, the Q-M3 and Q-L54 networks exhibited long branches, multiple reticulations, and missing intermediate nodes, with these features hinting at the existence of as yet undefined sublineages (Figures 5A, 5B and 6). Each network also lacked a clear central node, suggesting considerable mutational divergence from the founding haplotype for each of them. Furthermore, each network had a high number of singletons and the presence of isolated groups and distant haplotype clusters within it. The overall lack of haplotype sharing among these Native Mexican populations was again quite remarkable. The haplotype diversity within the Q-M3 network was reflected by the intrapopulation variance (Vp) obtained from 17 Y-STRs, which was high among the different Native Mexican populations, suggesting an ancient diversification process (Tables 1 and S2). A tight group with several long, separated branches suggested the coexistence of different sublineages within it (Figure 5A). When defining SNPs were added to the network analysis, the Q-Z768 haplotypes were mainly distributed inside of the central network, whereas Q-L663 and some Q-M3 haplotypes formed separated branches (Figure 5B). VM populations also presented nuanced connections with other Nahua populations from the south of Mexico (i.e., Santo Domingo Ocotitlan in Morelos, and Zitlala in Guerrero) and the southeast of Mexico City (i.e., San Pedro Actopan and Xochimilco) (Sandoval et al., 2012) (Figure S3; Table S9). Remarkably, these different Nahua groups did not share any haplotypes, a finding reflecting the great genetic diversity within this ethnic group.
Figure 5
Median-joining network of haplogroup Q1b1a1a (Q-M3) based on 17 Y-STRs
(A) With the ethnicity as criterion.
(B) With the distribution of sublineages Q-Z768 and Q-L663.
Note: The circle size indicates the number of individuals with shared haplotypes; the smallest circles represent one individual.
Figure 6
A median-joining network of Q1b1a2 (Q-Z780) and Q1b1a3a1f (Q-PV3) lineages using 17 Y-STRs
Note: The dotted line shows the carriers of PV3 sublineage.
Median-joining network of haplogroup Q1b1a1a (Q-M3) based on 17 Y-STRs(A) With the ethnicity as criterion.(B) With the distribution of sublineages Q-Z768 and Q-L663.Note: The circle size indicates the number of individuals with shared haplotypes; the smallest circles represent one individual.A median-joining network of Q1b1a2 (Q-Z780) and Q1b1a3a1f (Q-PV3) lineages using 17 Y-STRsNote: The dotted line shows the carriers of PV3 sublineage.Q-M3 haplotypes in Native Mexicans using 14 Y-STRs showed connections with those of different indigenous North American groups (Figure S4; Table S9). These included the Inuvialuit (Hñähñús, Nahuas-SLP, and Totonacas), Tlingit (Hñähñús, Nahuas-HGO, Nahuas-SLP, and Totonacas), and Gwich'in (Hñähñús, Nahuas-SLP, and Popolocas). A number of these North American haplotypes appeared on a single branch extending from the left side of the network. On the other side, a separate cluster (shown in inset) included Tlicho, Tlingit, Gwich'in, and one Hñähñús haplotype and branch extending from it included Gwich'in, Tlingit, Totonaca, and Nahua haplotypes.We also compared Native Mexican haplotypes with those from the Greenland Inuits (Olofsson et al., 2015) and Inuvialuit of the Northwest Territories (Dulik et al., 2012a, 2012b) using 17 Y-STR haplotypes. Most Inuit and Inuvialuit haplotypes clustered together with a single Nahua haplotype (Figure S5; Table S9). One other Inuit Q-M3 haplotype appeared on a branch with Native Mexican haplotypes. The position of the Inuit haplotypes suggested a possible ancestor-descendant relationship with those of Native Mexicans, although the former may possess SNPs not present in the latter.We further analyzed the genetic relationships between Native Mexican populations and indigenous groups from Central America (Guatemala, El Salvador, Nicaragua, and Panama) using R values from 14 Y-STR haplotypes belonging to Q-M3. Through this analysis, we observed some similarities among these populations (Figure S6; Table S10). Populations in regions close to Mexico (Guatemala, El Salvador, Nicaragua) were more genetically similar, whereas those from Panama differed from nearly all Native Mexican groups. This finding contrasts with previous studies, where certain Chibchan-Panamanian haplotypes were similar to ones appearing in the Nahuas and Hñähñús, suggesting possible Uto-Aztecan and Oto-Manguean gene flow to the Panama region (Nunez et al., 2010).We also evaluated the genetic relationships between indigenous South American and Native Mexican populations using 14 Y-STRs from Q-M3 (Figure S7; Table S11). In the resulting MDS plot, the South American populations were generally similar to Native Mexicans, although appearing around the periphery of the main cluster of Mexican populations. When mapping known SNPs in South American Q-M3 haplotypes onto these populations, we observed that the Q-SA04, Q-SA05, and Q-Z5915 sublineages were confined to South American populations, as previously seen (Jota et al., 2016). The Q-SA04 sublineage seems to have a recent origin, whereas the Z5915 SNP along with other markers may help to differentiate Y-chromosomes in South American populations (Jota et al., 2016).The apparent connection between Bolivian and Peruvian populations may be due to a shared apomorphy, which has diverged recently in indigenous South Americans. However, Bolivians with Q-CTS11357 haplotypes clustered among Native Mexicans. This was not wholly surprising, as Q-CTS11357 haplotypes have recently been documented in Mexican populations (International Society of Genetic Genealogy [ISOGG], 2019-20). Thus, it is likely that some Native Mexican men having Q-M3 Y chromosomes belong to this sublineage.Unlike that for Q-M3, the Q-L54 network exhibited a slightly simpler topology, with a number of long branches (Figure 6). Hñähñú and Nahua haplotypes were scattered across the network reflecting their great diversity, an assessment also supported by the highest Vp and the mean pairwise differences values in these groups (Table 1). A similar distribution was previously noted in Mexican and some Central American populations (Battaglia et al., 2013) (Figure S8; Table S9). Totonacan haplotypes grouped into a single subbranch and also showed the lowest Vp values (Table 1). Again, remarkably, almost none of these Y-STR haplotypes were shared between the study and comparative populations. Furthermore, the Q-PV3 sublineage within Q-L54 was quite divergent from the other haplotypes in the network, suggesting it might represent another founding lineage in the Americas (Figure 6).We evaluated the genetic relationship of Q-L54 haplotypes present in Native Mexican and other comparative populations using 14 Y-STR haplotypes. We noted that Native Mexicans showed considerable similarity to other Mesoamerican and South American populations (Battaglia et al., 2013) (Figure S9; Tables S9 and S12). By contrast, the Gwich'in, Altai-kizhi, and Tuvans were distinct from all New World populations.A network analysis of Q-L54 haplotypes defined by 15 Y-STRs in Native Mexican and comparative populations revealed substructure within this haplogroup (Figure S10A; Table S9). Altai-kizhi and Tuvan haplotypes formed a central cluster in the network that contained one Tepehua sample, whereas all indigenous Canadian haplotypes appeared in a separate branch containing one Nahua haplotype. The remaining Native Mexican haplotypes were widely dispersed along other branches.We further explored the relationship between Native Mexicans and South American populations using 15 Y-STR haplotype data from Q-L54. The resulting MDS plot showed genetic affinities between these populations through the Q-CTS1780 sublineage (Figure S10B; Tables S9 and S13), which is closely related to the Q-Z780 sublineage. Both Q-CTS1780 and Q-Z780 define the sub-haplogroup Q1b1a2 (ISOGG, 2019-20). Their dissimilarity with Colombian populations was possibly related to the ancient differentiation of Q-L54 haplotypes in the Isthmo-Colombian region, as reported previously (Grugni et al., 2015) and evident by the presence of the Q-SA02 and Q-SA03 sublineages (Jota et al., 2016). In both the Q-M3 and in Q-L54 networks, the comparative populations shared few haplotypes, a finding mirroring the great Y chromosome diversity of the Native Mexican populations.
Timing of the arrival of indigenous haplogroups in Mesoamerica
Coalescent times (time to more recent common ancestor [in years] [TMRCAs]) were estimated for haplogroups Q-MEH2, Q-M3, and Q-L54 using ρ-statistics and Bayesian analysis. The results presented in Table 3 correspond to the average of three generation times (18, 20, and 25 years). In general, the TMRCA estimates for Q-M3 and Q-L54 suggested their simultaneous arrival in Mesoamerica, consistent with previous coalescent estimates for these lineages (Grugni et al., 2019). Nonetheless, subtle differences in the TMRCAs were found among the populations. Using Bayesian methods, the TMRCA for Q-M3 ranged between 17,823 years before present (YBP) (Nahuas) and more than 24,000 YBP (Éza'r and Tepehuas), whereas the TMRCA for Q-L54 ranged from 17,505 YBP (Totonacas) to 23,788 YBP (Nahuas). These data support the diversification of these lineages soon after their arrival in Mexico and Mesoamerica. The TMRCA for Q-MEH2 was 9,699 YBP, suggesting a more recent arrival in a separate migration, although the sample size for this lineage was quite small.
Table 3
Coalescent time estimates for NRY haplogroups in Native Mexican populations
Population
N
ρ-StatisticTMRCA (YBP)
ρ-StatisticSE
BatwingTMRCA (YBP)
Batwing95% CI
BeastTMRCA (YBP)
95% HPD
Q1a (Q-MEH2)
Nahuas
4
3,624
±1,001
9,699
8,067–11,331
4,400
0–12,600
Q1b1a1a (Q-M3)
Éza'r
7
4,700
±875
24,310
21,218–27,402
8,810
3,880–14,400
Hñähñús
44
6,254
±805
21,041
19,974–22,109
14,990
9,460–20,420
Nahuas
52
5,436
±694
17,823
16,991–18,654
15,170
12,520–24,270
Popolocas
8
4,926
±981
22,921
20,194–25,648
9,870
3,720–16.500
Tepehuas
13
5,506
±968
24,880
22,557–27,202
10,470
4,510–16,890
Totonacas
15
6,946
±971
20,043
18,320–21,785
13,470
6,860–20,480
Overall
139
4,771
±583
18,589
18,058–19,119
17,151
15,970 - 20,250
Q1b1a2 (Q-L54)
Éza'r
3
5,662
±555
22,775
18,350–27,200
12,000
2,000–29,000
Hñähñús
9
6,946
±1,087
22,702
20,155–25,249
13,000
3,000–35,000
Nahuas
7
5,371
±782
23,788
20,763–26,814
13,890
2,000–24,000
Tepehuas
7
2,537
±496
20,462
17,859–23,065
7,000
1,000–18,000
Totonacas
6
2,265
±830
17,505
15,100–19,910
5,000
1,000–13,000
Overall
32
5,053
±664
27,479
25,844–29,113
17,155
14,730 - 24,150
N, number of samples; ρ, average distance from root haplotype (in years); YBP, years before present; SE, standard error; TMRCA, time to more recent common ancestor (in years); CI, confident interval; HPD, highest posterior density.
Coalescent time estimates for NRY haplogroups in Native Mexican populationsN, number of samples; ρ, average distance from root haplotype (in years); YBP, years before present; SE, standard error; TMRCA, time to more recent common ancestor (in years); CI, confident interval; HPD, highest posterior density.Overall, the TMRCAs determined using Bayesian methods were 2–3 times the values generated with ρ-statistics (Bouckaert et al., 2014; Wilson et al., 2003). Of note, the overall TMRCA values obtained for Q-M3 by the two Bayesian methods were consistent, whereas those estimated for Q-L54 were discrepant with each other, possibly due to the modest sample size for this paternal lineage.
Discussion
Spanish chroniclers described the Aztec Empire as being culturally and ethnically diverse, where populations from different language groups, such as Hñähñú, Nahuatl, Totonac, Popoloca, and Huasteco, coexisted in the tropical lowlands during the Olmec period (Sahagun, 2011; Soustelle, 1993; Torquemada, 2018). These features provide a unique opportunity to reconstruct the diversity of the Aztec descendants and the coexisting ethnolinguistic groups, which has been briefly explored here. In addition, more fully characterizing the multi-ethnic nature of contemporary Mexican Native Americans is crucial for expanding our knowledge about the initial colonization of the New World. The key to answering these two questions is elucidating the kinds of ancient lineages present in the contemporary Native Mexicans rather than Mestizo populations, which have experienced higher levels of European and African admixture (Battaglia et al., 2013; Sandoval et al., 2012).
Y chromosome diversity in Native Mexican populations
Contemporary Native Mexicans exhibit remarkable Y chromosome diversity within the three main haplogroups found within their populations. This diversity is not strongly structured by ethnicity and geography. Comparable findings have been reported previously in indigenous populations from Mexico (Sandoval et al., 2012) and South America (Roewer et al., 2013).The notable diversity in the Hñähñús could reflect their antiquity in the region, being contemporaries of the Cholula, Pinotl, Teotihuacan, Tlaxcalteca, and Toltec peoples (Sahagun, 2011; Soustelle, 1993; Torquemada, 2018). The significant differences found in the Nahuas-MOR could reflect their historic link with Toltec descendants (i.e., Tlahuicas, Xochimilcas), Aztec sister cultures (i.e., Tepanecas and Alcohuas), and the spread of Oto-Manguean languages (i.e., Matlatzincas, Mazahuas, Hñähñús), which has been supported in previous reports (Instituto Nacional de los Pueblos Indígenas, 2019; Nichols and Rodriguez-Alegria, 2017; Sandoval et al., 2012).The continuous genetic exchange among ancestral Native Mexican populations likely enriched their diversity over time, leading to the relative genetic similarities among the study populations (Perego et al., 2010). These practices contributed to acculturation and allowed Aztecs to gain supremacy in Mesoamerica, incorporating new cultural, linguistic, and social variation into the communities of themselves and subject populations (Nichols and Rodriguez-Alegria, 2017). Aztec populations, diverse per se, intermarried with Toltecs, Totonacas, Tepehuas, and Hñähñús (Berdan et al., 1996; Duverger, 2007). The Hñähñús continuously interacted with other Oto-Manguean groups, such as the Éza'r, Mazatecas, Nonoalcas, and Popolocas (Josserand et al., 1984; Lastra, 2006). Furthermore, the Hñähñús also mixed with groups from Oaxaca (Chinantecos, Mixes, Mixtecos, and Zapotecos), Veracruz (Popolucas, Totonacas), Hidalgo (Tepehuas), and Puebla (Totonacas-PUE) (Lastra, 2006; Soustelle, 1993). Thus, continual gene flow between different peoples who apparently lacked cultural boundaries can partly explain the remarkable diversity found in the study populations.Consequently, the Aztec Empire could have been a conglomerate of different populations that traced their origins to Aztlan. Aztec cities were founded through fissiparity, leading to the homogenization and agglutination of different peoples (Berdan et al., 1996; Duverger, 2007). The lack of genetic stratification by ethnicity and geographic criteria would be supported by these practices, besides the continuous population growth of the constituent ethnic groups (Roewer et al., 2013). Notwithstanding these population dynamics, the gene pool of pre-Columbian Mesoamericans has been diversifying since the founding populations first arrived there (de Azevedo et al., 2015). Thus the notable diversity observed in this study could also reflect admixture events before Mesoamerican populations began to ethnically diversify, consistent with previous reports (de Azevedo et al., 2015; Regueiro et al., 2013; Ruiz-Linares, 2014). Founder populations had recurrent streams both between them and with recently emerging Native Mexican groups, who had probably become locally differentiated (Kumar et al., 2011).The significant differences between the Popolocas and the other four ethnic groups was intriguing. These differences could possibly be related to the Popoocas' recent separation from the Ngiwas (Chocho-Popolocan speakers) around 1000 YBP (Instituto Nacional de los Pueblos Indígenas, 2019). Popolocas have also maintained genetic and cultural exchange with Nahuas-MOR and Totonacas-PUE, whereas their interactions with the Nahuas-SLP are marginal. Thuis, further studies are needed to elucidate their remarkable differences, perhaps by increasing the sample size.The amalgam between Oto-Manguean and Uto-Aztecan linguistic families could reflect the biocultural makeup of the ancient cosmopolitan societies. A tight relationship between the Nahua ancestors and the first settlers of the VM was described (Sahagun, 2011; Soustelle, 1993; Torquemada, 2018; Vargas-Alarcon et al., 2007). Before the Aztecs began their pilgrimage to the VM, Hñähñús and other ethnic groups already resided in this geographic area, establishing cultural and military alliances with them (Berdan et al., 1996; Duverger, 2007; Nichols and Rodriguez-Alegria, 2017; Soustelle, 1993). Overall, this pattern of settlement is consistent with the TMRCAs for the Q haplogroups, which support the antiquity of these indigenous groups in the region (Delgado-Burbano et al., 2010; Sandoval et al., 2012). Thus, the connection between these two linguistic families could reflect at least 370 years of gene flow between them (Beezley, 2011). The Nahuatl monopolized Mesoamerica, at least for three centuries, homogenizing the different peoples and serving as a lingua franca during the Aztec Empire (Duverger, 2007; Hill, 2001).Alternately, the remarkable diversity within the Nahua groups could reflect the consequences of the “Nahualization” process experienced by the neo-Aztec civilizations during the diffusion of maize domestication with the subsequent imperial expansion (Vargas-Alarcon et al., 2007). However, considerable work will be needed to determine whether the ancestors of the Hñähñús or the Nahuas contributed to the diversity of the others (aside from the military supremacy). Given the lack of continuity between the Uto-Aztecan and Aztec populations, some results should be interpreted in light of this limitation.The differences between Oto-Manguean and Uto-Aztecan regarding the Totonaca-Tepehua linguistic affiliation could be a consequence of its geographic localization (Puebla's Sierra Norte). While the Tepehua and the Totonac were related for at least 26 centuries (Hernandez-Montes and Heiras-Rodríguez, 2004), the Totonac have also been associated with Mayan, Mixe, Popoluca, P'urhépecha, and Zoque languages (Hernandez-Montes and Heiras-Rodríguez, 2004; Moreno-Estrada et al., 2014). Likewise, populations from Guatemala, El Salvador, and Panama have historically interacted with Totonacas, suggesting genetic interaction between them (Witschey and Brown, 2012). The separation between the Tepehuas and the Totonacas could also be a consequence of cultural and demographic events (with sex bias), as well as by the genetic drift to which uniparental markers are more susceptible (Raghavan et al., 2015).The absence of population stratification observed in our analysis contrasts with the results of previous reports analyzing autosomal diversity (Ávila-Arcos et al., 2020; Moreno-Estrada et al., 2014). In this case, although genetic substructure was reported, it was detected only when long geographic distances were compared (i.e., Mesoamerican versus Northern populations, or among Central, Northern, and Southern regions) (Ávila-Arcos et al., 2020; Gonzalez-Sobrino et al., 2016; Sandoval et al., 2012). Alternatively, the lack of genetic structure could be explained by the high prevalence of Q-M3 haplotypes in Native Mexicans (>79%). The fact that the majority of the populations studied were geographically adjacent could also possibly have influenced our analysis. It may further be the case that the lack of genetic stratification found in our study could focus on the haploid portion of the genome, not autosomal markers as in other studies (Ávila-Arcos et al., 2020; Moreno-Estrada et al., 2014).
Impact of European admixture in Native Mexican population from Hidalgo
The most significant contribution of non-native paternal lineages is within ethnic groups from Hidalgo state (Hñähñús, Nahuas, and Tepehuas), some of them living in the Sierra de Pachuca, a small extension of the Sierra Madre Oriental. This geographic region is rich in metallic minerals, with the silver mines discovered before 1552 being exploited since the colonial period (Saavedra-Silva and Sanchez-Salazar, 2008). Spanish immigrants moved to this geographic region to establish the first mining towns (Beezley, 2011) and, in the process, contributed their paternal lineages to local Native Mexican communities. The scarcity of female migrants (6%) (Ongaro et al., 2019) further stimulated gene flow between Spanish men and Native Mexican women, reducing the representation of indigenous paternal lineages in these communities but allowing them to maintain their indigenous languages. This admixture process could have occurred primarily in those communities located near the Pachuca-Real del Monte corridor (Saavedra-Silva and Sanchez-Salazar, 2008), and then extended to nearby towns (i.e., Ixmiquilpan). Gene flow between populations from urban and rural regions of Central Mexican states could thus have affected the frequency of Y chromosome lineages in indigenous populations, including haplogroup Q and its major sub-branches (Gonzalez-Sobrino et al., 2016).Given this history, it was not unexpected to find that the European paternal lineages in Native Mexicans resembled those from several regions of the Iberian Peninsula. Similar findings using eight Y-STR haplotypes were previously reported in Huastecos and Hhähñús from Ixmiquilpan, where 26% non-indigenous lineages were found (Barrot et al., 2007). In addition, our findings were in accordance with those reported in the Hñähñú population from Mezquital Valley (located at 27 km to Ixmiquilpan in Hidalgo) based on Y-SNPs (Gonzalez-Sobrino et al., 2016). On the other hand, some Nahua R1b haplotypes showed some genetic dissimilarity from others coming from Spain. These haplotypes could have been brought more recently (during the 19th century), when mining economic activity was opened to other foreign investment from England, France, Germany, Italy, and North America (Ongaro et al., 2019; Saavedra-Silva and Sanchez-Salazar, 2008).Another important contribution of non-indigenous haplogroups was found on the border between Hidalgo and Queretaro (18%), where individuals with R1a Y chromosomes were found. A previous analysis of paternal diversity in populations from several states from the VM (including Queretaro) also detected R1a in them (Santana et al., 2014). Queretaro is a state with a remarkable diversity of paternal lineages from Spain, the Basque Country, and even Jewish populations (Santana et al., 2014), due its importance in the mining production (i.e., gold, silver, copper, among others) (Servicio Geológico Mexicano, 2018).
Native American origins from a Y chromosome perspective
In the context of the peopling of the Americas, the striking diversity found in Native Mexican populations, as well as the genetic connections with Central and South American and Northeast Siberia populations, supports a Beringian entrance with subsequent genetic differentiation, as also seen with mtDNA data (Dryomov et al., 2015; Llamas et al., 2016; Schurr and Sherry, 2004; Skoglund and Reich, 2016). The presence of haplogroups Q-L54 and Q-M3 reflects the emergence and diversification of these lineages in Asia before the Mesoamerican settlement (de Azevedo et al., 2015; Llamas et al., 2016; Reich et al., 2012; Ruiz-Linares, 2014), as well as their subsequent diversification during the long Beringian standstill (Grugni et al., 2019; Pinotti et al., 2019) and the subsequent pre-Columbian period (Sandoval et al., 2012).Haplogroup Q-L54, which connects modern Nahua with indigenous Altaians and Tuvinians of Siberia (Dulik et al., 2012b), seems to have undergone in situ differentiation during which a number of derivative branches arose (Grugni et al., 2019; Jota et al., 2016; Pinotti et al., 2019). Within Q-Z780, the most prominent lineage (∼93%) found in our Q-L54 samples, three main sublineages have been defined, namely, Q-Z781, Q-SA02, and the Q-Z780 paragroup (Q-Z780)∗ (Grugni et al., 2019). Based on the frequency and the TMRCA for these sublineages (Q-Z780∗ + Q-SA02; 26,900 YBP) (Grugni et al., 2019), our Mexican Q-L54 samples likely belong to the Q-Z780∗ sublineage.Nonetheless, we did not dismiss the possible presence of other sublineages within Q-Z780. In light of other published evidence, samples from Nahuas, Pimas, and others obtained from Mexican ancestry in Los Angeles (MXL) have been shown to have Q-Z781 Y chromosomes, while the Q-Z782 has also been found in Mexican samples (ISOGG, 2019-20; Pinotti et al., 2019; The International Genome Sample Resource, 2020). In addition, the Z780 and CTS1780 SNPs defining Q1b1a2 have been observed in 6% of MXL samples (ISOGG, 2019-20; The International Genome Sample Resource, 2020). Thus, we may detect these sublineages in our populations through more detailed SNP genotyping.The main sublineage of Q-L54 (Q-Z-780/Z-777) also connects Aztlan descendants to the Clovis individual found in Montana (Anzick-1) (Rasmussen et al., 2014). This connection suggests that individuals carrying Q-Z780 Y chromosomes followed the Pacific coast route southward, first peopling Mexico and then moving into South America (Grugni et al., 2019; Sandoval et al., 2012). Although the Q-PV4 and Q-L191 sublineages have been previously described within this haplogroup in Mexican populations (Battaglia et al., 2013), they were not found in the present study.The other major Y chromosome lineage in the Americas, Q-M3, exhibits extraordinary diversity (with sublineages Q-Z768 and Q-L663) in Native Mexican populations. Our comparative analysis of this paternal lineage also reveals close connections between populations from North, Central, and South America, affirming a southward peopling process (Jota et al., 2016; Olofsson et al., 2015). Although most Y chromosome haplotypes belonging to this haplogroup were not fully defined by the panel of SNPs and STRs used in this analysis, our data suggest that additional undefined sublineages are present among them. To further assess this possibility, we compared our data with those from recently published studies to make inferences about the substructure of haplogroup Q-M3 in Native Mexican populations.Recent reports have identified two major sub-branches within Q-M3, these being Q-M848 (Q1b1a1a) and Q-Y4276 (Q1b1a1a2) (Grugni et al., 2019). Several different sublineages within Q-M848 have also been defined, including Q-M925 (Q1b1a1a1e) (Grugni et al., 2019; ISOGG, 2019-20). As both the Q-L663 and Q-Z768 sublineages have the M925 SNP, they must belong to this particular sublineage, as well as to Q-M848. The rest of our Q-M3 samples could belong to other sublineages within Q-M925, such as Q-CTS748 (Q1b1a1a1e1) or Q-CTS1002 (Q1b1a1a1e1a), which have also been identified in Native Mexicans (Grugni et al., 2019; The International Genome Sample Resource, 2020).Along these same lines, the MDS plot depicted in Figure S9 showed a close connection between Bolivian carriers of Q-CTS11357 Y chromosomes and our Nahua and Hñähñú samples. CTS11357 is essentially equivalent to the M925 SNP, phylogenetically speaking (ISOGG, 2019-20), and thus identifies the same sublineage. Previous studies have shown the Nahua and Pima to have the Q-CTS11357 sublineage (Pinotti et al., 2019), and the same is true for MXL, Colombians in Medellin, Karitianas from Brazil, and indigenous populations from Guatemala, El Salvador, and Nicaragua (ISOGG, 2019-20; The International Genome Sample Resource, 2020). Thus, the Q-CTS11357/Q-M925 sublineage (Q1b1a1a1e) is ubiquitous among indigenous populations of the Americas.Yet other sublineages with haplogroup Q-M3 could be present in our Native Mexican populations. One of these might be Q-Z5915 (Q1b1a1a1k1) (ISOGG, 2019-20). This is suggested by Bolivian and Peruvian populations having Q-Z5915 Y chromosomes (The International Genome Sample Resource, 2020) showing some connection with the Mexican Éza'r, Nahuas, Popolocas, and Tepehuas (Figure S9) (Jota et al., 2016). Others may be part of the clade Q1b1a1a1m (Q-CTS2731), because samples belonging to this sublineage have been detected in MXL, Mixtec, and Zapotec populations (ISOGG, 2019-20; Pinotti et al., 2019; The International Genome Sample Resource, 2020). However, because Mixtec and Zapotec populations live in southern Mexico, it is possible they will not be present in Native Mexican groups from the VM.As noted above, a second major branch of Q-M3 is Q-Y4276 (Q1b1a1a2), which appears in populations extending from Siberia to South America (Grugni et al., 2019). Its distribution across this region suggests that this sublineage could be present in our Native Mexican populations. This possibility is suggested by the general similarity of our Native Mexican populations and Athapaskan-speaking populations from North America. If true, then we might find Q-Y4276 Y chromosomes in Mexico, perhaps as a result of North American populations migrating the southern part of California and northern Mexico (Grugni et al., 2019).By contrast, the genetic relationship with the Greenland Inuit and Aboriginal populations from Canada seen in the Q-M3 and Q-L54 data for our study populations suggests the possible influence of post-glacial migrations during which ancestors of these populations crossed eastern North America to reach Greenland. To explore this possibility, the Mexican Q-M3 samples might be surveyed for markers such as Q-B143 that have been related to the Arctic route (Grugni et al., 2019), even if this SNP has not yet been observed in Native Mexican samples. In any case, this connection should be interpreted with caution given the small sample size of Greenland Inuit and Mi'kmaq haplotypes used in the analysis.Despite Q-MEH2, Q-L54, and Q-M3 being related to the initial migration (Battaglia et al., 2013), the belated arrival of Q-MEH2 (3624–4400 YBP) in Mesoamerica could be associated with a more recent migration (close to 5,500 YBP, via Beringia) (Rasmussen et al., 2010). Owing to the low-resolution (i.e., seven Y-STRs) data used to construct the median network and the reduced sample size, however, the TMRCA for this haplogroup should be taken with caution.Given the relationship between Nahuas and Athapaskan-speaking populations (present study), Q-MEH2 could have arrived in Mexico using the Pacific coastal route, a view that is also supported by mtDNA evidence (i.e., A2, D2a, and D4h3) (Dulik et al., 2012a; Llamas et al., 2016; Raghavan et al., 2015). Under this scenario, Q-MEH2 would have been brought by founder populations dispersing along the western coast of North America (Figure 7) (Raff et al., 2015). Of the four Nahua carriers of Q-MEH2, three of them also had mtDNA haplogroup A2, a maternal lineage that is particularly frequent within Nahua populations (Mata-Miguez et al., 2012; Penaloza-Espinosa et al., 2007). These postglacial migrations appear to have involved recurrent gene flow between Beringia to the Arctic Circle (Dulik et al., 2012b; Helgason et al., 2006; Rasmussen et al., 2010; Ruiz-Linares, 2014). Alternatively, the apparent connections with circumpolar populations could be due to recurrent mutations (Olofsson et al., 2015) or perhaps a recent Y chromosome diversification process (Dulik et al., 2012b).
Figure 7
Frequency of Q-MEH2 lineage and its sublineage Q-F1096 in Asian and Native American populations
Note: Data from Battaglia et al. (2013), Bisso-Machado et al. (2011), Karmin et al. (2015)Malyarchuk et al. (2011), Rasmussen et al. (2010), Regueiro et al. (2013), and Schurr et al. (2012). The map was obtained from http://mapswire.com.
Frequency of Q-MEH2 lineage and its sublineage Q-F1096 in Asian and Native American populationsNote: Data from Battaglia et al. (2013), Bisso-Machado et al. (2011), Karmin et al. (2015)Malyarchuk et al. (2011), Rasmussen et al. (2010), Regueiro et al. (2013), and Schurr et al. (2012). The map was obtained from http://mapswire.com.However, Q-F1096, a sublineage within Q-MEH2, has been reported in Alaska and Peru (Figure 7) (Grugni et al., 2019; Karmin et al., 2015; The International Genome Sample Resource, 2020). Thus, although the frequency of this haplogroup is limited per se, its rarity in Native Mexicans could be explained by demographic events occurring before the entry into Mesoamerica. Founder effects (possibly occurred in North America), along with isolation (geographic and population), may have made populations more susceptible to strong genetic drift, which would have impacted the diversity and frequency of Q-MEH2 haplotypes (Bortolini et al., 2003; Regueiro et al., 2013).To summarize, although the genetic signatures of contemporary Amerindians do not totally reflect the diversity of Late Pleistocene expansions, our data demonstrate a clear connection between Native Mexican and other indigenous Native Americans. It is important to note that these insights emerged by studying isolated populations, which allowed us to make inferences about the genetic dispersal processes with greater precision than possibly with admixed or Mestizo populations.The genetic diversity of Aztlan descendants indicates that the colonization of Mesoamerica was the result of complex population dynamics beginning from the first migratory wave in the Late Pleistocene to more recent Holocene migrations. The genetic lineages brought by these different migrations were then exchanged among emerging ethnic communities in subsequent millennia, giving rise to pattern observed today. Thus, Native Mexicans, like other indigenous American peoples, do not descend from a single, unique founding population, but instead from those arriving in a series of migratory waves.Overall, our data contribute to the understanding of the initial peopling of the Americas and the settlement of Mesoamerica and provide further impetus to new studies that will clarify the population history of Mexico. Moreover, we have revealed the extraordinary diversity of haplogroup Q-M3 in Native Mexicans, with the further phylogenetic dissection of this lineage requiring additional Y chromosome sequencing and SNP genotyping.
Limitations of the study
Despite the 21 SNPs characterized in the samples from our study populations, these were insufficient to fully elaborate the diversity within haplogroup Q-M3. In addition, owing to the modest number of Q-MEH2 Y chromosomes found, the TMRCA for this lineage may have not been accurately estimated.
Resource availability
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Theodore G. Schurr (tgschurr@sas.upenn.edu).
Materials availability
This study did not generate new unique reagents.
Data and code availability
The published article includes all datasets generated or analyzed during this study in the supplemental tables.
Methods
All methods can be found in the accompanying transparent methods supplemental file.
Authors: Tatiana M Karafet; Fernando L Mendez; Monica B Meilerman; Peter A Underhill; Stephen L Zegura; Michael F Hammer Journal: Genome Res Date: 2008-04-02 Impact factor: 9.043
Authors: Blanca Z González-Sobrino; Ana P Pintado-Cortina; Leticia Sebastián-Medina; Fabiola Morales-Mandujano; Alejandra V Contreras; Yasnaya E Aguilar; Juan Chávez-Benavides; Aurelio Carrillo-Rodríguez; Irma Silva-Zolezzi; Luis Medrano-González Journal: Biodemography Soc Biol Date: 2016
Authors: Thomaz Pinotti; Anders Bergström; Maria Geppert; Matt Bawn; Dominique Ohasi; Wentao Shi; Daniela R Lacerda; Arne Solli; Jakob Norstedt; Kate Reed; Kim Dawtry; Fabricio González-Andrade; Cesar Paz-Y-Miño; Susana Revollo; Cinthia Cuellar; Marilza S Jota; José E Santos; Qasim Ayub; Toomas Kivisild; José R Sandoval; Ricardo Fujita; Yali Xue; Lutz Roewer; Fabrício R Santos; Chris Tyler-Smith Journal: Curr Biol Date: 2018-12-20 Impact factor: 10.834
Authors: Morten Rasmussen; Yingrui Li; Stinus Lindgreen; Jakob Skou Pedersen; Anders Albrechtsen; Ida Moltke; Mait Metspalu; Ene Metspalu; Toomas Kivisild; Ramneek Gupta; Marcelo Bertalan; Kasper Nielsen; M Thomas P Gilbert; Yong Wang; Maanasa Raghavan; Paula F Campos; Hanne Munkholm Kamp; Andrew S Wilson; Andrew Gledhill; Silvana Tridico; Michael Bunce; Eline D Lorenzen; Jonas Binladen; Xiaosen Guo; Jing Zhao; Xiuqing Zhang; Hao Zhang; Zhuo Li; Minfeng Chen; Ludovic Orlando; Karsten Kristiansen; Mads Bak; Niels Tommerup; Christian Bendixen; Tracey L Pierre; Bjarne Grønnow; Morten Meldgaard; Claus Andreasen; Sardana A Fedorova; Ludmila P Osipova; Thomas F G Higham; Christopher Bronk Ramsey; Thomas V O Hansen; Finn C Nielsen; Michael H Crawford; Søren Brunak; Thomas Sicheritz-Pontén; Richard Villems; Rasmus Nielsen; Anders Krogh; Jun Wang; Eske Willerslev Journal: Nature Date: 2010-02-11 Impact factor: 49.962
Authors: María C Ávila-Arcos; Kimberly F McManus; Karla Sandoval; Juan Esteban Rodríguez-Rodríguez; Viridiana Villa-Islas; Alicia R Martin; Pierre Luisi; Rosenda I Peñaloza-Espinosa; Celeste Eng; Scott Huntsman; Esteban G Burchard; Christopher R Gignoux; Carlos D Bustamante; Andrés Moreno-Estrada Journal: Mol Biol Evol Date: 2020-04-01 Impact factor: 16.240