| Literature DB >> 29044207 |
Sandra Romero-Hidalgo1, Adrián Ochoa-Leyva1,2, Alejandro Garcíarrubio2, Victor Acuña-Alonzo3, Erika Antúnez-Argüelles1, Martha Balcazar-Quintero4, Rodrigo Barquera-Lozano3, Alessandra Carnevale1, Fernanda Cornejo-Granados2, Juan Carlos Fernández-López1, Rodrigo García-Herrera1, Humberto García-Ortíz1, Ángeles Granados-Silvestre5, Julio Granados6, Fernando Guerrero-Romero7, Enrique Hernández-Lemus1, Paola León-Mimila1,5, Gastón Macín-Pérez3, Angélica Martínez-Hernández1, Marta Menjivar1,5,8, Enrique Morett1, Lorena Orozco1, Guadalupe Ortíz-López8, Fernando Pérez-Villatoro1,9, Javier Rivera-Morales3, Fernando Riveros-McKay1,9, Marisela Villalobos-Comparán1, Hugo Villamil-Ramírez1,5, Teresa Villarreal-Molina1, Samuel Canizales-Quinteros10,11, Xavier Soberón12,13.
Abstract
Understanding the genetic structure of Native American populations is important to clarify their diversity, demographic history, and to identify genetic factors relevant for biomedical traits. Here, we show a demographic history reconstruction from 12 Native American whole genomes belonging to six distinct ethnic groups representing the three main described genetic clusters of Mexico (Northern, Southern, and Maya). Effective population size estimates of all Native American groups remained below 2,000 individuals for up to 10,000 years ago. The proportion of missense variants predicted as damaging is higher for undescribed (~ 30%) than for previously reported variants (~ 15%). Several variants previously associated with biological traits are highly frequent in the Native American genomes. These findings suggest that the demographic and adaptive processes that occurred in these groups shaped their genetic architecture and could have implications in biological processes of the Native Americans and Mestizos of today.Entities:
Mesh:
Year: 2017 PMID: 29044207 PMCID: PMC5647344 DOI: 10.1038/s41467-017-01194-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Map of Mexico. The figure shows the geographical origin of each NA and unrelated Mestizo individuals included in the present study. The region of Mesoamerica is shaded in gray. TAR (Tarahumara), TEP (Tepehuano), NAH (Nahua), TOT (Totonaca), ZAP (Zapoteca), MAY (Maya), and MES (Mestizo)
Fig. 2Multidimensional scaling plots and admixture analysis. a MDS plot for components 1 and 2 of 12 Native Americans of Mexico combined with continental (CEU and YRI from HapMap) and NA reference populations. b MDS plot for components 1 and 3, separating the NA populations of Mexico. The 12 sequenced NA samples are shown in gray, and population labels are as described in Supplementary Table 8. c Global ancestry proportions of NA and continental reference populations assuming K = 10. d Global ancestry proportions of the 12 NA individuals assuming K = 10. The NA individuals are displayed North-to-South, and the Mestizo individuals are displayed at the far right. Tar1 and Tar2 (Tarahumara), Tep1 and Tep2 (Tepehuano), Nah1 and Nah2 (Nahua), Tot1 and Tot2 (Totonaca), Zap1 and Zap2 (Zapoteca), May1 and May2 (Maya), Mes1 and Mes2 (Mestizo)
Characteristics of Native American and Mestizo individuals
| ID | Population | Location of origin | Linguistic group | Y chromosome | mtDNA haplogroup |
|---|---|---|---|---|---|
| Tar1 | Tarahumara | Northern Mexico | Uto-Aztecan | Q1a2a1b | C |
| Tar2 | — | C1c1a | |||
| Tep1 | Tepehuano | Northern Mexico | Uto-Aztecan | Q1a2a1a1 | C1b10 |
| Tep2 | Q1a2a1a1 | A2c | |||
| Nah1 | Nahua | Central Mexico | Uto-Aztecan | Q1a2a1a1 | B2 |
| Nah2 | — | A2B | |||
| Tot1 | Totonaca | Central Mexico | Totonac | Q1a2a1a1 | C1c2 |
| Tot2 | — | A2u | |||
| Zap1 | Zapoteca | Southwestern Mexico | Oto-Manguean | — | A2m |
| Zap2 | — | A2 | |||
| May1 | Maya | Southeast Mexico | Mayan | — | A2 |
| May2 | — | C | |||
| Mes1 | Mestizo | Central Mexico | Spanish | R1b1a2a1a2b1a1 | A2g |
| Mes2 | — | C |
Fig. 3Inference of population sizes from whole-genome NA sequences. a Effective population size estimates from autosomes of six Native American (NA) individuals and an African, European, and Asian genome from the 1KG project, inferred using pairwise sequentially Markovian coalescent (PSMC) models. b Effective population size estimates from four haplotypes (two phased individuals from each of the six NA and 1KG populations), inferred using multiple sequentially Markovian coalescent (MSMC) models. c Effective population size estimates from eight haplotypes [four phased individuals from each of NA Northern (Tepehuanos and Tarahumaras), NA Southern (Nahuas and Zapotecas) and 1KG populations], inferred using MSMC models
Fig. 4Enrichment analysis for disease-associated genes. Annotation matrix for disease-associated genes enriched in different populations shown as a heatmap, red indicates disease terms significantly enriched. Disease terms enrichment was analyzed by WebGestalt using the hypergeometric test for enrichment evaluation analysis. P-values were adjusted using Benjamini & Hochberg method. MES (Mestizo), TEP (Tepehuano), ZAP (Zapoteca), MAY (Maya), NAH (Nahua), TOT (Totonaca), TAR (Tarahumara), HPR TAR (high physical resistance)