| Literature DB >> 30568240 |
Juan-Camilo Chacón-Duque1, Kaustubh Adhikari1, Macarena Fuentes-Guajardo1,2, Javier Mendoza-Revilla1,3, Victor Acuña-Alonzo1,4, Rodrigo Barquera4,5, Mirsha Quinto-Sánchez6, Jorge Gómez-Valdés7, Paola Everardo Martínez8, Hugo Villamil-Ramírez9, Tábita Hünemeier10, Virginia Ramallo11,12, Caio C Silva de Cerqueira12, Malena Hurtado3, Valeria Villegas3, Vanessa Granja3, Mercedes Villena13, René Vásquez14, Elena Llop15, José R Sandoval16, Alberto A Salazar-Granara16, Maria-Laura Parolin17, Karla Sandoval18, Rosenda I Peñaloza-Espinosa19, Hector Rangel-Villalobos20, Cheryl A Winkler21, William Klitz22, Claudio Bravi23, Julio Molina24, Daniel Corach25, Ramiro Barrantes26, Verónica Gomes27,28, Carlos Resende27,28, Leonor Gusmão27,28,29, Antonio Amorim27,28,30, Yali Xue31, Jean-Michel Dugoujon32, Pedro Moral33, Rolando González-José11, Lavinia Schuler-Faccini12, Francisco M Salzano12, Maria-Cátira Bortolini12, Samuel Canizales-Quinteros9, Giovanni Poletti3, Carla Gallo3, Gabriel Bedoya34, Francisco Rothhammer15,35, David Balding1,36, Garrett Hellenthal37, Andrés Ruiz-Linares38,39.
Abstract
Historical records and genetic analyses indicate that Latin Americans trace their ancestry mainly to the intermixing (admixture) of Native Americans, Europeans and Sub-Saharan Africans. Using novel haplotype-based methods, here we infer sub-continental ancestry in over 6,500 Latin Americans and evaluate the impact of regional ancestry variation on physical appearance. We find that Native American ancestry components in Latin Americans correspond geographically to the present-day genetic structure of Native groups, and that sources of non-Native ancestry, and admixture timings, match documented migratory flows. We also detect South/East Mediterranean ancestry across Latin America, probably stemming mostly from the clandestine colonial migration of Christian converts of non-European origin (Conversos). Furthermore, we find that ancestry related to highland (Central Andean) versus lowland (Mapuche) Natives is associated with variation in facial features, particularly nose morphology, and detect significant differences in allele frequencies between these groups at loci previously associated with nose morphology in this sample.Entities:
Mesh:
Year: 2018 PMID: 30568240 PMCID: PMC6300600 DOI: 10.1038/s41467-018-07748-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Reference population samples and SOURCEFIND ancestry estimates for the five Latin American countries examined. a Colored pies and grey dots indicate the approximate geographic location of the 117 reference population samples studied. These samples have been subdivided on the world map into five major bio-geographic regions: Native Americans (38 populations), Europeans (42 populations), East/South Mediterraneans (15 populations), Sub-Saharan Africans (15 populations) and East Asians (7 populations). The coloring of pies represents the proportion of individuals from that population included in one of the 35 reference groups defined using fineSTRUCTURE (these groups are listed in the color-coded insets for each region; Supplementary Fig. 2). The small dark grey dots indicate reference populations not inferred to contribute ancestry to the CANDELA sample. b–d refer to the CANDELA dataset. b, c show, respectively, the average estimated proportion of sub-continental Native American and European ancestry components in individuals with >5% total Native American or European ancestry in each country sampled; the stacked bars are color-coded as for the reference population groups shown in the insets of (a). d shows boxplots of the estimated sub-continental ancestry components for individuals with >5% total Sephardic/East/South Mediterranean ancestry. In this panel colors refer to countries as for the colored country labels shown in (a). Following standard convention for boxplots, the center line denotes the median, the box boundaries represent the first and the third quartiles, and the whiskers range to 1.5 times the inter-quartile range on either side. Outlying points are plotted individually
Fig. 2Geographic variation of ancestry sub-components in Latin American individuals. a Native American, b European, and c East/South Mediterranean. Each pie represents an individual, with pie location corresponding to birthplace. Since many individuals share birthplace, jittering has been performed based on pie size and how crowded an area is. Pie size is proportional to total ancestry from all sources depicted in that specific figure, and only individuals with >5% of such total ancestry are shown. Coloring of pies represents the proportion of each sub-continental component estimated for each individual (color-coded as in Fig. 1; Chaco2 does not contribute >5% to any individual and was excluded). Pies in (c) have been enlarged to facilitate visualization
Fig. 3Times since admixture estimated using GLOBETROTTER. a Top: frequency distribution of admixture times for individuals in which a single admixture event between Native and European sources was inferred (dashed line indicates the mean). Bottom: mean continental ancestry (%) as a function of time since admixture among these individuals. Only time bins including >20 individuals are shown. (NAM Native American, EUR European, ESM East/South Mediterranean, SSA Sub-Saharan African, EAS East Asian). b–e show contrasts of the distribution of admixture times involving Iberian versus other sources: b North-West Europe and Italy, c East Mediterranean and Sephardic, d Sub-Saharan Africa and e East Asia. p-values for comparing the mean date of Iberian versus each other ancestry source are from a one-sided Mann–Whitney U test, and numbers of inferred admixture events are given in parenthesis
Fig. 4Effect of sub-continental genetic ancestry on physical appearance. a Regression –log p-values for 28 traits (Supplementary Note 4) against the contrast between two sub-continental ancestry components estimated by SOURCEFIND. The left column shows results for the Portugal/West-Spain versus North-West Europe contrast in the Brazilian sample (Br). The right column presents the contrast between Central Andes versus Mapuche ancestry in the full CANDELA sample. b Regression coefficients (Betas) in units of SD for the contrasts in (a). In a, b color intensity reflects variation in -Log-p values or beta coefficients, as indicated on the scale. Bonferroni-corrected significant values are highlighted with a dot (–log p-value threshold of 3.05 for alpha = 0.05). c, d Display scatterplots and regression lines (with 95% confidence intervals) for two traits showing significant association with variation in sub-continental ancestry: skin melanin index in Brazilians (c) and nose bridge breadth in Chileans and Peruvians (d; Y-axis is in Procrustes units). e Scatterplot of -log p-values from follow-up analyses of the regression of physical traits on the Central Andes versus Mapuche ancestry contrast. The X-axis refers to -log p-values from the primary analyses (using SOURCEFIND (SF) estimates and data for all individuals, as shown in the second column of (a)). The Y-axis refers to -log-p values from four other regression analyses: using SOURCEFIND (SF) estimates restricted to Peruvian and Chilean individuals, or only to Chileans; using related ancestry components defined by: ADMIXTURE (ADMIX., at K = 7) in all the CANDELA data, or by PCA (PC 7), in an analysis limited to Chileans (Supplementary Note 5, Supplementary Figures 8 and 9). Sample sizes: all data N = 5,794, Peruvians and Chileans N = 2,594, Chileans N = 1,542