| Literature DB >> 28855283 |
Andrew B Conley1,2, Lavanya Rishishwar1,2, Emily T Norris1,2,3, Augusto Valderrama-Aguirre2,4, Leonardo Mariño-Ramírez2,5, Miguel A Medina-Rivas2,6, I King Jordan7,2,3.
Abstract
At least 20% of Colombians identify as having African ancestry, yielding the second largest population of Afro-descendants in Latin America. To date, there have been relatively few studies focused on the genetic ancestry of Afro-Latino populations. We report a comparative analysis of the genetic ancestry of Chocó, a state located on Colombia's Pacific coast with a population that is >80% Afro-Colombian. We compared genome-wide patterns of genetic ancestry and admixture for Chocó to six other admixed American populations, with an emphasis on a Mestizo population from the nearby Colombian city of Medellín. One hundred sample donors from Chocó were genotyped across 610,545 genomic sites and compared with 94 publicly available whole genome sequences from Medellín. At the continental level, Chocó shows mostly African genetic ancestry (76%) with a nearly even split between European (13%) and Native American (11%) fractions, whereas Medellín has primarily European ancestry (75%), followed by Native American (18%) and African (7%). Sample donors from Chocó self-identify as having more African ancestry, and conversely less European and Native American ancestry, than can be genetically inferred, as opposed to what we previously found for Medellín, where individuals tend to overestimate levels of European ancestry. We developed a novel approach for subcontinental ancestry assignment, which allowed us to characterize subcontinental source populations for each of the three distinct continental ancestry fractions separately. Despite the clear differences between Chocó and Medellín at the level of continental ancestry, the two populations show overall patterns of subcontinental ancestry that are highly similar. Their African subcontinental ancestries are only slightly different, with Chocó showing more exclusive shared ancestry with the modern Yoruba (Nigerian) population, and Medellín having relatively more shared ancestry with West African populations in Sierra Leone and Gambia. Both populations show very similar Spanish ancestry within Europe and virtually identical patterns of Native American ancestry, with main contributions from the Embera and Waunana tribes. When the three subcontinental ancestry components are considered jointly, the populations of Chocó and Medellín are shown to be most closely related, to the exclusion of the other admixed American populations that we analyzed. We consider the implications of the existence of shared subcontinental ancestries for Colombian populations that appear, at first glance, to be clearly distinct with respect to competing notions of national identity that emphasize ethnic mixing (mestizaje) vs. group-specific identities (multiculturalism).Entities:
Keywords: Admixture; Afro-Colombian; Afro-Latino; Comparative Genomics; Genetic Ancestry; Human Genomics; Population Genetics
Mesh:
Year: 2017 PMID: 28855283 PMCID: PMC5633392 DOI: 10.1534/g3.117.1118
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Human populations analyzed in this study
| Dataset | Full Description | Short | Dataset | Full Description | Short | ||||
|---|---|---|---|---|---|---|---|---|---|
| Colombia ( | Chocoano in Quibdó, Colombia | Chocó | 94 | Mesoamerica ( | 1KGP | Mexican Ancestry in Los Angeles, California | Mexico | 64 | |
| 1KGP | Colombian in Medellin, Colombia | Medellín | 94 | 1KGP | Puerto Rican in Puerto Rico | Puerto Rico | 104 | ||
| Native American tribes | Kogi | 4 | HGDP | Native American tribes | Pima | 14 | |||
| Waunana | 3 | Tepehuano | 25 | ||||||
| Embera | 5 | Mixtec | 5 | ||||||
| Guahibo | 6 | HGDP | Maya | 21 | |||||
| Piapoco | 7 | Mixe | 17 | ||||||
| Inga | 9 | Zapotec | 43 | ||||||
| Ticuna | 6 | Kaqchikel | 13 | ||||||
| South America ( | 1KGP | Peruvian in Lima, Peru | Peru | 85 | Europe ( | 1KGP | Finnish in Finland | Finnish | 99 |
| Native American tribes | Aymara | 23 | 1KGP | British in England and Scotland | British | 90 | |||
| Guarani | 6 | 1KGP | Iberian populations in Spain | Spanish | 107 | ||||
| Quechua | 40 | 1KGP | Toscani in Italy | Tuscan | 107 | ||||
| HGDP | Surui | 8 | HGDP | Russian | 25 | ||||
| Wayuu | 11 | HGDP | Orcadian | 15 | |||||
| Africa ( | 1KGP | Esan in Nigeria | Esan | 99 | North American ( | HGDP | French | 28 | |
| 1KGP | Gambian in Western Division, The Gambia | Gambian | 113 | 1KGP | African Ancestry in Southwest United States | African American | 61 | ||
| 1KGP | Luhya in Webuye, Kenya | Luhya | 99 | 1KGP | African Caribbean in African Caribbean | African Caribbean | 96 | ||
| 1KGP | Mende in Sierra Leone | Mende | 85 | Native American tribes | Chipewyan | 15 | |||
| 1KGP | Yoruba in Ibadan, Nigeria | Yoruba | 108 | Cree | 4 | ||||
| HGDP | Mandenka | 22 | Ojibwa | 5 | |||||
| HGDP | Yoruba | 21 | Algonquin | 5 |
Populations are organized into continental groups, and the number of individuals in each population is shown.
1KGP, 1000 Genomes Project (phase 3 data release); HGDP, Human Genome Diversity Project Reich refers to a recent study of Native American genetic ancestry (2012).
Figure 1Comparison of continental ancestry for Chocó and Medellín. (A) Principal component analysis (PCA) projection of inter-individual genetic distance of admixed American populations, Chocó, Medellín, and African American, in relation to African, European, and Native American reference populations. (B) ADMXITURE plot showing genome-wide continental ancestry fractions for individuals from admixed American populations and global reference populations. Ancestry is shown as African (blue), European (orange), and Native American (red). The same ancestry color scheme is used throughout. (C) Box plots showing distributions of individuals’ continental ancestry fractions for the Chocó and Medellín populations.
Figure 2Genetic vs. self-identified ancestry in Chocó. (A) Box plots showing distributions of individuals’ self-identified vs. genetic continental ancestry fractions; P-values show the significance of the difference between the distributions. (B) Violin plots showing the distributions of the differences between self-identified and genetic continental ancestry.
Figure 3Modeling the timing of admixture events in Chocó. (A) Observed (points) and predicted (solid line) ancestry tract size distributions; the shaded areas represent 95% confidence intervals. (B) Admixture event timings are shown together with ancestry proportions. Each inferred admixture event is indicated by a circle, which is scaled according to the size of the contribution to the population and also shows the relative ancestry proportions. The y-axes of the charts show the inferred continental ancestry fractions, and the x-axes show time as the number of generations ago (GA).
Figure 4Ancestry-specific principal component analysis (PCA) for admixed American and global reference populations. PCAs are shown separately for (A) African, (B) European, and (C) Native American ancestry fractions. The locations of the dots correspond to population-specific centroids estimated from all individuals within each population. Population descriptions can be found in Table 1.
Figure 5Subcontinental ancestry of admixed American and global reference populations. Subcontinental ancestries for admixed American populations were inferred separately using (A) African, (B) European, and (C) Native American reference populations. ChromoPainter2 was used to paint reference and admixed individuals as mosaics of reference individuals. The estimated subcontinental ancestry derived from the reference populations is shown for each individual. Population descriptions can be found in Table 1.
Figure 6Native American subcontinental ancestry of admixed American populations. Spearman rank correlations (color coded as seen in the key) are shown for ChromoPainter2 painting vectors between all pairs of individuals from Native American reference populations (x-axis) and admixed American populations (y-axis). Native American reference populations are grouped according to their geographic origins as shown below the plot (Amz. – Amazonian). Population descriptions can be found in Table 1.
Figure 7Subcontinental ancestry profiles of admixed American populations. African, European and Native American ancestries are broken down into their major subcontinental fractions as shown. The sizes of the subcontinental ancestry circles represent the relative contributions to each admixed American population. The dendogram shows the relationships among the admixed American populations based on their subcontinental ancestry profiles. Population descriptions can be found in Table 1.