| Literature DB >> 25254375 |
Andrés Ruiz-Linares1, Kaustubh Adhikari1, Victor Acuña-Alonzo2, Mirsha Quinto-Sanchez3, Claudia Jaramillo4, William Arias4, Macarena Fuentes5, María Pizarro5, Paola Everardo6, Francisco de Avila6, Jorge Gómez-Valdés7, Paola León-Mimila7, Tábita Hunemeier8, Virginia Ramallo8, Caio C Silva de Cerqueira8, Mari-Wyn Burley1, Esra Konca1, Marcelo Zagonel de Oliveira8, Mauricio Roberto Veronez9, Marta Rubio-Codina10, Orazio Attanasio11, Sahra Gibbon12, Nicolas Ray13, Carla Gallo14, Giovanni Poletti14, Javier Rosique15, Lavinia Schuler-Faccini8, Francisco M Salzano8, Maria-Cátira Bortolini8, Samuel Canizales-Quinteros16, Francisco Rothhammer5, Gabriel Bedoya4, David Balding1, Rolando Gonzalez-José3.
Abstract
The current genetic makeup of Latin America has been shaped by a history of extensive admixture between Africans, Europeans and Native Americans, a process taking place within the context of extensive geographic and social stratification. We estimated individual ancestry proportions in a sample of 7,342 subjects ascertained in five countries (Brazil, Chile, Colombia, México and Perú). These individuals were also characterized for a range of physical appearance traits and for self-perception of ancestry. The geographic distribution of admixture proportions in this sample reveals extensive population structure, illustrating the continuing impact of demographic history on the genetic diversity of Latin America. Significant ancestry effects were detected for most phenotypes studied. However, ancestry generally explains only a modest proportion of total phenotypic variation. Genetically estimated and self-perceived ancestry correlate significantly, but certain physical attributes have a strong impact on self-perception and bias self-perception of ancestry relative to genetically estimated ancestry.Entities:
Mesh:
Year: 2014 PMID: 25254375 PMCID: PMC4177621 DOI: 10.1371/journal.pgen.1004572
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Sample size, proportion of women, age, estimated admixture proportions and phenotypic features of the study sample.
| Brazil | Chile | Colombia | México | Perú | Total | |
| N | 1,594 | 1,561 | 1,659 | 1,622 | 906 | 7,342 |
| Women | 0.67 | 0.34 | 0.56 | 0.61 | 0.60 | 0.55 |
| Age | 25 | 24 | 23 | 25 | 20 | 23 |
| American Ancestry | 0.09 | 0.48 | 0.29 | 0.56 | 0.64 | 0.38 |
| African Ancestry | 0.09 | 0.05 | 0.11 | 0.05 | 0.00 | 0.06 |
| European Ancestry | 0.82 | 0.49 | 0.60 | 0.37 | 0.29 | 0.52 |
| Head circumference (cm) | 55/57 | 56/57 | 54/56 | 55/57 | 55/57 | 55/57 |
| Height (cm) | 162/175 | 159/172 | 160/173 | 158/172 | 158/171 | 160/172 |
| Hip circumference (cm) | 98/100 | 100/102 | 94/95 | 95/97 | 96/99 | 97/99 |
| Melanin Index | 32/32 | 36/35 | 34/33 | 36/35 | 37/37 | 34/35 |
| Waist circumference (cm) | 75/87 | 77/90 | 77/81 | 81/87 | 80/88 | 78/87 |
| Weight (kg) | 60/76 | 61/76 | 56/70 | 60/74 | 56/71 | 59/74 |
|
| ||||||
| (1) No baldness | 70 | 76 | 88 | 65 | 86 | 72 |
| (2) Some baldness | 30 | 24 | 12 | 35 | 14 | 28 |
|
| ||||||
| (1) No graying | 78/65 | 80/82 | 90/87 | 70/67 | 92/88 | 81/78 |
| (2) Some graying | 22/35 | 20/18 | 10/13 | 30/33 | 8/12 | 19/22 |
|
| ||||||
| (1) Blue/Grey | 8/8 | 1/3 | 2/2 | 1/1 | 0/0 | 3/3 |
| (2) Honey | 4/2 | 4/5 | 10/11 | 3/2 | 1/2 | 5/5 |
| (3) Green | 14/15 | 4/10 | 8/8 | 4/6 | 3/2 | 7/9 |
| (4) Light brown | 19/20 | 9/9 | 16/15 | 21/21 | 11/8 | 16/14 |
| (5) Dark brown/Black | 55/55 | 83/74 | 64/64 | 72/71 | 85/87 | 68/70 |
|
| ||||||
| (1) Red/reddish | 1/1 | 1/0 | 1/0 | 0/0 | 0/0 | 1/1 |
| (2) Blond | 7/5 | 3/1 | 2/2 | 2/1 | 1/0 | 3/2 |
| (3) Dark blond/light brown | 35/28 | 18/11 | 16/12 | 21/12 | 10/5 | 22/14 |
| (4) Brown/Black | 57/65 | 78/87 | 81/85 | 77/86 | 90/95 | 75/84 |
|
| ||||||
| (1) Straight | 41/41 | 48/62 | 39/33 | 46/45 | 45/41 | 43/47 |
| (2) Wavy | 38/34 | 38/32 | 39/38 | 41/43 | 42/38 | 39/37 |
| (3) Curly | 18/22 | 10/4 | 20/27 | 12/12 | 12/20 | 15/15 |
| (4) Frizzy | 3/4 | 3/1 | 2/2 | 1/0 | 1/1 | 2/2 |
Note: Values shown are medians except for categorical traits where the numbers indicate percentages in that category. Data for women is shown in the numerator (except for Male pattern baldness). For the regression analyses (Tables 2 and 3 below) categorical phenotypes 15–17 were considered ordinal variables with 4 or 5 ordered integer levels as specified here (see Methods). Individual ancestry histograms for each country are presented in Text S1.
Multiple linear regression of physical appearance traits on European and African ancestry.
| European ancestry | African ancestry | |||||
| Trait | Coef. | P-value | Coef. | P-value | %R2 | % Δ R2 |
| 1. Weight | 0.35 | 0.70 | 0.95 | 0.54 | 32 | 0 |
| 2. Height | 7.31 |
| 8.14 |
| 55 | 2 |
| 3. Hip circumference | −0.03 | 0.96 | 0.18 | 0.87 | 11 | 0 |
| 4. Waist circumference | −4.69 |
| −6.46 |
| 26 | 1 |
| 5. Head circumference | −0.03 | 0.88 | 0.81 | 0.02 | 20 | 3 |
| 6. Melanin Index | −10.05 |
| 11.89 |
| 25 | 19 |
| 7. Balding | 0.12 |
| 0.15 |
| 23 | 1 |
| 8. Hair shape | 0.47 |
| 2.41 |
| 11 | 8 |
| 9. Eye color | −1.26 |
| 0.24 | 0.08 | 11 | 5 |
| 10. Graying | 0.06 | 0.01 | 0.11 | 0.01 | 44 | 0 |
| 11. Hair color | −0.62 |
| −0.01 | 0.87 | 12 | 5 |
| 12. Eye fold | −0.37 |
| −0.37 |
| 27 | 1 |
| 13. Centroid Size | −12.98 |
| 2.94 | 0.62 | 48 | 0 |
| 14. PC-1(19%) | 0.02 |
| 0.04 |
| 15 |
|
| 15. PC-2(12%) | −0.01 | 1.00E-03 | −0.02 |
| 2 | 0 |
| 16. PC-3(10%) | −0.01 |
| 0 | 0.84 | 21 |
|
| 17. PC-4(7%) | 0.01 |
| −0.02 |
| 18 |
|
| 18. PC-5(7%) | 0.01 |
| 0 | 0.6 | 5 | 0 |
Note: All regressions account for age, sex, country, education and wealth. Regressions for facial features (traits 13 to 18) also account for BMI and height. %R2 refers to trait variance explained by a regression model incorporating European and African ancestry (being proportions, European, African and American ancestries sum up to 1 and since in this sample African ancestry is very low (median of 7%), we use Native American ancestry as a baseline). %Δ R2 refers to the difference in variance explained by this full model and a model without ancestry as a predictor. P-Values <10−3 are shown in bold italic. The facial features (traits 13 to 18) refer to morphogeometric summaries of face variation derived from 3D landmark coordinates (see Methods). PC = Principal Components of the procrustes 3D landmark coordinates (% in parenthesis refer to variance explained by that PC).
Multiple linear regression of the difference (Δ) between self-perceived and genetically estimated ancestry for the three continental components.
| Δ AFRICA | Δ AMERICA | Δ EUROPE | ||||
| Coef. | P-value | Coef. | P-value | Coef. | P-value | |
| European Ancestry* | −0.01 | 0.48 | −0.45* |
| −0.52 |
|
| African Ancestry | −0.23 |
| −0.07 | 0.02 | 0.07 | 0.04 |
| Country-Chile | −0.05 |
| 1.59E-03 | 0.85 | −0.03 | 1.51E-03 |
| Country-Colombia | −0.03 |
| 0.04 |
| −0.09 |
|
| Country-México | −0.05 |
| 0.02 | 0.01 | 2.95E-03 | 0.79 |
| Country-Perú | −0.35 |
| −0.03 | 0.78 | −0.80 |
|
| Wealth | −1.45E-04 | 0.81 | 2.63E-04 | 0.72 | 2.91E-03 |
|
| Education | −2.17E-03 | 0.45 | 3.96E-03 | 0.25 | 1.56E-03 | 0.69 |
| Age | 1.75E-04 | 0.59 | 8.45E-04 | 0.03 | 7.87E-04 | 0.08 |
| Sex | −0.01 | 0.03 | 1.40E-03 | 0.83 | 0.02 | 1.16E-03 |
| Balding | 0.01 | 0.14 | 0.01 | 0.29 | −0.01 | 0.39 |
| Hair graying | −0.01 | 0.32 | −0.02 | 4.23E-03 | −0.01 | 0.27 |
| Height | −3.79E-04 | 0.19 | −7.35E-04 | 0.04 | 1.81E-03 |
|
| Melanin index | 2.55E-03 |
| 2.97E-03 |
| −0.01 |
|
| Hair shape | 0.02 |
| −0.01 |
| −2.93E-03 | 0.38 |
| Eye color | 4.26E-03 | 0.02 | 0.01 |
| −0.02 |
|
| Hair color | 0.01 | 0.01 | 0.02 |
| −0.03 |
|
| Eyefold | 0.00 | 0.68 | 0.01 | 4.01E-03 | −0.02 |
|
| PC1 | 0.10 | 0.36 | −0.30 | 0.03 | 0.33 | 0.03 |
| PC2 | −0.10 | 0.44 | −0.03 | 0.87 | 0.03 | 0.87 |
| PC3 | 0.38 | 0.02 | 0.31 | 0.12 | −0.98 |
|
| PC4 | −0.45 | 0.02 | −0.03 | 0.91 | 0.91 |
|
| PC5 | −0.03 | 0.86 | 0.35 | 0.10 | 0.14 | 0.57 |
NOTE: Δ refers to self-perception (bands 1 to 5, see Methods) minus continental ancestry proportions (0–100%) estimated from the genetic data. Sex and country of sampling were incorporated in the analyses as factors while the other variables were treated as quantitative. For ease of interpretation, the regression coefficient and p-value for Δ AMERICA (*) refer to Native American (not European) ancestry.
Figure 1Geographic distribution of Native American (blue), African (green) and European (red) ancestry based on individual estimates for samples from (A) Brazil, (B) Chile, (C) Colombia, (D) México and (E) Perú.
To facilitate comparison, color intensity transitions occur at 10% ancestry intervals for all maps. The birthplace of individuals are indicated by purple dots on the African ancestry map. Sampling density is shown in Figure S4. Maps were obtained using Kriging interpolation as detailed in the text.
Figure 2Bar plots contrasting skin pigmentation (Melanin Index) to proportion of European genetic ancestry across four self-identified ethno/racial categories in samples from Brazil, Chile, Colombia, México and Perú.
Sample sizes and all estimates of pigmentation and ancestry, are presented in Table S4. In Perú no individual self-identified as “Black”.
Figure 3Vertical histograms (pyramid plots) showing the distribution of genetic ancestry for each of the five self-perceived ancestry categories.
As reference, (A) shows the theoretical case of agreement between self-perceived and genetically estimated ancestry. For this plot random values were drawn from a beta distribution such that, for each self-perception band, the median ancestry lies at the centre of a 0.2 interval containing 75% of the simulated ancestry values. The number of simulated values was fixed at our sample size. Panels (B), (C) and (D) show respectively, the observed distributions for European, Native American and African ancestries. The red diagonal line indicates the midpoint, on the genetic ancestry scale, of each self-perceived ancestry category. Distributions are coded in three shades of orange: the darkest shade denotes the central quartiles (the median shown as a brown line), the medium-shade indicates the 5%–95% range, and the lightest shade refers to samples outside this range. For European ancestry, self-perception tends to underestimate genetic ancestry (the distributions are mostly above the diagonal). By contrast, self-perception tends to overestimate African ancestry (the distributions are mostly below the diagonal). At increasing levels of Native American genetic ancestry self-perception first underestimates then overestimates genetic ancestry (the distributions are on both sides of the diagonal). Simulations and plots were carried out using MATLAB [61].