| Literature DB >> 31105740 |
Emily T Norris1,2,3, Lavanya Rishishwar1,2,3, Lu Wang1,3, Andrew B Conley2, Aroon T Chande1,2,3, Adam M Dabrowski1, Augusto Valderrama-Aguirre3,4, I King Jordan1,2,3.
Abstract
Assortative mating is a universal feature of human societies, and individuals from ethnically diverse populations are known to mate assortatively based on similarities in genetic ancestry. However, little is currently known regarding the exact phenotypic cues, or their underlying genetic architecture, which inform ancestry-based assortative mating. We developed a novel approach, using genome-wide analysis of ancestry-specific haplotypes, to evaluate ancestry-based assortative mating on traits whose expression varies among the three continental population groups - African, European, and Native American - that admixed to form modern Latin American populations. Application of this method to genome sequences sampled from Colombia, Mexico, Peru, and Puerto Rico revealed widespread ancestry-based assortative mating. We discovered a number of anthropometric traits (body mass, height, and facial development) and neurological attributes (educational attainment and schizophrenia) that serve as phenotypic cues for ancestry-based assortative mating. Major histocompatibility complex (MHC) loci show population-specific patterns of both assortative and disassortative mating in Latin America. Ancestry-based assortative mating in the populations analyzed here appears to be driven primarily by African ancestry. This study serves as an example of how population genomic analyses can yield novel insights into human behavior.Entities:
Keywords: admixture; assortative mating; genetic ancestry; mate choice; polygenic phenotypes; population genomics
Year: 2019 PMID: 31105740 PMCID: PMC6491930 DOI: 10.3389/fgene.2019.00359
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Human populations analyzed in this study.
| Dataseta | Geographical source | Short | Dataseta | Geographical source | ||||
|---|---|---|---|---|---|---|---|---|
| 1KGP | Esan in Nigeria | ESN | 99 | HGDP | Pima in Mexico | 14 | ||
| 1KGP | Gambian in Western Division, The Gambia | GWD | 113 | HGDP | Maya in Mexico | 21 | ||
| 1KGP | Luhya in Webuye, Kenya | LWK | 99 | Reich et al. | Tepehuano in Mexico | 25 | ||
| Africa ( | 1KGP | Mende in Sierra Leone | MSL | 85 | Reich et al. | Mixtec in Mexico | 5 | |
| 1KGP | Yoruba in Ibadan, Nigeria | YRI | 108 | Reich et al. | Mixe in Mexico | 17 | ||
| HGDP | Mandenka | 22 | Reich et al. | Zapotec in Mexico | 43 | |||
| HGDP | Yoruba | 21 | Reich et al. | Kaqchikel in Guatemala | 13 | |||
| 1KGP | Finnish in Finland | FIN | 99 | Reich et al. | Kogi in Colombia | 4 | ||
| 1KGP | British in England and Scotland | GBR | 90 | Reich et al. | Waunana in Colombia | 3 | ||
| 1KGP | Iberian populations in Spain | IBS | 107 | Reich et al. | Embera in Colombia | 5 | ||
| Europe ( | 1KGP | Toscani in Italy | TSI | 107 | Native American ( | Reich et al. | Guahibo in Colombia | 6 |
| HGDP | Russian | 25 | Reich et al. | Piapoco in Colombia | 7 | |||
| HGDP | Orcadian | 15 | Reich et al. | Inga in Colombia | 9 | |||
| HGDP | French | 28 | Reich et al. | Wayuu in Colombia | 11 | |||
| 1KGP | Colombian in Medellin, Colombia | CLM | 94 | HGDP | Karitiana in Brazil | 14 | ||
| 1KGP | Peruvian in Lima, Peru | PEL | 85 | HGDP | Suruí in Brazil | 8 | ||
| Admixed ( | 1KGP | Mexican Ancestry in LA, California | MXL | 64 | Reich et al. | Ticuna in Brazil | 6 | |
| 1KGP | Puerto Rican in Puerto Rico | PUR | 104 | Reich et al. | Quechua in Peru | 40 | ||
| Reich et al. | Aymara in Bolivia | 23 | ||||||
| Reich et al. | Guarani in Paraguay | 6 | ||||||
FIGURE 1Genetic ancestry proportions for the admixed Latin American populations studied here. For each population, distributions and average values are shown for African (blue), European (orange), and Native American (red) ancestry.
FIGURE 2Approach used to measure assortative mating on local ancestry. (A) Local ancestry is assigned for specific haplotypes across the genome: African (blue), European (orange), and Native American (red). (B) Within individual genomes, genes are characterized as homozygous or heterozygous for local ancestry. For any given population, at each gene locus, the assortative mating index (AMI) is computed from the observed and expected counts of homozygous and heterozygous gene pairs. (C) Data from genome-wide association studies (GWAS) are used to evaluate polygenic phenotypes. (D) Meta-analysis of AMI values is used to evaluate the significance of ancestry-based assortative mating for polygenic phenotypes.
FIGURE 3Overview of ancestry-based assortative mating in the four admixed Latin American populations analyzed here. (A) Distributions of observed and expected AMI values for all four populations. Inset: Mean observed and expected AMI values (±se) for all four populations. Significance between mean observed and expected AMI values (P = 8.12e-56) is indicated by two asterisks. (B) Observed and expected average AMI values (±se) across all polygenic phenotype gene sets are shown for each population. (C) Average AMI values (±se) for each population are shown for the three main phenotype functional categories characterized here: anthropometric, neurological, and human leukocyte antigen (HLA) genes.
FIGURE 4Phenotypes with statistically significant patterns of assortative mating within and among populations. (A) The top 20 phenotypes with the highest, and most statistically significant, assortative mating values (AMI) seen within any individual population. All AMI values shown are significant at P < 0.05, and the dashed line corresponds to a false discovery rate q-value cutoff of 0.05. (B) The observed AMI values for all trait-specific gene sets in each population (red lines) are compared to distributions of expected AMI values (black lines) based on random permutations of 10,000 gene sets. The top panels show the overall distributions of observed and permuted AMI values for each population, with steep peaks around the mean values for expected AMI. The bottom panels show a more narrow range of observed and expected AMI values for each population, which are centered around the population-specific mean expected AMI values. (C) The top 20 phenotypes with the highest or lowest, and most statistically significant, AMI variance levels across populations. Across population variance levels are normalized using the average AMI population variance level for all phenotypes. All AMI variance levels shown are significant at q < 0.05. The highest variance (most dissimilar patterns) of the AMI are at the top, while the lowest variance (most similar patterns) of AMI are at the bottom.
FIGURE 5Individual examples of ancestry-based assortative and disassortative mating. Results of meta-analysis of (dis)assortative mating on polygenic phenotypes along with their ancestry drivers are shown for (A) an anthropometric trait: Height, (B) a neurological trait: Schizophrenia, and (C) the immune-related HLA class I and II genes. The meta-analysis plots show pooled AMI odds ratio values along with their 95% CIs and P-values. Stars indicate false discovery rate q-values < 0.05. The ancestry driver plots show the extent to which individual ancestry components – African (blue), European (orange), and Native American (red) – have an excess (>0) or a deficit (<0) of homozygosity.
FIGURE 6Inter-individual ancestry variance for the four admixed Latin American populations analyzed here. Variance among individuals for the African (blue), European (orange), and Native American (red) ancestry fractions within each population are shown.