| Literature DB >> 30537949 |
Emily T Norris1,2,3, Lu Wang1, Andrew B Conley3, Lavanya Rishishwar1,2,3, Leonardo Mariño-Ramírez2,4, Augusto Valderrama-Aguirre2,5, I King Jordan6,7,8.
Abstract
BACKGROUND: Modern Latin American populations were formed via genetic admixture among ancestral source populations from Africa, the Americas and Europe. We are interested in studying how combinations of genetic ancestry in admixed Latin American populations may impact genomic determinants of health and disease. For this study, we characterized the impact of ancestry and admixture on genetic variants that underlie health- and disease-related phenotypes in population genomic samples from Colombia, Mexico, Peru, and Puerto Rico.Entities:
Keywords: Adaptive introgression; Admixture; Ancestry-enrichment; Disease; Genetic ancestry; Health; Immune system; Population genetics
Mesh:
Year: 2018 PMID: 30537949 PMCID: PMC6288849 DOI: 10.1186/s12864-018-5195-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Analysis scheme used for this study. (1) Genetic ancestry and admixture profiles were characterized for four Latin American populations. (2) Expected SNP frequencies in the admixed Latin American populations are calculated based on their ancestry profiles. (3) Ancestry-enriched SNPs are identified by comparing observed versus expected SNP allele frequencies in in the admixed Latin American populations. (4) Ancestry-enriched SNPs are mapped to genes, which in turn are used for gene set enrichment in order to identify impacted health-related pathways and phenotypes
Human populations analyzed in this study
| Dataset | Short | Full Description | n |
|---|---|---|---|
| 1KGP African ( | MSL | Mende in Sierra Leone | 85 |
| YRI | Yoruba in Ibadan, Nigeria | 108 | |
| 1KGP East Asian ( | CDX | Chinese Dai in Xishuangbanna, China | 93 |
| CHB | Han Chinese in Bejing, China | 103 | |
| CHS | Southern Han Chinese, China | 105 | |
| JPT | Japanese in Tokyo, Japan | 104 | |
| KHV | Kinh in Ho Chi Minh City, Vietnam | 99 | |
| 1KGP European ( | CEU | Utah residents with NW European ancestry | 99 |
| IBS | Iberian populations in Spain | 107 | |
| GBR | British in England and Scotland | 91 | |
| 1KGP Admixed American ( | CLM | Colombian in Medellin, Colombia | 94 |
| MXL | Mexican Ancestry in Los Angeles, California | 64 | |
| PEL | Peruvian in Lima, Peru | 85 | |
| PUR | Puerto Rican in Puerto Rico | 104 | |
| HGDP Native American ( | KRT | Karitiana in Brazil | 24 |
| SRI | Surui in Brazil | 21 | |
| COL | Colombians in Colombia | 13 | |
| MAY | Maya in Mexico | 25 | |
| PIM | Pima in Mexico | 25 |
Populations are organized into continental groups: African, East Asian, European, and Admixed American from the 1000 Genomes Project (1KGP) and Native American from the Human Genome Diversity Project (HGDP). Short names, descriptions, and the numbers of genomes analyzed are provided for each individual population
Fig. 2Genetic ancestry and admixture in Latin American populations. The ancestry contributions of putative ancestral source populations to four modern, admixed Latin American populations are shown. a Triangle plots showing the relative ancestry contributions – African, European, Native American – to admixed individuals from four Latin American populations. b PCA plot showing the genetic relationships among individuals from admixed Latin American populations compared to putative ancestral source populations. Each population is bounded by a minimum spanning ellipse. c Admixture plots showing the fractions of African, Native American and European ancestry among admixed individuals from four Latin American populations. Each individual is represented as a column with the admixture fractions color coded as shown in the legend. d Violin plots showing distributions of ancestry fractions among individuals from four Latin American populations. e Pie charts showing the average ancestry values for each population next to its geographic location
Fig. 3Ancestry-enriched SNPs in Latin American populations. An overview of the distributions of ancestry-enriched SNPs within and between the four admixed Latin American populations are shown, giving an indication of the overall numbers of ancestry-enriched SNPs along with the extent to which they are shared or unique to specific populations. a Cumulative distributions of ancestry-enrichment χ2 values for all SNPs in the four Latin American populations. Inset: Median χ2 values for each population ± standard error. b Venn diagram showing the number of genes with significant ancestry-enriched SNPs exclusive to one population and those shared by more than one population
Fig. 4Gene set enrichment analysis of ancestry-enriched SNP genes. Functionally coherent gene sets and pathways that are overrepresented with respect to ancestry-enriched SNPs are shown, giving an indication of the kinds of health-related phenotypes that have been shaped by genetic ancestry in the four admixed Latin American populations. a Heatmap showing significantly enriched functional gene sets (i.e., pathways and phenotypes) shared by two, three or all four Latin American populations. The cells are color coded by the log transformed statistical significance (FDR q-value) of gene set enrichment analysis results. b Network showing significantly enriched pathways and phenotypes shared by all four Latin American populations. Nodes sizes represent the number of ancestry-enriched genes in each set. Pie charts show how many genes in a given set are from each population. Color coding describes the functional enrichment category as shown in the legend
Fig. 5Pathways with ancestry-enriched SNP genes in functional categories of interest. These results highlight examples of specific health-related functions and pathways that have been shaped by genetic ancestry in the four admixed Latin American populations. For each functional category, a pathway schematic is shown, indicating the pathway genes and their roles, along with meta-analysis results and observed versus expected SNP frequencies for each population. a Leishmaniasis, an example of a disease and health related pathway. b Cytochrome P450 drug metabolism, an example of a metabolism related pathway. c Jak-STAT signaling pathway, an example of an immune-related pathway
Fig. 6Ancestry-specific effects on gene expression. These results give an indication of how ancestry-enriched SNPs can impact health-related phenotypes by virtue of their gene regulatory effects. SNP-by-ancestry interactions were characterized using an expression quantitative trait loci (eQTL) approach. Examples of a African-specific and b European-specific eQTL are shown along with c eQTL shared between populations. d eQTL related to immune system and infectious disease found in the African and European populations are shown in a CIRCOS plot with links indicated between eQTLs and their regulated genes. e Examples of immune-related pathways that include multiple eQTL-regulated genes for African and/or European populations