| Literature DB >> 22439654 |
Tamara Hofer1, Matthieu Foll, Laurent Excoffier.
Abstract
BACKGROUND: Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure.Entities:
Mesh:
Year: 2012 PMID: 22439654 PMCID: PMC3317871 DOI: 10.1186/1471-2164-13-107
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Detection of outlier SNPs based on . Joint distributions of Fand Hfor SNPs on chromosome 4 (A, B) and genome-wide distribution of SNP z-values (C, D). Locus specific p-values and z-values were computed either based on the hierarchical island model (A, C) or the finite island model (B, D). Chromosome 4 is representative of the joint distribution of these statistics on all other chromosomes.
Properties of genomic islands.
| High-differentiation islands (HDIs) | Low-differentiation islands | |||
|---|---|---|---|---|
| No. of islands | 625 | 197 | ||
| Length (bp) | 465,756 | 11,037 - 15,210,531 | 368,902 | 25,709 - 2,021,127 |
| No. of SNPs | 73.61 | 4 - 322 | 90.76 | 10 - 430 |
| No. of genes | 5.35 | 0 - 92 | 3.45 | 0 - 37 |
| Mean Recombination rate | 1.04 | 0.00 - 6.31 | 1.31 | 0.04 - 7.48 |
| Mean FST | 0.23 | 0.17 - 0.43 | 0.09 | 0.06 - 0.14 |
| Mean heterozygosity | 0.23 | 0.09 - 0.33 | 0.31 | 0.18 - 0.42 |
| Mean | 0.88 | 0.32 - 2.63 | -0.77 | -1.41 - -0.32 |
Properties of islands with significant levels of population differentiation as detected by the HMM approach
Figure 2Distribution of HDIs and LDIs. Genomic distribution of HDIs (red) and LDIs (blue). Chromosomes are represented by lines with a filled circle at the centromere. Each circle represents a significant genomic island, as identified by a Hidden Markov Model.
Figure 3Examples of genomic islands. Each panel shows a genomic island that includes a significantly differentiated non-synonymous SNP. On top, the HDI or LDI is shown in details with all the SNPs and genes located in that region. SNPs are coloured according to their significance level in the selection test and FDR-SNPs are indicated by red vertical lines. The oblique arrow points at the non-synonymous SNP, and the vertical arrow indicates the gene embedding this SNP. The allele frequencies of the non-synonymous SNP in the HGDP populations are shown on the map below. Most common patterns of significant differentiation are found between continental groups, such as African vs. non-Africans (A), between Eurasian vs. non-Eurasians (B), or between East Asian and American vs. African and Eurasian populations (C). SNPs in low-differentiation islands tend to have similar allele frequencies in all populations (D).
Figure 4Continental groups showing the largest degree of genetic differentiation. For each FDR-SNP, we determined the allele frequency difference between populations in each continental group (Africa, Eurasia, East-Asia and America) and the rest of the world, and we assigned it to the continental group that showed the highest frequency difference. The histogram reports the number of FDR-SNPs in HDIs assigned to each group.
Enrichment of genic and non-synonymous SNPs in genomic islands.
| SNP subset | Genic | Non-synonymous | |
|---|---|---|---|
| High population differentiation | p-value(selection test) < = 0.01a | 0.6576 | 0.1307 |
| Most likely states (Viterbi)b | < 0.0001+ | 0.0002+ | |
| FDR < = 0.001c | < 0.0001+ | 0.0374+ | |
| Islandsd | < 0.0001+ | 0.0053+ | |
| Low population differentiation | p-value(selection test) < = 0.01a | 0.2143 | 0.6402 |
| Most likely states (Viterbi)b | 0.0303- | 0.5888 | |
| FDR < = 0.001c | 0.0007- | 0.6543 | |
| Islandsd | < 0.0001- | 0.0083+ | |
Fisher exact test for enrichment or depletion of genic and non-synonymous SNPs in genomic islands defined by the selection test or by the HMM approach
a subset of SNPs that are significant in the selection test at the 1% level
b subset of SNPs that belong to the high-differentiation or low-differentiation state as identified by the Viterbi algorithm in the first step of the HMM approach
c subset of SNPs that are assigned to the high-differentiation or low-differentiation state under a genome-wide FDR = 0.001 in the second step of the HMM approach
d subset of SNPs that are located within HDIs or LDIs
+ enrichment of SNP category in SNP class
- under-representation of SNP category in SNP class