| Literature DB >> 19098910 |
Alon Keinan1, James C Mullikin, Nick Patterson, David Reich.
Abstract
Comparisons of chromosome X and the autosomes can illuminate differences in the histories of males and females as well as shed light on the forces of natural selection. We compared the patterns of variation in these parts of the genome using two datasets that we assembled for this study that are both genomic in scale. Three independent analyses show that around the time of the dispersal of modern humans out of Africa, chromosome X experienced much more genetic drift than is expected from the pattern on the autosomes. This is not predicted by known episodes of demographic history, and we found no similar patterns associated with the dispersals into East Asia and Europe. We conclude that a sex-biased process that reduced the female effective population size, or an episode of natural selection unusually affecting chromosome X, was associated with the founding of non-African populations.Entities:
Mesh:
Year: 2008 PMID: 19098910 PMCID: PMC2612098 DOI: 10.1038/ng.303
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Frequency differentiation between Africans and non-Africans is higher on chromosome X than is expected from the autosomes
| Autosomes | Chromosome X | Comparison of autosomes and chromosome X | ||||
|---|---|---|---|---|---|---|
| (SNPs ascertained in bold population) | SNPs | FST | SNPs | FST | Observed autosome-to-X genetic drift ratio | P-value for observed versus expected ¾ |
| 62,830 | 0.106 (.002) | 2,668 | 0.133 (.006) | 0.771 (.036) | 0.57 | |
| 45,423 | 0.098 (.002) | 1,247 | 0.131 (.009) | 0.715 (.050) | 0.48 | |
| 13,606 | 0.178 (.003) | 1,087 | 0.256 (.010) | 0.615 (.030) | 6.5×10−6 | |
| 13,606 | 0.144 (.003) | 1,087 | 0.221 (.009) | 0.582 (.030) | 3.0×10−8 | |
Notes: Analyses are based on SNPs ascertained in two chromosomes of individuals from the ancestry indicated in bold. With this ascertainment strategy, estimates the ratio of autosome-to-X genetic drift. This estimate is independent of the demographic history of the population not used to ascertain SNPs, so that unlike the sequence diversity ratio (Table 2), we do not need to carry out modeling of demographic history to determine the autosome-to-X expected relationship. For the comparisons involving West Africans, the autosome-to-X genetic drift ratio is significantly below the expected ¾ (standard errors are based on 1,000 moving block bootstraps and account for linkage disequilibrium between SNPs; Supp. Methods).
Figure 1The distribution of allele frequencies on chromosome X does not match the expectation from the autosomes in non-African populations. (a) Distribution of derived allele frequencies on the autosomes compared with the expectation for our best fit models of history8, for SNPs discovered in two chromosomes of an ancestry and then genotyped in a larger number of samples of the same ancestry from HapMap. Derived allele refers to the new mutation, which we infer by requiring that both chimpanzee and orangutan match the other allele. (b) Analogous plots for chromosome X compare the observed data to the expectation from the autosomal best fit models after rescaling all effective population sizes by ¾ (‘Model 1’). While the fit of the model to the chromosome X data is excellent for West Africans, the fit of the models is poor in North Europeans and East Asians, both of which exhibit more high derived allele frequencies than expected from the models. ‘Model 2’ separately fits the out-of-Africa bottleneck on chromosome X, resulting in a more intense bottleneck than is expected from the autosomes even after adjusting for the ¾ expected difference in population size (Supp. Note). The fit to the observed data is greatly improved in Model 2 compared to Model 1, with mean squared errors reduced by 79% for North Europeans and 34% for East Asians. (The panels in this figure use different scales reflecting the numbers of sampled chromosomes used in each analysis; modeling adjusts for the differences in sample size.8)
Ratio of sequence diversity on chromosome X to the autosomes is reduced outside Africa
| Autosomes | Chromosome X | Comparison of autosomes and chromosome X | |||||
|---|---|---|---|---|---|---|---|
| Aligned bases (millions) | Divergent sites/base pair (×10-3) | Aligned bases (millions) | Divergent sites/base pair (×10-3) | Observed X-to-autosome ratio normalized by macaque divergence | Expected X-to-autosome ratio from best fit demographic model | P-value for difference between observed and expected | |
| West African | 641.7 | 1.081 (.005) | 22.0 | 0.722 (.017) | 0.763 (.026) | 0.780 (.001) | 0.514 |
| North European | 657.2 | 0.827 (.004) | 34.6 | 0.460 (.013) | 0.635 (.024) | 0.702 (.004) | 0.005 |
| East Asian | 296.8 | 0.772 (.005) | 23.5 | 0.414 (.014) | 0.613 (.026) | 0.690 (.004) | 0.003 |
Notes: We aligned sequence from unrelated individuals from each population to calculate genetic diversity per base pair (Methods and Supp. Table 4). Standard errors are obtained by a jackknife method to account for correlated diversity in neighboring regions of the genome (Methods). The X-to-autosome ratio is obtained by dividing the chromosome X diversity column by the autosome diversity column, normalizing each by human-macaque divergence in the same section of the genome, and combining all standard errors appropriately (Supp. Methods). We also compared the observed data to the expectation from the best-fit autosomal demographic model8 (Supp. Note; standard errors reflect uncertainty in model fitting). Tests for significance use a two-sided z-test between the observed and expected value.
Figure 2Gene-centric natural selection, or natural selection localized to specific regions of chromosome X, fail to explain the signal of accelerated genetic drift. (a) Dividing the chromosome X and autosome data sets based on distance from the nearest gene, we find no evidence that the ratio of autosome-to-X genetic drift Q between West Africans and East Asians and between West Africans and North Europeans increases with distance, as would be expected if selection explains our results. In particular, the bin furthest from genes (>100kb) is significantly below the expectation of ¾ (horizontal black line; error bars indicate ±1 standard error). Dotted lines show the values for all data for comparison to the individual bins. A related plot for the sequence diversity data also shows no significant effect of distance from genes (Supp. Figure 2). (b,c) The ratio of autosome-to-X genetic drift Q between West Africans and (b) North Europeans (c) East Asians, presented spatially across chromosome X. For each of 40 bins 3 cM in size on chromosome X, Q is presented as the reciprocal of the ratio of genetic drift in that bin to the average across the autosomes. A triangle's gray level indicates the number of SNPs in the bin, which is between 15 and 60 (bins with fewer than 15 SNPs are excluded from the figure). Error bars indicate one standard error below and above Q, which we obtain by a jackknife over 5 windows of 0.6 cM. The reduction of Q below the expectation of ¾ (horizontal line) has a complex pattern. However, it is widely distributed across chromosome X rather than being localized to a few regions. This rules out the possibility that the signal of accelerated drift on chromosome X can be explained by selection at a small number of loci.