| Literature DB >> 19619313 |
Sean Myles1, Mark Stoneking, Nic Timpson.
Abstract
BACKGROUND: Recent work has shown that population stratification can have confounding effects on genetic association studies and statistical methods have been developed to correct for these effects. Subsets of markers that are highly-differentiated between populations, ancestry-informative markers (AIMs), have been used to correct for population stratification. Often AIMs are discovered in one set of populations and then employed in a different set of populations. The underlying assumption in these cases is that the population under study has the same substructure as the population in which the AIMs were discovered. The present study assesses this assumption and evaluates the portability between worldwide populations of 10 SNPs found to be highly-differentiated within Britain (BritAIMs).Entities:
Year: 2009 PMID: 19619313 PMCID: PMC2719660 DOI: 10.1186/1755-8794-2-45
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fst values of the 10 BritAIMs and the associated P values.
| Global | Europe | Middle East | Africa | Central South Asia | East Asia | Oceania | America | |||||||||
| BritAIMs | Fst | Fst | Fst | Fst | Fst | Fst | Fst | Fst | ||||||||
| rs7696175 | 0.074 | 0.551 | 0.129 | 0.002 | 0.840 | 0.052 | 0.469 | 0.011 | 0.748 | 0.018 | 0.802 | 0.067 | 0.439 | 0.118 | 0.333 | |
| rs1460133 | 0.035 | 0.842 | 0.062 | 0.027 | 0.286 | 0 | 1 | 0.022 | 0.502 | 0.010 | 0.916 | NA | NA | 0.042 | 0.695 | |
| rs9378805 | 0.115 | 0.321 | 0.030 | 0.280 | 0.043 | 0.136 | 0.019 | 0.796 | 0.020 | 0.554 | 0.016 | 0.831 | 0.072 | 0.424 | 0.027 | 0.754 |
| rs11790408 | 0.067 | 0.594 | 0.043 | 0.141 | 0.088 | 0.028 | 0.714 | 0.046 | 0.169 | 0.009 | 0.920 | 0.015 | 0.709 | 0.108 | 0.370 | |
| rs12295525 | 0.068 | 0.591 | 0.034 | 0.226 | 0.008 | 0.666 | 0.047 | 0.513 | 0.024 | 0.454 | 0.017 | 0.809 | NA | NA | 0.021 | 0.820 |
| rs12797951 | 0.106 | 0.365 | 0 | 1 | 0.105 | 0.037 | 0.611 | 0.053 | 0.116 | 0.061 | 0.136 | NA | NA | 0.175 | 0.167 | |
| rs10774241 | 0.103 | 0.383 | 0 | 1 | 0 | 1 | 0.090 | 0.226 | 0.024 | 0.470 | 0.029 | 0.580 | 0.004 | 0.760 | 0.139 | 0.247 |
| rs17449560 | 0.036 | 0.835 | 0.017 | 0.520 | 0.043 | 0.135 | 0.112 | 0.146 | 0.003 | 0.909 | 0.017 | 0.814 | 0.199 | 0.210 | 0.061 | 0.581 |
| rs3760843 | 0.033 | 0.855 | 0.024 | 0.369 | 0 | 1 | 0.073 | 0.320 | 0.014 | 0.665 | 0.021 | 0.747 | 0.144 | 0.295 | 0.139 | 0.247 |
| rs2143877 | 0.078 | 0.529 | 0.046 | 0.117 | 0.011 | 0.585 | 0.053 | 0.460 | 0.004 | 0.892 | 0.015 | 0.857 | 0.005 | 0.754 | 0.038 | 0.714 |
Global Fst was calculated among the 7 continental regions listed as column titles. P values were generated by comparison to an empirical distribution of 2750 markers (see Methods). Significant P values (P < 0.05) are in bold.
Figure 1Global Fst distribution for 2750 random markers. The global Fst values of the 10 BritAIMs are indicated by vertical lines.
Figure 2Fst values of the 2750 random markers within Europe (A) and within the Middle East (B). The Fst values of the 10 BritAIMs are indicated by vertical lines. BritAIMs that lie within the top 5% of the empirical distribution are highlighted with asterisks.
Median Fst of the 10 BritAIMs within each continent and the associated P values
| Europe | 0.0316 | |
| Central South Asia | 0.021 | 0.574 |
| Africa | 0.0492 | 0.49 |
| Middle East | 0.01895 | 0.302 |
| East Asia | 0.017 | 0.992 |
| America | 0.08465 | 0.414 |
| Oceania | 0.067 | 0.362 |
P values generated by random sampling from the empirical distribution (see Methods). Significant P values (P < 0.05) are in bold.
Figure 3Levels of population differentiation for 2 BritAIMs with significantly high Fst within Europe (A) and 2 BritAIMs with significantly high Fst within the Middle East (B). Each box in each matrix represents a population pairwise Fst comparison. The shaded boxes in the matrices show which pairwise Fst values are significant compared to the empirical distribution at three P value thresholds (see the P value legend).
Figure 4Worldwide allele frequencies and population differentiation for rs7696175, the most highly differentiated BritAIM within Europe. The vertical bar chart displays the minor allele frequency in each of the populations represented in the CEPH-HGDP panel with sample sizes (number of individuals) on the left. The shaded boxes in the 53 × 53 and 7 × 7 matrices show which pairwise Fst values are significant compared to the empirical distribution at three P value thresholds (see the boxed-in P value legend) for the population by population comparison and the continent by continent comparison, respectively.