| Literature DB >> 24275494 |
Marc Pybus1, Giovanni M Dall'Olio, Pierre Luisi, Manu Uzkudun, Angel Carreño-Torres, Pavlos Pavlidis, Hafid Laayouni, Jaume Bertranpetit, Johannes Engelken.
Abstract
Searching for Darwinian selection in natural populations has been the focus of a multitude of studies over the last decades. Here we present the 1000 Genomes Selection Browser 1.0 (http://hsb.upf.edu) as a resource for signatures of recent natural selection in modern humans. We have implemented and applied a large number of neutrality tests as well as summary statistics informative for the action of selection such as Tajima's D, CLR, Fay and Wu's H, Fu and Li's F* and D*, XPEHH, ΔiHH, iHS, F(ST), ΔDAF and XPCLR among others to low coverage sequencing data from the 1000 genomes project (Phase 1; release April 2012). We have implemented a publicly available genome-wide browser to communicate the results from three different populations of West African, Northern European and East Asian ancestry (YRI, CEU, CHB). Information is provided in UCSC-style format to facilitate the integration with the rich UCSC browser tracks and an access page is provided with instructions and for convenient visualization. We believe that this expandable resource will facilitate the interpretation of signals of selection on different temporal, geographical and genomic scales.Entities:
Mesh:
Year: 2013 PMID: 24275494 PMCID: PMC3965045 DOI: 10.1093/nar/gkt1188
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of available summary statistics
| Method family | Method | Reference | Window size | Rank scores tail |
|---|---|---|---|---|
| Allele frequency spectrum | Tajima’s D | Tajima ( | 30 kb | Lower |
| CLR | Nielsen | Variable size | Upper | |
| Fay and Wu’s H | Fay and Wu ( | 30 kb | Lower | |
| Fu and Li’s F* | Fu and Li ( | 30 kb | Lower | |
| Fu and Li’s D* | Fu and Li ( | 30 kb | Lower | |
| R2 | Ramos-Onsins and Rozas ( | 30 kb | Lower | |
| Linkage disequilibrium structure | XP-EHH | modified from Sabeti | SNP-specific | Upper |
| AiHH | modified from Voight | SNP-specific | Upper | |
| his | modified from Voight | SNP-specific | Upper | |
| EHH_average | modified from Sabeti | 30 kb | Upper | |
| EHH_max | modified from Sabeti | 30 kb | Upper | |
| Wall’s B | Wall ( | 30 kb | Upper | |
| Wall’s Q | Wall ( | 30 kb | Upper | |
| Fu’s F | Fu ( | 30 kb | Lower | |
| Dh | Nei ( | 30 kb | Upper | |
| Za | Rozas | 30 kb | Upper | |
| ZnS | Kelly ( | 30 kb | Upper | |
| ZZ | Rozas | 30 kb | Upper | |
| Population differentiation | Fst (global and pairwise) | Weir and Cockerham ( | SNP-specific | Upper |
| ΔDAF (standard and absolute) | Hofer | SNP-specific | Upper | |
| XP-CLR | Chen | 0.1 cM (maximum window) | Upper | |
| Descriptive statistics | Segregating sites | 30 kb | NA | |
| Singletons | 30 kb | NA | ||
| pi (nucleotide diversity) | Nei and Li ( | 30 kb | NA | |
| DAF (derived allele frequency) | SNP-specific | NA | ||
| MAF (minor allele frequency) | SNP-specific | NA |
Figure 1.Schematic workflow developed in order to calculate diverse genome-wide summary statistics informative for the action of selection and to build a database in order to share and visualize the results.
Figure 2.Examples of genomic regions under selection in the 1000 genomes selection browser. Tracks of statistics from different populations are visualized in colour (CEU in green, CHB in red and YRI in blue). Additional examples are given at http://hsb.upf.edu (A) The p- and q-arms of chromosome 2 (−log10 of empirically ranked scores). Recurrent peaks at around 72.5 Mb (left green arrow) and 109.5 Mb (right green arrow) indicate the loci CYP26B1/EXOC6B and EDAR, respectively. (B) Signature of positive selection around SLC45A2, another established skin colour gene, in the CEU population (0.5-Mb window; −log10 of empirically ranked scores). (C) Widespread balancing selection in the HLA region indicated by strongly positive scores for Tajima’s D in all three analysed human populations (0.5-Mb window).