| Literature DB >> 24194603 |
Mulin Jun Li1, Lily Yan Wang, Zhengyuan Xia, Maria P Wong, Pak Chung Sham, Junwen Wang.
Abstract
The dbPSHP database (http://jjwanglab.org/dbpshp) aims to help researchers to efficiently identify, validate and visualize putative positively selected loci in human evolution and further discover the mechanism governing these natural selections. Recent evolution of human populations at the genomic level reflects the adaptations to the living environments, including climate change and availability and stability of nutrients. Many genetic regions under positive selection have been identified, which assist us to understand how natural selection has shaped population differences. Here, we manually collect recent positive selections in different human populations, consisting of 15,472 loci from 132 publications. We further compiled a database that used 15 statistical terms of different evolutionary attributes for single nucleotide variant sites from the HapMap 3 and 1000 Genomes Project to identify putative regions under positive selection. These attributes include variant allele/genotype properties, variant heterozygosity, within population diversity, long-range haplotypes, pairwise population differentiation and evolutionary conservation. We also provide interactive pages for visualization and annotation of different selective signals. The database is freely available to the public and will be frequently updated.Entities:
Mesh:
Year: 2013 PMID: 24194603 PMCID: PMC3965004 DOI: 10.1093/nar/gkt1052
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The scope and calculated scores in the dbPSHP database
| Attribute | Evaluation term | Abbreviation |
|---|---|---|
| Variant genotype properties | Derived allele | DA |
| Ancestral allele | AA | |
| Allele frequency | DAF | |
| AAF | ||
| Genotype frequency | GFHOM1 | |
| GFHET | ||
| GFHOM2 | ||
| Hardy–Weinberg equilibrium | HWE1 | |
| HWE2 | ||
| Variant heterozygosity | Heterozygosity | HET |
| Nucleotide diversity | PI | |
| Within population diversity | Tajima’s D | TD |
| Long-range haplotypes | Integrated extended haplotype homozygosity | IHH |
| Integrated haplotype score | UIHS | |
| IHS | ||
| Differentiation between populations | Difference of derived allele frequency | DDAF |
| DDAF_POP1_POP2 | ||
| Fixation index | FST1 | |
| FST1_POP1_POP2 | ||
| FST2 | ||
| FST2_POP1_POP2 | ||
| Cross-population extended haplotype homozygosity | UXPEHH | |
| XPEHH_POP1_POP2 | ||
| Cross-population composite likelihood ratio | XPCLR | |
| XPCLR_POP1_POP2 | ||
| Evolutionary conservation | Neutral rate | NR |
| Rejected substitution | RS |
DAF is the allele frequency for the derived allele; AAF is the allele frequency for the ancestral allele; GFHOM1 is the genotype frequency for homozygous derived allele AA; GFHET is the genotype frequency for heterozygous Aa; GFHOM2 is the genotype frequency for homozygous ancestral allele aa; HWE1 is the value of simple chi square goodness-of-fit test; HWE2 is the P-value of exact test; FST1 is the FST of Wright’s approximate formula; FST2 is the FST of Cockerham & Weir estimator; UIHS is the unstandardized integrated haplotype score; UXPEHH is the unstandardized cross-population extended haplotype homozygosity; POP1_POP2 represents the pairwise scores of two specific populations (Supplementary Methods).
Figure 1.The main functional units of dbPSHP interface. (a) The interactive chart for the scatter plot of different statistical scores, which depicts the iHS distribution of genetic hitchhiking region surrounding the LCT gene in the CEU population. (b) The worldwide allele frequency map of a genetic variant rs10188066 and selected population is marked with a red outline. Derived allele frequency is marked with blue color and ancestral allele frequency is marked with red color in each pie chart. (c) The summary table of important statistical terms for selected variant. (d) The three tabs records detailed information about selected variant including variant attributes, selective scores, literature evidence, mapped gene, GWAS information, cross population selective signals and comprehensive variant annotations from the external browser. (e) The searchable table collected literature-based positive selections in the human population.