| Literature DB >> 29112749 |
Chao Zhang1,2, Yang Gao1,2,3, Jiaojiao Liu1,2,3, Zhe Xue1, Yan Lu1, Lian Deng1,2, Lei Tian1,2, Qidi Feng1,2, Shuhua Xu1,2,3,4.
Abstract
There are a growing number of studies focusing on delineating genetic variations that are associated with complex human traits and diseases due to recent advances in next-generation sequencing technologies. However, identifying and prioritizing disease-associated causal variants relies on understanding the distribution of genetic variations within and among populations. The PGG.Population database documents 7122 genomes representing 356 global populations from 107 countries and provides essential information for researchers to understand human genomic diversity and genetic ancestry. These data and information can facilitate the design of research studies and the interpretation of results of both evolutionary and medical studies involving human populations. The database is carefully maintained and constantly updated when new data are available. We included miscellaneous functions and a user-friendly graphical interface for visualization of genomic diversity, population relationships (genetic affinity), ancestral makeup, footprints of natural selection, and population history etc. Moreover, PGG.Population provides a useful feature for users to analyze data and visualize results in a dynamic style via online illustration. The long-term ambition of the PGG.Population, together with the joint efforts from other researchers who contribute their data to our database, is to create a comprehensive depository of geographic and ethnic variation of human genome, as well as a platform bringing influence on future practitioners of medicine and clinical investigators. PGG.Population is available at https://www.pggpopulation.org.Entities:
Mesh:
Year: 2018 PMID: 29112749 PMCID: PMC5753384 DOI: 10.1093/nar/gkx1032
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.An schematic diagram of data processing for PGG.Population database.
Figure 2.Number of populations and genomes collected in the current release of PGG.Population. (A) Worldwide distribution of population samples, each belonging to one of the eight regional groups. (B) A summary of the population samples. (C) Number of groups in each region. (D) Number of genomes/individuals in each region.
Figure 3.User-friendly interfaces for querying ethnic groups. (A) Keyword-based search bar. Users can find the bar in the welcome page of the database and in the upper right of any pages of our website. (B) Advanced interface to retrieving populations.
Figure 4.An example of genomic diversity and genetic ancestry reconstructed by PGG.Population for the Tibetan population. (A) A fan-like chart showing pairwise FST values between the Tibetan and other worldwide populations. The lengths of the bars are proportional to the FST values. (B) Plot of the first two principal components dissecting 22 East Asian populations. (C) Admixture analysis of 22 East Asian populations assuming five ancestral source populations. Each colour represents a genetic component. (D) An example of a genomic region showing signatures of natural selection.