| Literature DB >> 29165562 |
Eran Elhaik1, Leeban Yusuf1, Ainan I J Anderson2, Mehdi Pirooznia3, Dimitrios Arnellos1,4, Gregory Vilshansky5, Gunes Ercal6, Yontao Lu7, Teresa Webster7, Michael L Baird8, Umberto Esposito1.
Abstract
The human population displays wide variety in demographic history, ancestry, content of DNA derived from hominins or ancient populations, adaptation, traits, copy number variation, drug response, and more. These polymorphisms are of broad interest to population geneticists, forensics investigators, and medical professionals. Historically, much of that knowledge was gained from population survey projects. Although many commercial arrays exist for genome-wide single-nucleotide polymorphism genotyping, their design specifications are limited and they do not allow a full exploration of biodiversity. We thereby aimed to design the Diversity of REcent and Ancient huMan (DREAM)-an all-inclusive microarray that would allow both identification of known associations and exploration of standing questions in genetic anthropology, forensics, and personalized medicine. DREAM includes probes to interrogate ancestry informative markers obtained from over 450 human populations, over 200 ancient genomes, and 10 archaic hominins. DREAM can identify 94% and 61% of all known Y and mitochondrial haplogroups, respectively, and was vetted to avoid interrogation of clinically relevant markers. To demonstrate its capabilities, we compared its FST distributions with those of the 1000 Genomes Project and commercial arrays. Although all arrays yielded similarly shaped (inverse J) FST distributions, DREAM's autosomal and X-chromosomal distributions had the highest mean FST, attesting to its ability to discern subpopulations. DREAM performances are further illustrated in biogeographical, identical by descent, and copy number variation analyses. In summary, with approximately 800,000 markers spanning nearly 2,000 genes, DREAM is a useful tool for genetic anthropology, forensic, and personalized medicine studies.Entities:
Keywords: CNVs; ancient DNA; archaic DNA; biogeography; forensics; population genetics
Mesh:
Substances:
Year: 2017 PMID: 29165562 PMCID: PMC5726468 DOI: 10.1093/gbe/evx237
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Worldwide distribution of population from which AIMs were obtained. AIMs from over 450 world populations were harvested from the literature (green) or calculated based on genotyped data from public collections (red).
. 2.—Local human adaptations. Following Fan et al. (2016), each adaptation is labeled by the phenotype and/or selection pressure. The genetic loci under selection and the studied population are shown.
. 3.—Human traits and their associated genetic loci.
. 4.—SNP density in the DREAM microarray. The average numbers of DREAM SNPs per 100,000 nucleotides across the genome are color coded. Gaps in the assembly are shown in gray.
. 5.—Success rate in identifying Y-chromosomal (left) and mtDNA (right) haplogroups. The plots depict all known basal haplogroups (columns), the number of known subgroups in each haplogroup (top of each column), and the proportion of computationally validated subgroups.
. 6.—GPS predictions of biogeographical affinities for worldwide 33 populations. The x axis illustrates populations represented by a vertical stacked column indicating the proportion of individuals predicted within 200 km of their country’s political borders (blue) and the remaining individuals (green). The average distance from the predicted location and true country of origin is indicated in red balls.