| Literature DB >> 25392759 |
Xu Zhang1, Wenbo Mu2, Cong Liu2, Wei Zhang3.
Abstract
Genetic admixture has been utilized as a tool for identifying loci associated with complex traits and diseases in recently admixed populations such as African Americans. In particular, admixture mapping is an efficient approach to identifying genetic basis for those complex diseases with substantial racial or ethnic disparities. Though current advances in admixture mapping algorithms may utilize the entire panel of SNPs, providing ancestry-informative markers (AIMs) that can differentiate parental populations and estimate ancestry proportions in an admixed population may particularly benefit admixture mapping in studies of limited samples, help identify unsuitable individuals (e.g., through genotyping the most informative ancestry markers) before starting large genome-wide association studies (GWAS), or guide larger scale targeted deep re-sequencing for determining specific disease-causing variants. Defining panels of AIMs based on commercial, high-throughput genotyping platforms will facilitate the utilization of these platforms for simultaneous admixture mapping of complex traits and diseases, in addition to conventional GWAS. Here, we describe AIMs detected based on the Shannon Information Content (SIC) or Fst for African Americans with genome-wide coverage that were selected from ∼2.3 million single nucleotide polymorphisms (SNPs) covered by the Affymetrix Axiom Pan-African array, a newly developed genotyping platform optimized for individuals of African ancestry.Entities:
Keywords: Admixture mapping; African American; Ancestry-informative marker; Pan-African array; Single nucleotide polymorphism
Year: 2014 PMID: 25392759 PMCID: PMC4226639 DOI: 10.7717/peerj.660
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Evaluation analysis of ancestry-informative markers.
(A) The rpower distributions for AIMs selected based on SIC and F. The average rpower is 0.85 (sd = 0.06) for both lists. (B) Principal components analysis on the 1000 Genomes Project CEU, YRI and ASW panels (n = 85 88, 56 unrelated samples, respectively) using the AIMs detected based on SIC. (C) Comparison of the proportion of variance explained (PVE) by the first PCs derived from the CEU, YRI, and ASW samples. The histogram shows the distribution from 1000 randomly-sampled sets of SNPs according to the number of AIMs (based on SIC) on each chromosome. Circles denote real PVE observations for each panel of AIMs: AIMs selected by SIC (5885 SNPs) and F (6012 SNPs) from Pan-African array, AIMs selected from Affymetrix SNP 6.0 (4290 SNPs), and Illumina 1M (4285 SNPs), respectively.