| Literature DB >> 21635713 |
Daniel Shriner1, Adebowale Adeyemo, Edward Ramos, Guanjie Chen, Charles N Rotimi.
Abstract
A more refined estimation of ancestry would benefit admixture mapping and association mapping, making disease loci identification in admixed populations more powerful.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21635713 PMCID: PMC3219963 DOI: 10.1186/gb-2011-12-5-223
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic representation of population stratification and admixture. Population stratification can confound analysis of the meta-population in Generation 0 if allele frequencies differ between the two parental populations. Admixed individuals are generated by interbreeding among previously isolated populations. In subsequent generations, due to the cumulative effect of meioses, ancestry switches (that is, changes in ancestry in the interval between two markers) accumulate and chromosomes become mosaics of ancestry.
Figure 2Local and global ancestry for three unrelated admixed African American individuals. Blue indicates two chromosomes of African ancestry, red indicates two chromosomes of European ancestry, and black indicates one chromosome of African ancestry and one chromosome of European ancestry. (a) An individual with a low proportion of African ancestry is depicted. (b) An individual with similar proportions of African and European ancestry is depicted. (c) An individual with a high proportion of African ancestry is depicted. The histograms on the right indicate the genome-wide frequencies of the three local ancestry values. Global ancestry equals the average of local ancestry divided by two (to account for diploidy). Among self-identified African Americans, the proportion of African ancestry can range from 0 to 1, representing the full range of variation. The information content of the local ancestry plots on the left is considerably higher than the global ancestry summaries on the right.
Figure 3The effect of marker density on detecting ancestry switches. The number of ancestry switches detected as a function of marker density for the individual with the largest number of ancestry switches (n = 737) among 1,976 African Americans [17]. The continuous red line indicates the sparse panel of 1,943 ancestry-informative markers [18]. The dashed red line indicates saturation at 177,000 random markers, after which more markers provide no additional information regarding the number of ancestry switches. For admixed African Americans, high-throughput genotyping of approximately 1 million markers using commercially available microarrays is sufficient to extract all of the information on local ancestry.
Figure 4Detecting natural selection based on the distribution of ancestry switches. The expected number of ancestry switches is based on the local recombination rate, provided by phase II of the International HapMap Project [19]. The observed number of ancestry switches is based on a sample of 1,018 unrelated African Americans [17]. An excess of ancestry switches on chromosome 6p co-localizes with the major histocompatibility complex (MHC; indicated by the bar), consistent with the well-reported evidence of positive natural selection in this genomic region. Comparing the distributions of the observed and expected numbers of ancestry switches in an admixed sample can reveal genomic regions experiencing either positive natural selection (if there is an observed excess of ancestry switches) or negative natural selection (if there is an observed deficiency of ancestry switches).
Controlling for confounding due to admixture in association testing
| 0 | 0 | 0.048 | 0.044 | 0.053 |
| 0 | 1 | 0.570 | 0.538 | 0.052 |
| 1 | 0 | 0.892 | 0.892 | 0.893 |
| 1 | 1 | 0.599 | 0.626 | 0.899 |
The first two rows demonstrate inflation of the false-positive error rate resulting from confounding due to admixture. The second two rows demonstrate the loss of power resulting from confounding due to admixture. In both cases, confounding is controlled by local ancestry but not by global ancestry. aTwo isolated parental populations were generated with FST = 0.115 (FST is the ratio of the observed variance in allele frequencies among populations to the variance expected if the populations were randomly mating), mimicking the amount of population differentiation between the African and European ancestors of African Americans. A sample of admixed individuals was generated with 80% of the genome inherited from the first parental population, mimicking the amount of African ancestry in African Americans. A dataset consisted of 1,000 unrelated individuals and 1,000 unlinked markers. The generative model for the phenotype was a linear model with the listed fixed effects for the queried marker, no effects for all other markers, and noise equal to a random deviate from the standard normal distribution. bFor each marker and individual, the genotype was coded as 0, 1, or 2 copies of the derived allele. cFor each marker and individual, local ancestry was coded as 0, 1, or 2 copies inherited from the first parental population. dThe rejection rates (false-positive error rates if βgenotype = 0, or power if βgenotype = 1) for testing genotype association at one marker are shown. The significance level was 0.05.