| Literature DB >> 21368315 |
Jeffrey D Wall1, Rong Jiang, Christopher Gignoux, Gary K Chen, Celeste Eng, Scott Huntsman, Paul Marjoram.
Abstract
Analyses of genetic polymorphism data have the potential to be highly informative about the demographic history of Native American populations, but due to a combination of historical and political factors, there are essentially no autosomal sequence polymorphism data from any Native American group. However, there are many resequencing studies involving Latinos, whose genomes contain segments inherited from their Native American ancestors. In this study, we introduce a new method for estimating local ancestry across the genomes of admixed individuals and show how this method, along with dense genotyping and targeted resequencing, can be used to assay genetic variation in ancestral Native American groups. We analyze roughly 6 Mb of resequencing data from 22 Mexican Americans to provide the first large-scale view of sequence level variation in Native Americans. We observe low levels of diversity and high levels of linkage disequilibrium in the Native American-derived sequences, consistent with a recent severe population bottleneck associated with the initial peopling of the Americas. Using two different computational approaches, one novel, we estimate that this bottleneck occurred roughly 12.5 Kya; when uncertainty in the estimation process is taken into account, our results are consistent with archeological estimates for the colonization of the Americas.Entities:
Mesh:
Year: 2011 PMID: 21368315 PMCID: PMC3144384 DOI: 10.1093/molbev/msr049
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FSchematic showing a pair of chromosomes from an admixed individual with ancestry from different continental populations (shown in black and red). Local ancestry can be inferred by estimating the number of copies inherited from each ancestral population at each location across the genome.
Comparison of Different Methods for Estimating Local Ancestry.
| Method | δ | Marker-specific accuracy (%) |
| Our method (unphased data) | 0.2 | 91.0 |
| 0.05 | 92.3 | |
| Our method (phased data) | 0.2 | 93.4 |
| 0.05 | 94.5 | |
| Hapmix | 0.2 | 96.1 |
| 0.05 | 98.0 | |
| LAMP | 0.2 | 84.1 |
| Structure | 0.4 | 72.0 |
We used only those SNPs with ancestral allele frequencies that differed by at least δ in the two ancestral populations. We calculated the average accuracy of the marker-specific ancestry calls for each method. Note that different methods make different assumptions about phased versus unphased data. See text for further details.
FDiagram of the demographic model used, with estimates and 95% confidence intervals (in parentheses) for T, the time when the two populations split; t, the time of onset of population growth,; t, the time since the end of the population bottleneck; and b, the strength of the bottleneck. Parameter estimates, along with approximate 95% confidence intervals in parentheses, are given to the right of the figure. Method 1 is the composite likelihood method described in Plagnol and Wall (2006), and method 2 is a summary likelihood method described in the Materials and Methods.
FPlot of diversity θ (= 4Nμ, where N is the effective population size and μ is the mutation rate per base pair per generation) versus estimated recombination rate ρ (= 4Nr, where r is the recombination rate per base pair per generation) for individuals with different continental ancestries. The two blue diamonds refer to a European sample and a Mexican American sample with two European-derived sequences.