| Literature DB >> 19477991 |
Bogdan Pasaniuc1, Sriram Sankararaman, Gad Kimmel, Eran Halperin.
Abstract
UNLABELLED: A characterization of the genetic variation of recently admixed populations may reveal historical population events, and is useful for the detection of single nucleotide polymorphisms (SNPs) associated with diseases through association studies and admixture mapping. Inference of locus-specific ancestry is key to our understanding of the genetic variation of such populations. While a number of methods for the inference of locus-specific ancestry are accurate when the ancestral populations are quite distant (e.g. African-Americans), current methods incur a large error rate when inferring the locus-specific ancestry in admixed populations where the ancestral populations are closely related (e.g. Americans of European descent).Entities:
Mesh:
Year: 2009 PMID: 19477991 PMCID: PMC2687951 DOI: 10.1093/bioinformatics/btp197
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Accuracies of ancestry estimates obtained by the compared methods on the HapMap admixtures
| Method | YRI-CEU | CEU-JPT | JPT-CHB |
|---|---|---|---|
| SABER | 89.4 | 85.2 | 68.2 |
| HAPAA | 93.7 | 88.2 | 72.0 |
| LAMP | 94.8 | 93.0 | 65.8 |
| LAMP-EM | 97.8 | 94.8 | 74.8 |
| WINPOP | 98.0 | 95.9 | 82.8 |
| WINPOP-EM | 97.7 | 94.7 | 74.8 |
| Upper bound | 99.9 | 99.6 | 91.9 |
LAMP-EM (WINPOP-EM) uses LAMP (WINPOP) solution as an initialization for an EM algorithm that optimizes an HMM similar to the model proposed in STRUCTURE (see Section 2.4 for details).
Fig. 1.Accuracy of ancestry estimates of LAMP and WINPOP, as well as the upper bound on all possible methods as a function of the F between the simulated ancestral populations. The accuracy of WINPOP on the JPT-CHB HapMap admixture is also provided as a single point.
Fig. 2.Accuracies of ancestry estimates for admixtures of pairs of populations from HGDP as a function of the F.
Fig. 3.Comparison of the accuracy of LAMP and WINPOP on admixtures created from the HGDP populations. Red denotes high accuracy, while blue denotes low accuracy. The upper half of the matrix denotes accuracies attained by WINPOP, while the lower half denotes LAMP. While LAMP attains accuracies of<70% on 116 pairs, WINPOP has a minimum accuracy of 81.49%. The populations on the two axes ordered by continental groups.
Fig. 4.Accuracies of ancestry estimates for admixtures of close populations when WINPOP is provided with drifted ancestral populations.
Accuracies of ancestry estimates for the Puerto Rican simulations averaged over 100 datasets
| Method | Accuracy |
|---|---|
| LAMP | 68.1 |
| WINPOP | 91.3 |
| Upper bound | 98.4 |
Imputation error rate (%) obtained by GEDI on the three HapMap simulated admixtures GEDI-1ANC denotes imputation based on only one ancestral reference population, GEDI-2ANC denotes imputation based on both ancestral reference populations, while WINPOP+GEDI denotes imputation of each SNP genotype based on the local ancestry estimated by WINPOP
| Method | YRI-CEU | CEU-JPT | JPT-CHB |
|---|---|---|---|
| GEDI–1ANC | 13.12 | 6.85 | 4.04 |
| GEDI–2ANC | 6.43 | 3.97 | 3.48 |
| WINPOP+GEDI | 5.69 | 3.64 | 3.39 |