| Literature DB >> 26072481 |
Danny S Park1, Brielin Brown1, Celeste Eng1, Scott Huntsman1, Donglei Hu1, Dara G Torgerson1, Esteban G Burchard2, Noah Zaitlen2.
Abstract
MOTIVATION: Approaches to identifying new risk loci, training risk prediction models, imputing untyped variants and fine-mapping causal variants from summary statistics of genome-wide association studies are playing an increasingly important role in the human genetics community. Current summary statistics-based methods rely on global 'best guess' reference panels to model the genetic correlation structure of the dataset being studied. This approach, especially in admixed populations, has the potential to produce misleading results, ignores variation in local structure and is not feasible when appropriate reference panels are missing or small. Here, we develop a method, Adapt-Mix, that combines information across all available reference panels to produce estimates of local genetic correlation structure for summary statistics-based methods in arbitrary populations.Entities:
Mesh:
Year: 2015 PMID: 26072481 PMCID: PMC4553832 DOI: 10.1093/bioinformatics/btv230
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.This heatmap shows the average mixture frequency assigned to each reference population when optimizing over independent chromosomes for various datasets
Performance of each reference panel when imputing z-scores for GALA II Mexicans
| Panel | MSE | ||
|---|---|---|---|
| GALA II | 2966 | 0.214 | 0.916 |
| YRI | 2572 | 1.11 | 0.499 |
| MXL | 2923 | 0.615 | 0.737 |
| 1KG-Genome | 2836 | 0.484 | 0.807 |
| 1KG-Chrom | 2898 | 0.451 | 0.818 |
| 1KG-Window | 2836 | 0.438 | 0.824 |
| 1KG-No-MXL-PUR | 2904 | 0.507 | 0.795 |
Performance of each reference panel when imputing z-scores for GALA II Puerto Ricans
| Panel | MSE | ||
|---|---|---|---|
| GALA II | 3231 | 0.234 | 0.903 |
| JPT | 2572 | 0.884 | 0.626 |
| PUR | 3103 | 0.554 | 0.757 |
| 1KG-Genome | 2759 | 0.587 | 0.760 |
| 1KG-Chrom | 2906 | 0.473 | 0.800 |
| 1KG-Window | 2839 | 0.467 | 0.804 |
| 1KG-No-MXL-PUR | 2912 | 0.520 | 0.795 |
Performance of each panel for the joint statistics on chromosome 22 of the GALA II Mexicans (n = 41 758)
| Panel | MSE | Mean diff. | Var. of diff. | |
|---|---|---|---|---|
| MXL | 0.116 | 0.988 | 0.042 | 0.114 |
| 1KG-Chrom | 0.031 | 0.997 | 0.004 | 0.031 |
| 1KG-Genome | 0.048 | 0.995 | 0.008 | 0.048 |
| 1KG-Window | 0.05 | 0.994 | 0.006 | 0.049 |
| 1KG-No-MXL-PUR | 0.057 | 0.994 | 0.005 | 0.057 |
Performance of each panel for the joint statistics on chromosome 22 of the GALA II Puerto Ricans (n = 43 715)
| Panel | MSE | Mean diff. | Var. of diff. | |
|---|---|---|---|---|
| PUR | 0.057 | 0.994 | 0.023 | 0.057 |
| 1KG-Chrom | 0.017 | 0.998 | 0.004 | 0.017 |
| 1KG-Genome | 0.070 | 0.993 | 0.018 | 0.069 |
| 1KG-Window | 0.042 | 0.995 | 0.012 | 0.042 |
| 1KG-No-MXL-PUR | 0.032 | 0.997 | 0.008 | 0.032 |
Fig. 2.Estimated joint statistic (x axis) versus the true joint statistic (y axis) in the GALA II individuals using Σ estimated from a ‘best guess’ reference panel and Adapt-Mix. (a) Joint statistics for the GALA II Mexicans using MXL (red) and 1KG-Chrom (blue). (b) Joint statistics for the GALA II Puerto Ricans using PUR (orange) and 1KG-Chrom (blue). (c) Joint statistics for the GALA II Mexicans using MXL (red) and 1KG-No-MXL-PUR (gray). (d) Joint statistics for the GALA II Puerto Ricans using PUR (orange) and 1KG-No-MXL-PUR (gray)
Fig. 3.Histogram of the deviations from the true joint statistic when using a ‘best guess’ panel and Adapt-Mix to estimate Σ for joint-testing. (a) Joint testing for GALA II Mexicans. MXL deviations are shown in red and 1KG-Chrom is shown in blue. (b) Joint testing for GALA II Puerto Ricans. PUR deviations are shown in orange and 1KG-Chrom is shown in blue
The performance of each reference panel when imputing z-scores for the C4D dataset
| Panel | MSE | ||
|---|---|---|---|
| CEU | 2637 | 0.379 | 0.813 |
| GIH | 2627 | 0.414 | 0.796 |
| 1KG-Chrom | 2651 | 0.272 | 0.870 |
| 1KG-Window | 2628 | 0.265 | 0.872 |