| Literature DB >> 23269928 |
Shudong Wang1, Wenan Chen, Xiangning Chen, Fengjiao Hu, Kellie J Archer, Hb Nianjun Liu, Shumei Sun, Guimin Gao.
Abstract
Meta-analysis of genome-wide association studies (GWAS) has become a useful tool to identify genetic variants that are associated with complex human diseases. To control spurious associations between genetic variants and disease that are caused by population stratification, double genomic control (GC) correction for population stratification in meta-analysis for GWAS has been implemented in the software METAL and GWAMA and is widely used by investigators. In this research, we conducted extensive simulation studies to evaluate the double GC correction method in meta-analysis and compared the performance of the double GC correction with that of a principal components analysis (PCA) correction method in meta-analysis. Results show that when the data consist of population stratification, using double GC correction method can have inflated type I error rates at a marker with significant allele frequency differentiation in the subpopulations (such as caused by recent strong selection). On the other hand, the PCA correction method can control type I error rates well and has much higher power in meta-analysis compared to the double GC correction method, even though in the situation that the casual marker does not have significant allele frequency difference between the subpopulations. We applied the double GC correction and PCA correction to meta-analysis of GWAS for two real datasets from the Atherosclerosis Risk in Communities (ARIC) project and the Multi-Ethnic Study of Atherosclerosis (MESA) project. The results also suggest that PCA correction is more effective than the double GC correction in meta-analysis.Entities:
Keywords: double genomic control correction; genome-wide association studies; meta-analysis; population stratification; principal components analysis
Year: 2012 PMID: 23269928 PMCID: PMC3529452 DOI: 10.3389/fgene.2012.00300
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Type I error rates of meta-analysis with two case–control studies for different allele frequencies in the two subpopulations.
| Frequency[ | Study 1[ | Study 2 | 10−5 | 10−6 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Population 1 (2) | Case | Control | Case | Control | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC |
| Ran(ran)[ | 1000 | 1000 | 1000 | 1000 | 9.51e–6[ | 5.05e–6 | 5.34e–6 | 1.07e–6 | 3.90e–7 | 3.80e–7 |
| 1500 | 1500 | 500 | 500 | 9.88e–6 | 5.00e–6 | 5.29e–6 | 9.19e–7 | 3.90e–7 | 4.60e–7 | |
| 750 | 1500 | 1500 | 750 | 1.04e–5 | 5.08e–6 | 5.50e–6 | 8.50e–7 | 5.20e–7 | 5.70e–7 | |
| 0.4 (0.2) | 1000 | 1000 | 1000 | 1000 | 9.89e–6 | 1.25e–2 | 1.31e–2 | 9.20e–7 | 1.75e–3 | 1.86e–3 |
| 1500 | 1500 | 500 | 500 | 1.01e–5 | 8.04e–3 | 8.49e–3 | 1.01e–6 | 9.77e–4 | 1.05e–3 | |
| 750 | 1500 | 1500 | 750 | 1.02e–5 | 1.26e–2 | 1.30e–2 | 1.03e–6 | 1.76e–3 | 1.85e–3 | |
| 0.6 (0.2) | 1000 | 1000 | 1000 | 1000 | 9.67e–6 | 0.836 | 0.839 | 7.70e–7 | 0.585 | 0.591 |
| 1500 | 1500 | 500 | 500 | 9.43e–6 | 0.763 | 0.769 | 8.60e–7 | 0.478 | 0.486 | |
| 750 | 1500 | 1500 | 750 | 9.86e–6 | 0.837 | 0.841 | 9.30e–7 | 0.588 | 0.594 | |
| 0.8 (0.2) | 1000 | 1000 | 1000 | 1000 | 9.98e–6 | 1.00 | 1.00 | 1.11e–6 | 1.00 | 1.00 |
| 1500 | 1500 | 500 | 500 | 9.61e–6 | 1.00 | 1.00 | 9.50e–7 | 1.00 | 1.00 | |
| 750 | 1500 | 1500 | 750 | 9.69e–6 | 1.00 | 1.00 | 9.20e–7 | 1.00 | 1.00 | |
Allele frequencies in the two subpopulations from which cases and controls were sampled for each study.
Sample size.
Allele frequencies in the two populations of each study were randomly generated.
9.51e-6 = 9.51 × 10−6
Type I error rates of meta-analysis with five studies for different allele frequencies in the two subpopulations.
| Frequency | 10−5 | 10−6 | ||||
|---|---|---|---|---|---|---|
| Population 1 (2) | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC |
| Ran(ran) | 9.89e–6 | 7.48e–6 | 8.13e–6 | 1.04e–6 | 6.30e–7 | 7.20e–7 |
| 0.4 (0.2) | 1.00e–5 | 0.418 | 0.426 | 9.90e–7 | 0.195 | 0.203 |
| 0.6 (0.2) | 1.01e–5 | 1.00 | 1.00 | 1.03e–6 | 0.999 | 1.00 |
| 0.8 (0.2) | 1.01e–5 | 1.00 | 1.00 | 9.78e–7 | 1.00 | 1.00 |
Power of meta-analysis of GWAS for two studies using PCA correction, single GC correction (1 GC), and double GC correction (2 GC) at significance levels of 10–5, 10–6, and 10–7
| Study 1 | Study 2 | 10–5 | 10–6 | 10–7 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Case | Control | Case | Control | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC | ||
| 0.0402 | 0.0425 | 0.0113 | 0.0124 | 0.0041 | 0.0045 | |||||||||
| 0.1207 | 0.1194 | 0.0511 | 0.0502 | 0.0199 | 0.0196 | |||||||||
| 0.1368 | 0.1450 | 0.0604 | 0.0645 | 0.0238 | 0.0253 | |||||||||
| 0.4306 | 0.4368 | 0.2724 | 0.2787 | 0.1549 | 0.1602 | |||||||||
| 0.2412 | 0.2375 | 0.1228 | 0.1205 | 0.0552 | 0.0535 | |||||||||
| 0.4703 | 0.4806 | 0.3003 | 0.3085 | 0.1709 | 0.1800 | |||||||||
| 0.6377 | 0.6403 | 0.4683 | 0.4702 | 0.3128 | 0.3152 | |||||||||
| 0.6532 | 0.6570 | 0.4740 | 0.4782 | 0.3190 | 0.3215 | |||||||||
| 0.7008 | 0.6986 | 0.5442 | 0.5400 | 0.3819 | 0.3773 | |||||||||
| 0.8942 | 0.8930 | 0.8044 | 0.8028 | 0.6844 | 0.6824 | |||||||||
Power of meta-analysis of GWAS for five studies using PCA correction, single GC correction (1 GC), and double GC correction (2 GC) at significance levels of 10−5, 10−6, and 10−7.
| 10–5 | 10–6 | 10–7 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Study 1 | Study 2 | Study 3 | Study 4 | Study 5 | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC | PCA | 1 GC | 2 GC |
| 1.2 | 1.2 | 1.2 | 1.2 | 1.2 | 0.8284 | 0.3120 | 0.3148 | 0.6912 | 0.1683 | 0.1703 | 0.5421 | 0.0823 | 0.0841 |
| 1.3 | 1.3 | 1.3 | 1.3 | 1.3 | 0.9985 | 0.8735 | 0.8796 | 0.9937 | 0.7586 | 0.7670 | 0.9873 | 0.6101 | 0.6226 |
| 1.4 | 1.4 | 1.4 | 1.4 | 1.4 | 1.000 | 0.9951 | 0.9951 | 1.000 | 0.9828 | 0.9832 | 1.000 | 0.9539 | 0.9545 |
| 1.5 | 1.5 | 1.5 | 1.5 | 1.5 | 1.000 | 0.9997 | 0.9997 | 1.000 | 0.9991 | 0.9991 | 1.000 | 0.9978 | 0.9979 |
| 1.2 | 1.2 | 1.3 | 1.3 | 1.4 | 0.9940 | 0.7678 | 0.7779 | 0.9829 | 0.6153 | 0.6310 | 0.9592 | 0.4549 | 0.4693 |
| 1.3 | 1.3 | 1.4 | 1.4 | 1.5 | 0.9998 | 0.9843 | 0.9854 | 0.9998 | 0.9598 | 0.9625 | 0.9996 | 0.9164 | 0.9202 |
| 1.3 | 1.4 | 1.4 | 1.5 | 1.5 | 1.000 | 0.9928 | 0.9931 | 1.000 | 0.9772 | 0.9783 | 1.000 | 0.9453 | 0.9472 |
| 1.2 | 1.2 | 1.4 | 1.4 | 1.5 | 0.9997 | 0.9267 | 0.9299 | 0.9985 | 0.8498 | 0.8560 | 0.9960 | 0.7414 | 0.7514 |
Top SNPs with p-values <10−5 in a meta-analysis with PCA correction, single GC correction (1 GC), or double GC correction (2 GC).
| Chromo-some | SNP | Reference allele | |||
|---|---|---|---|---|---|
| PCA | 1 GC | 2 GC | |||
| 1 | rs6688672 | T | 6.63e–6 | 1.25e–5 | 1.35e–5 |
| 1 | rs6657751 | G | 3.46e–6 | 6.40e–6 | 6.94e–6 |
| 1 | rs10493964 | C | 6.64e–6 | 1.17e–5 | 1.27e–5 |
| 4 | rs7673213 | G | 4.06e–6 | 9.20e–6 | 9.95e–6 |
| 4 | rs3762865 | A | 8.44e–6 | 1.80e–5 | 1.94e–5 |
| 5 | rs1477654 | A | 7.29e–6 | 7.82e–6 | 8.47e–6 |
| 9 | rs10960562 | A | 4.07e–6 | 1.82e–5 | 1.96e–5 |
| 9 | rs10960592 | T | 5.50e–6 | 2.13e–5 | 2.29e–5 |