| Literature DB >> 22373395 |
Yanming Di1, Gu Mi, Luna Sun, Rongrong Dong, Hong Zhu, Lili Peng.
Abstract
We show that the statistical power of a single single-nucleotide polymorphism (SNP) score test for genetic association reflects the cumulative effect of all causal SNPs that are correlated with the test SNP. Statistical significance of a score test can sometimes be explained by the collective effect of weak correlations between the test SNP and multiple causal SNPs. In a finite population, weak but significant correlations between the test SNP and the causal SNPs can arise by chance alone. As a consequence, when a single-SNP score test shows significance, the causal SNPs contributing to the power of the test are not necessarily located near the test SNP, nor do they have to be in linkage disequilibrium with the test SNP. These findings are confirmed with the Genetic Analysis Workshop 17 mini-exome data. The findings of this study highlight the often overlooked importance of long-range and weak linkage disequilibrium in genetic association studies.Entities:
Year: 2011 PMID: 22373395 PMCID: PMC3287902 DOI: 10.1186/1753-6561-5-S9-S63
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Analytical and simulated power of score tests performed on the GAW17 data set. Each plus sign corresponds to one SNP in the GAW17 data. The black circles correspond to the 39 known causal SNPs.
Figure 2Significant correlations between SNPs. Significant correlations (r > 0.1) between SNPs in (a) the GAW17 data set and (b) a simulated data set in which all SNPs are in linkage equilibrium. On the x-axes is the minor allele frequency at a SNP τ. On the y-axes is the number of SNPs j with r > 0.1.
Distributions of the number of causal SNPs significantly correlated (r > 0.1) with each SNP
| Data set | Number of correlated ( | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | >7 | |
| GAW17 | 13,288 | 5,153 | 2,402 | 870 | 424 | 278 | 136 | 85 | 121 |
| Simulated data | 14,322 | 5,229 | 2,092 | 729 | 245 | 104 | 29 | 7 | 0 |
For each SNP in the GAW17 data and in the simulated data, we count how many of the 39 causal SNPs in the GAW17 data have correlation coefficients r > 0.1 with it. We find that 9,469 SNPs in the GAW17 data and 8,435 SNPs in the simulated data are correlated (r > 0.1) with at least one causal SNP. Note that in the simulated data set, all SNPs are simulated to be in linkage equilibrium with each other and with the 39 causal SNPs, so all observed correlations are due to chance alone.