| Literature DB >> 22373521 |
Xiangqing Sun1, Junghyun Namkung, Xiaofeng Zhu, Robert C Elston.
Abstract
Genome-wide association studies are based on the linkage disequilibrium pattern between common tagging single-nucleotide polymorphisms (SNPs) (i.e., SNPs having only common alleles) and true causal variants, and association studies with rare SNP alleles aim to detect rare causal variants. To better understand and explain the findings from both types of studies and to provide clues to improve the power of an association study with only common SNPs genotyped, we study the correlation between common SNPs and the presence of rare alleles within a region in the genome and look at the capability of common SNPs in strong linkage disequilibrium with each other to capture single rare alleles. Our results indicate that common SNPs can, to some extent, tag the presence of rare alleles and that including SNPs in strong linkage disequilibrium with each other among the tagging SNPs helps to detect rare alleles.Entities:
Year: 2011 PMID: 22373521 PMCID: PMC3287929 DOI: 10.1186/1753-6561-5-S9-S88
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Mean multiple correlation between (1) the set of common SNPs and the number of rare alleles, (2) permuted common SNPs and the number of rare alleles, (3) the set of common SNPs and the number of synonymous rare alleles, and (4) the set of common SNPs and the number of nonsynonymous rare alleles
| Population | (1) Common vs. rare SNPs | (2) Random correlation | Kolmogorov-Smirnov test | (3) Common vs. synonymous rare SNPs | (4) Common vs. nonsynonymous rare SNPs | ||
|---|---|---|---|---|---|---|---|
| European | 0.078 | −0.022 | 2.84 × 10−9 | 9.66 × 10−15 | 0.078 | 0.077 | 0.952 |
| Chinese | 0.067 | 0.003 | 6.80 × 10−6 | 1.28 × 10−5 | 0.090 | 0.041 | 0.024 |
| Denver Chinese | 0.063 | −0.002 | 2.30 × 10−6 | 3.052 × 10−10 | 0.085 | 0.064 | 0.350 |
| Japanese | 0.089 | 0.004 | 1.50 × 10−8 | 6.17 × 10−12 | 0.091 | 0.081 | 0.668 |
| Luhya | 0.238 | −0.0006 | <2.2 × 10−16 | <2.2 × 10−16 | 0.241 | 0.233 | 0.678 |
| Tuscan | 0.063 | −0.002 | 0.001 | 4.60 × 10−6 | 0.088 | 0.045 | 0.100 |
| Yoruba | 0.120 | −0.007 | <2.2 × 10−16 | <2.2 × 10−16 | 0.142 | 0.099 | 0.008 |
| All samples | 0.053 | 0.054 | 0.580 | 0.118 | 0.057 | 0.048 | 7.74 × 10-4 |
Figure 1Distribution of the correlation The correlation is between the common SNPs and the number of rare alleles present in five random rare SNPs within a 1-Mb region. X-axes are the correlation r2, y-axes are the probability densities.
Figure 2Distribution of the multiple correlation Each point represents a rare SNP. The x-axis is the adjusted between the rare SNP and the common SNPs in set A, and the y-axis is the adjusted between the rare SNP and the common SNPs in set B. SNPs in set B have stronger LD than SNPs in set A, thus set B contains all the SNPs in set A and the SNPs that have stronger LD with those in set A or between themselves. In the left-hand panel, SNPs in set A have LD r2 ≤ 0.8 and SNPs in set B have LD r2 ≤ 0.95. In the right-hand panel, SNPs in set A have LD r2 ≤ 0.95 and SNPs in set B have LD r2 ≤ 0.99.