| Literature DB >> 23300628 |
Jonas Carlsson Almlöf1, Per Lundmark, Anders Lundmark, Bing Ge, Seraya Maouche, Harald H H Göring, Ulrika Liljedahl, Camilla Enström, Jessy Brocheton, Carole Proust, Tiphaine Godefroy, Jennifer G Sambrook, Jennifer Jolley, Abigail Crisp-Hihn, Nicola Foad, Heather Lloyd-Jones, Jonathan Stephens, Rhian Gwilliam, Catherine M Rice, Christian Hengstenberg, Nilesh J Samani, Jeanette Erdmann, Heribert Schunkert, Tomi Pastinen, Panos Deloukas, Alison H Goodall, Willem H Ouwehand, François Cambien, Ann-Christine Syvänen.
Abstract
A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23300628 PMCID: PMC3530574 DOI: 10.1371/journal.pone.0052260
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of the power to detect cis-acting regulatory SNPs using allele-specific expression (ASE) and genotype expression (GTE) analysis.
| Type of p-value threshold | Method | Number of samples | |||
| 395 | 188 | 95 | 50 | ||
|
| |||||
| FDR 5% | ASE | NA | 203893 | 127990 | 65258 |
| GTE | 31963 | 16757 | 6879 | 2077 | |
| ASE/GTE | NA | 12.2 | 18.6 | 31.4 | |
| FDR 1% | ASE | NA | 155205 | 87523 | 38685 |
| GTE | 22651 | 11101 | 4242 | 1151 | |
| ASE/GTE | NA | 14.0 | 20.6 | 33.6 | |
| Bonferroni correction pcorr = 0.05 | ASE | NA | 58078 | 23213 | 6781 |
| GTE | 9424 | 4277 | 1439 | 390 | |
| ASE/GTE | NA | 13.6 | 16.1 | 17.4 | |
| Bonferroni correction pcorr = 0.01 | ASE | NA | 51383 | 19357 | 5191 |
| GTE | 8337 | 3677 | 1189 | 300 | |
| ASE/GTE | NA | 14.0 | 16.3 | 17.3 | |
|
| |||||
| FDR 5% | ASE | NA | 111978 | 76161 | 42100 |
| GTE | 24379 | 13479 | 5887 | 1863 | |
| ASE/GTE | NA | 8.3 | 12.9 | 22.6 | |
| FDR 1% | ASE | NA | 88837 | 54758 | 26284 |
| GTE | 17975 | 9281 | 3739 | 1064 | |
| ASE/GTE | NA | 9.6 | 14.6 | 24.7 | |
| Bonferroni correction pcorr = 0.05 | ASE | NA | 37995 | 16467 | 5037 |
| GTE | 8148 | 3855 | 1349 | 370 | |
| ASE/GTE | NA | 9.9 | 12.2 | 13.6 | |
| Bonferroni correction pcorr = 0.01 | ASE | NA | 34131 | 13876 | 3881 |
| GTE | 7266 | 3335 | 1119 | 284 | |
| ASE/GTE | NA | 10.2 | 12.4 | 13.7 | |
|
| |||||
| FDR 5% | ASE | NA | 12389 | 7111 | 6447 |
| GTE | 5051 | 3364 | 1850 | 740 | |
| ASE/GTE | NA | 3.7 | 3.8 | 8.7 | |
| FDR 1% | ASE | NA | 11914 | 6788 | 5472 |
| GTE | 3620 | 2256 | 1158 | 423 | |
| ASE/GTE | NA | 5.3 | 5.9 | 12.9 | |
| Bonferroni correction pcorr = 0.05 | ASE | NA | 8723 | 4395 | 1984 |
| GTE | 1867 | 1064 | 476 | 165 | |
| ASE/GTE | NA | 8.2 | 9.2 | 12.0 | |
| Bonferroni correction pcorr = 0.01 | ASE | NA | 8223 | 3994 | 1607 |
| GTE | 1712 | 948 | 407 | 133 | |
| ASE/GTE | NA | 8.7 | 9.8 | 12.1 | |
Median values of 10 runs for the random sample subsets. The top panel show the number of significant SNP-transcript associations, the middle panel show the number of significantly associated SNPs when counting only the best SNP-transcript association for each SNP, and the bottom panel show the number of transcripts that have at least one significantly associated SNP. Both the Bonferroni and FDR p-value thresholds are used. ASE/GTE = Ratio between number of significant ASE and GTE associations.
Figure 1Overlap of significantly associated rSNPs identified by ASE and GTE.
The percentage of overlapping rSNPs detected by allele-specific expression (ASE) and genotype expression (GTE) analysis is plotted for varying numbers of samples. The top 9536 SNPs from the GTE analysis are compared with the top 38203 SNPs from the ASE analysis, which corresponds to a Bonferroni threshold of p = 0.05 for a GTE sample size of 395 and an ASE sample size of 188. The p-value cut-offs were adapted so that the same SNP top-list sizes were obtained at all sample sizes for both GTE (p-value of 1.17E-7, 1.06E-4, 1.93E-3, 6.12E-3 for n = 395, n = 188, n = 95, and n = 50 respectively) and ASE (p-value of 8.06E-8, 9.35E-5, 4.90E-3 for n = 188, n = 95, and n = 50 respectively). The vertical axes show the percentage of SNPs in the top-lists detected by both GTE and ASE analysis and the horizontal axes show the number of samples analyzed using GTE and ASE, respectively. The percentage overlap is calculated by dividing the number of overlaps with the number of top SNPs in the GTE analysis. In (A), each line shows the effect on the number of overlapping SNPs detected by ASE analysis of a specific sample size when the sample size in GTE analysis was increased. In (B), each line shows the effect on the number of overlapping rSNPs detected by GTE analysis of a specific sample size when the samples size in ASE analysis is increased.
Figure 2The ability of ASE and GTE analysis to detect significantly associated rSNPs at different MAF.
Fractions of rSNPs are shown for different minor allele frequencies (MAF) with significant association signals according to a Bonferroni-corrected p-value of 0.05. Each data point underlying the curves represents the fraction of significant associations within a 1% MAF bin. Sliding 5% MAF window averages are plotted for different sample sizes analyzed by ASE and GTE. Both methods detect a lower fraction of low frequency rSNPs, compared to the fraction of all the SNPs at the same frequency (black line). The ASE method detects a higher fraction of the SNPs (solid lines) with a MAF <15% than GTE (dashed lines) regardless of sample size except for the largest GTE sample set.