| Literature DB >> 28931044 |
Shuyan Mei1, Ali Karimnezhad2,3,4, Marie Forest5, David R Bickel3,4,6, Celia M T Greenwood1,5,7,8.
Abstract
The maximum entropy (ME) method is a recently-developed approach for estimating local false discovery rates (LFDR) that incorporates external information allowing assignment of a subset of tests to a category with a different prior probability of following the null hypothesis. Using this ME method, we have reanalyzed the findings from a recent large genome-wide association study of coronary artery disease (CAD), incorporating biologic annotations. Our revised LFDR estimates show many large reductions in LFDR, particularly among the genetic variants belonging to annotation categories that were known to be of particular interest for CAD. However, among SNPs with rare minor allele frequencies, the reductions in LFDR were modest in size.Entities:
Mesh:
Year: 2017 PMID: 28931044 PMCID: PMC5607215 DOI: 10.1371/journal.pone.0185174
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1FDR and LFDR.
FDR at a cutoff of 1.95 (p-value 0.05 for a normally distributed test) is the ratio of the area of the light blue region divided by the area of (beige plus light blue). LFDR compares the height of the dark blue line to the height of the brown line.
Number of SNPs by minor allele frequency bins, as well as the number and percentage of significant SNPs, using several definitions of statistical significance.
| MAF bins | <0.001 | 0.001–0.005 | 0.005–0.01 | 0.01–0.05 | ≥0.05 | All |
|---|---|---|---|---|---|---|
| 0 | 0 | 240,423 | 2,500,103 | 6,715,230 | 9,455,778 | |
| 0 | 0 | 2.54 | 26.44 | 71.02 | 100 | |
| 0 | 0 | 103 | 1,988 | 30,417 | 32,508 | |
| 0 | 0 | 0.32 | 6.11 | 93.57 | 100 | |
| 0 | 0 | 0 | 61 | 2,152 | 2,213 | |
| 0 | 0 | 0 | 2.76 | 97.24 | 100 | |
| 0 | 0 | 0 | 39 | 1,836 | 1,875 | |
| 0 | 0 | 0 | 2.08 | 97.92 | 100 |
*P deviated: the p-value was in the tail of the QQ plot, after a point of inflexion where the line sloped away from the line of expectation. This includes all SNPs with p< 0.0074
Observed heritability (h2 obs) and its standard error (SE), expected heritability (h2 exp) and the adjusted P-value from LD-score regression for enrichment in CAD.
Also, the distances between p-value distributions (D-statistics) from Kolmogorov-Smirnov tests are shown, comparing different MAF groups: (a) [0.005–0.01) vs. [0.01–0.05); (b) [0.005–0.001) vs. (≥0.05); (c) [0.01–0.05) vs. (≥0.05).
| Annotation Category | h2 obs (SE) | h2 exp | P-value (adjusted) | # of SNPs | KS-test D measure (a,b,c) |
|---|---|---|---|---|---|
| Enhancer_Hoffman. extend.500 | 0.18 (0.03) | 0.03 | 1.1x10-04 | 401,897 | 0.030, 0.069, 0.042 |
| H3K9ac_Trynka | 0.15 (0.03) | 0.02 | 2.7x10-04 | 322,412 | 0.027, 0.074, 0.048 |
| H3K9ac_Trynka.extend.500 | 0.18 (0.03) | 0.04 | 3.7x10-04 | 601,848 | 0.028, 0.072, 0.045 |
| Enhancer_Hoffman | 0.14 (0.03) | 0.01 | 4.1x10-04 | 163,480 | 0.030, 0.072, 0.044 |
| H3K27ac_PGC2.extend.500 | 0.19 (0.03) | 0.07 | 3.8x10-03 | 962,593 | 0.024, 0.065, 0.041 |
| H3K4me3_Trynka.extend.500 | 0.20 (0.04) | 0.05 | 3.9x10-03 | 713,844 | 0.024, 0.065, 0.042 |
| H3K27ac_PGC2 | 0.18 (0.03) | 0.05 | 3.9x10-03 | 768,410 | 0.024, 0.065, 0.042 |
| H3K9ac_peaks_Trynka | 0.11 (0.03) | 0.01 | 4.0x10-03 | 95,531 | 0.032, 0.079, 0.049 |
| FetalDHS_Trynka | 0.18 (0.04) | 0.02 | 9.1x10-03 | 255,582 | 0.022, 0.059, 0.039 |
(1) Adjusted p-value for enrichment, using a Bonferroni correction
(2) The number of SNPs used for the adjusted p-value
(3) “extend.500” implies that a 500 base pair window around the category was included with the annotation to minimize inflation of heritability from flanking regions[22]
Fig 2Changes in LFDR estimates between unadjusted LFDR and LFDR estimated with the ME method, when each of nine functional annotations are used to define a high risk subset of SNPs.
Within each panel, the three distributions are divided by p-value ranges: unadjusted p<0.05; unadjusted p<0.01; unadjusted p<0.001.
Fig 3LFDR estimates with the ME method, as a function of the –log(10) of the raw p-values, for all nine SNP annotation categories considered.
Fig 4Histogram of LFDR differences for three MAF categories using the Enhancer Hoffman.extended.500 annotation.
Differences are on an 0.25 power scale. (Left) MAF between 0.005 and 0.01. (Middle) MAF between 0.01 and 0.05. (Right) MAF greater than 0.05.
Fig 5Scatter plot of the LFDR-ME estimates by minor allele frequency and the decrease in LFDR estimates using the ME method, when using the H3K9ac annotation.