| Literature DB >> 35907789 |
Anna Hutchinson1, James Liley2,3, Chris Wallace4,5,6.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) are limited in power to detect associations that exceed the stringent genome-wide significance threshold. This limitation can be alleviated by leveraging relevant auxiliary data, such as functional genomic data. Frameworks utilising the conditional false discovery rate have been developed for this purpose, and have been shown to increase power for GWAS discovery whilst controlling the false discovery rate. However, the methods are currently only applicable for continuous auxiliary data and cannot be used to leverage auxiliary data with a binary representation, such as whether SNPs are synonymous or non-synonymous, or whether they reside in regions of the genome with specific activity states.Entities:
Keywords: FDR; Functional genomics; GWAS; Multiple testing; Power
Mesh:
Year: 2022 PMID: 35907789 PMCID: PMC9338519 DOI: 10.1186/s12859-022-04838-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Simulation results for Binary cFDR and BL. Mean +/− standard error for the sensitivity, specificity and FDR of FDR values (derived from the Benjamini–Hochberg procedure) from Binary cFDR when iterating over independent (A; “simulation A”) and dependent (B; “simulation B” and C; “simulation C”) binary auxiliary data. BL refers to results when using Boca and Leek’s FDR regression to leverage the 5-dimensional covariate data. Iteration 0 corresponds to the original FDR values. Results were averaged across 100 simulations
Fig. 2Summary of cFDR results from type 1 diabetes application. A FDR values (derived from the Benjamini–Hochberg procedure) before and after each iteration of cFDR, coloured by the auxiliary data values. B Manhattan plot of () FDR values (y-axis truncated to aid visualisation). Green points indicate the four lead variants that were newly FDR significant after cFDR. Black dashed line at FDR significance threshold ()
Table of newly significant index SNPs from type 1 diabetes application
| rsID | Position | Ref/Alt | OR | SE | RA | DGF | H3K27ac percentile | Gene | ||
|---|---|---|---|---|---|---|---|---|---|---|
| rs1052553 | chr17:44073889 | A/G | 0.889 | 0.022 | 1 | 2.2th | ||||
| rs3024505 | chr1:206939904 | G/A | 0.864 | 0.027 | 0.601 | 1 | 87.4th | |||
| rs6518350 | chr21:45621817 | A/G | 0.880 | 0.024 | 0.062 | 0 | 72.7th | |||
| rs13415583 | chr2:100764087 | T/G | 0.904 | 0.019 | 0 | 14.4th |
For each of the four newly significant SNPs from the cFDR analysis, we list the rsID, genomic position (hg19; Position), reference and alternative alleles (Ref/Alt), odds ratio (OR), standard error (SE) and p value reported in the primary GWAS data set [18], v-value from the cFDR analysis, GWAS p value for rheumatoid arthritis [27] (RA p value), binary indicator of SNP overlap with regulatory factor binding sites (DGF), percentile of mean H3K27ac fold change value across asthma relevant cell types (H3K27ac percentile) and the closest protein-coding gene