| Literature DB >> 27307612 |
Dat Duong1, Jennifer Zou1, Farhad Hormozdiari1, Jae Hoon Sul2, Jason Ernst3, Buhm Han4, Eleazar Eskin5.
Abstract
MOTIVATION: Expression quantitative trait loci (eQTLs) are genetic variants that affect gene expression. In eQTL studies, one important task is to find eGenes or genes whose expressions are associated with at least one eQTL. The standard statistical method to determine whether a gene is an eGene requires association testing at all nearby variants and the permutation test to correct for multiple testing. The standard method however does not consider genomic annotation of the variants. In practice, variants near gene transcription start sites (TSSs) or certain histone modifications are likely to regulate gene expression. In this article, we introduce a novel eGene detection method that considers this empirical evidence and thereby increases the statistical power.Entities:
Mesh:
Year: 2016 PMID: 27307612 PMCID: PMC4908356 DOI: 10.1093/bioinformatics/btw272
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(a) Q–Q plots of the uniform density quantiles against the simulated eGene P-value quantiles using Func-eGene at the gene ENSG00000204219.5 under the null hypothesis. (b) Func-eGene simulated statistical power at the gene ENSG00000204219.5
Number of candidate eGenes discovered by permutation and Mvn Func-eGene using uniform and non-uniform priors for 21 868 genes in the liver tissue GTEx data
| Permutation | Mvn | Overlap | ||||
|---|---|---|---|---|---|---|
| eGene | Yes | No | Yes | No | Yes | No |
| Uniform | 1626 | 20 242 | 1582 | 20 286 | 1549 | 20 209 |
| Non-uniform | 2445 | 19 423 | 2449 | 19 419 | 2379 | 19 353 |
| Overlap | 1484 | 19 281 | 1457 | 19294 |
aCondition on uniform prior or non-uniform prior and count the number of eGenes (or not eGenes) that permutation Func-eGene agrees with Mvn Func-eGene.
bIndicates the number of genes detected to be eGenes.
cUniform prior uses the prior ratio 1/1 for all variants.
dNon-uniform prior uses the prior ratio 100/1 so that variants in the TSS are 100 times more likely to affect gene expression.
eCondition on permutation or Mvn Func-eGene and count the number of eGenes (or not eGenes) that the uniform prior agrees with non-uniform prior.
Fig. 2.(a) Scatter plot of eGene P-values using Mvn Func-eGene and the permutation test. (b) Histogram of the difference between eGene P-values using Mvn Func-eGene and the permutation test
Fig. 3.Histogram of the eGene P-values using Mvn Func-eGene and the permutation test
Fig. 4.(a) Scatter plot of the eGene Q-values using Mvn Func-eGene and the permutation test. (b) Histogram of the difference between eGene Q-values using Mvn Func-eGene and the permutation test
The number of eGene discovered at 19 annotation score ratios
| Row | Ratio | eGene | Row | Ratio | eGene |
|---|---|---|---|---|---|
| 1 | 1:1:1 | 2057 | 10 | 1:1:10 | 1579 |
| 2 | 1:10:1 | 2032 | 11 | 1:10:10 | 1747 |
| 3 | 1:100:1 | 1890 (1834) | 12 | 1:100:10 | 1904 |
| 4 | 10:1:1 | 2450 | 13 | 10:1:10 | 2060 |
| 5 | 10:10:1 | 2331 | 14 | 100:1:10 | 2473 (2413) |
| 6 | 10:100:1 | 1991 | 15 | 1:1:100 | 1280 |
| 7 | 100:1:1 | 2493 (2449) | 16 | 1:10:100 | 1391 |
| 8 | 100:10:1 | 2489 (2449) | 17 | 1:100:100 | 1673 |
| 9 | 100:100:1 | 2329 | 18 | 10:1:100 | 1548 |
| 19 | 100:1:100 | 2014 |
aPrior ratios of variants inside and outside an annotation. These ratios are in order .
bThe numbers are obtained using the approximation method in Section 2.2.5. Numbers in parentheses are obtained using Mvn Func-eGene
The number of candidate eGenes detected by Mvn Func-eGene at the best priors in each annotation
| Annotation | (%) | Ratio | eGene |
|---|---|---|---|
| H3K27ac | 12.25 | 40/1 | 1944 |
| H3K27me3 | 7.26 | 1/70 | 1880 |
| H3K4me1 | 16.38 | 80/1 | 1858 |
| H3K4me3 | 7.73 | 50/1 | 1917 |
| H3K9ac | 9.74 | 100/1 | 1861 |
| H3K9me3 | 11.92 | 1/50 | 1879 |
| TSS | 14.74 | 60/1 | 2479 |
| DNase | 4.66 | 100/1 | 1834 |
| Uniform | 100 | 1/1 | 1592 |
aAverage percent of variants in an annotation.
bBest prior ratios of variants inside and outside an annotation given by the method in Section 2.2.5.
cNumbers are obtained by using Mvn Func-eGene at the best ratios.
dAssociated with gene activation.
eAssociated with gene suppression
Fig. 5.(a) False-positive rate for permutation and Mvn Func-eGene using prior ratio 60/1. (b) Mvn Func-eGene statistical power at gene ENSG00000204219.5 using ratio 60/1 is not worse than the ratio 100/1
Fig. 6.Permutation and Mvn Func-eGene runtime