| Literature DB >> 24564682 |
Arindom Chakraborty, Guanglong Jiang, Malaz Boustani, Yunlong Liu, Todd Skaar, Lang Li.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with complex human diseases, clinical conditions and traits. Genetic mapping of expression quantitative trait loci (eQTLs) is providing us with novel functional effects of thousands of single nucleotide polymorphisms (SNPs). In a classical quantitative trail loci (QTL) mapping problem multiple tests are done to assess whether one trait is associated with a number of loci. In contrast to QTL studies, thousands of traits are measured alongwith thousands of gene expressions in an eQTL study. For such a study, a huge number of tests have to be performed (~10(6)). This extreme multiplicity gives rise to many computational and statistical problems. In this paper we have tried to address these issues using two closely related inferential approaches: an empirical Bayes method that bears the Bayesian flavor without having much a priori knowledge and the frequentist method of false discovery rates. A three-component t-mixture model has been used for the parametric empirical Bayes (PEB) method. Inferences have been obtained using Expectation/Conditional Maximization Either (ECME) algorithm. A simulation study has also been performed and has been compared with a nonparametric empirical Bayes (NPEB) alternative.Entities:
Mesh:
Year: 2013 PMID: 24564682 PMCID: PMC4042241 DOI: 10.1186/1471-2164-14-S8-S8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A part of the simulated data for .
Figure 2Effect of minor allele frequency (MAF) on the null distribution. Only upper quantiles (from 80%) have been considered as lower quantiles showing almost no difference.
Figure 3QQ-plot for eight SNPs.
The True FDR Performance of Controlled FDR in EB Models
| 0.01 | 0.05 | 0.10 | 0.01 | 0.05 | 0.10 | |
| 0.01 | 0.004 | 0.029 | 0.067 | 0.005 | 0.042 | 0.090 |
| 0.05 | 0.006 | 0.041 | 0.079 | 0.006 | 0.045 | 0.094 |
| 0.10 | 0.007 | 0.043 | 0.087 | 0.008 | 0.047 | 0.097 |
Figure 4Minor allele frequency (MAF) distribution. X axis corresponds to minor allele frequency 25% to 50%.
Number of eQTL pairs after crossing the threshold of FDR
| Gene symbol | No. of SNPs (FDR<10%) | No. of cis-SNP | No. of cis-eSNP (FDR<10%) by Yang et al. (2010) |
|---|---|---|---|
| CYP3A5 | 263 | 62 | 56 |
| CYP2D6 | 264 | 67 | 54 |
| CYP4F12 | 392 | 55 | 46 |
| CYP2E1 | 130 | 45 | 31 |
| CYP2U1 | 549 | 45 | 26 |
| CYP1B1 | 168 | 21 | 13 |
| CYP2C18 | 90 | 13 | 9 |
| CYP4F11 | 169 | 15 | 7 |
| CYP4V2 | 159 | 25 | 3 |
| CYP2F1 | 324 | 10 | 2 |
| CYP39A1 | 448 | 17 | 2 |
| CYP26C1 | 154 | 29 | 1 |
| CYP2C19 | 356 | 7 | 1 |
| CYP2C9 | 413 | 20 | 1 |
| CYP2S1 | 319 | 10 | 1 |
| CYP46A1 | 430 | 7 | 1 |
| CYP4A11 | 461 | 4 | 1 |
| CYP4X1 | 151 | 3 | 1 |