| Literature DB >> 29727690 |
Yu-Ru Su1, Chongzhi Di2, Stephanie Bien2, Licai Huang2, Xinyuan Dong3, Goncalo Abecasis4, Sonja Berndt5, Stephane Bezieau6, Hermann Brenner7, Bette Caan8, Graham Casey9, Jenny Chang-Claude10, Stephen Chanock5, Sai Chen11, Charles Connolly2, Keith Curtis2, Jane Figueiredo12, Manish Gala13, Steven Gallinger14, Tabitha Harrison2, Michael Hoffmeister7, John Hopper15, Jeroen R Huyghe2, Mark Jenkins15, Amit Joshi16, Loic Le Marchand17, Polly Newcomb18, Deborah Nickerson19, John Potter18, Robert Schoen20, Martha Slattery21, Emily White18, Brent Zanke22, Ulrike Peters18, Li Hsu23.
Abstract
Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much attention and shown promise for discovering novel variants. One such approach, PrediXcan, is to use predicted gene expression through genetic regulation. However, there are limitations in this approach. The predicted gene expression may be biased, resulting from regularized regression applied to moderately sample-sized reference studies. Further, some variants can individually influence disease risk through alternative functional mechanisms besides expression. Thus, testing only the association of predicted gene expression as proposed in PrediXcan will potentially lose power. To tackle these challenges, we consider a unified mixed effects model that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random. We consider a set-based score testing framework, MiST (mixed effects score test), and propose two data-driven combination approaches to jointly test for the fixed and random effects. We establish the asymptotic distributions, which enable rapid calculation of p values for genome-wide analyses, and provide p values for fixed and random effects separately to enhance interpretability over GWASs. Extensive simulations demonstrate that our approaches are more powerful than existing ones. We apply our approach to a large-scale GWAS of colorectal cancer and identify two genes, POU5F1B and ATF1, which would have otherwise been missed by PrediXcan, after adjusting for all known loci.Entities:
Keywords: data-adaptive weight; expression quantitative trait locus; functional annotation; genome-wide association study; mixed-effects score test; set-based association; variance component test
Mesh:
Year: 2018 PMID: 29727690 PMCID: PMC5986723 DOI: 10.1016/j.ajhg.2018.03.019
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.025