Andrey A Shabalin1. 1. Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA. shabalin@email.unc.edu
Abstract
MOTIVATION: Expression quantitative trait loci (eQTL) analysis links variations in gene expression levels to genotypes. For modern datasets, eQTL analysis is a computationally intensive task as it involves testing for association of billions of transcript-SNP (single-nucleotide polymorphism) pair. The heavy computational burden makes eQTL analysis less popular and sometimes forces analysts to restrict their attention to just a small subset of transcript-SNP pairs. As more transcripts and SNPs get interrogated over a growing number of samples, the demand for faster tools for eQTL analysis grows stronger. RESULTS: We have developed a new software for computationally efficient eQTL analysis called Matrix eQTL. In tests on large datasets, it was 2-3 orders of magnitude faster than existing popular tools for QTL/eQTL analysis, while finding the same eQTLs. The fast performance is achieved by special preprocessing and expressing the most computationally intensive part of the algorithm in terms of large matrix operations. Matrix eQTL supports additive linear and ANOVA models with covariates, including models with correlated and heteroskedastic errors. The issue of multiple testing is addressed by calculating false discovery rate; this can be done separately for cis- and trans-eQTLs.
MOTIVATION: Expression quantitative trait loci (eQTL) analysis links variations in gene expression levels to genotypes. For modern datasets, eQTL analysis is a computationally intensive task as it involves testing for association of billions of transcript-SNP (single-nucleotide polymorphism) pair. The heavy computational burden makes eQTL analysis less popular and sometimes forces analysts to restrict their attention to just a small subset of transcript-SNP pairs. As more transcripts and SNPs get interrogated over a growing number of samples, the demand for faster tools for eQTL analysis grows stronger. RESULTS: We have developed a new software for computationally efficient eQTL analysis called Matrix eQTL. In tests on large datasets, it was 2-3 orders of magnitude faster than existing popular tools for QTL/eQTL analysis, while finding the same eQTLs. The fast performance is achieved by special preprocessing and expressing the most computationally intensive part of the algorithm in terms of large matrix operations. Matrix eQTL supports additive linear and ANOVA models with covariates, including models with correlated and heteroskedastic errors. The issue of multiple testing is addressed by calculating false discovery rate; this can be done separately for cis- and trans-eQTLs.
Authors: Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham Journal: Am J Hum Genet Date: 2007-07-25 Impact factor: 11.025
Authors: Joe R Davis; Laure Fresard; David A Knowles; Mauro Pala; Carlos D Bustamante; Alexis Battle; Stephen B Montgomery Journal: Am J Hum Genet Date: 2015-12-31 Impact factor: 11.025
Authors: Gregory Stone; Ashley Choi; Oliva Meritxell; Joshua Gorham; Mahyar Heydarpour; Christine E Seidman; Jon G Seidman; Sary F Aranki; Simon C Body; Vincent J Carey; Benjamin A Raby; Barbara E Stranger; Jochen D Muehlschlegel Journal: Hum Mol Genet Date: 2019-05-15 Impact factor: 6.150
Authors: Marc Parisien; Samar Khoury; Anne-Julie Chabot-Doré; Susana G Sotocinal; Gary D Slade; Shad B Smith; Roger B Fillingim; Richard Ohrbach; Joel D Greenspan; William Maixner; Jeffrey S Mogil; Inna Belfer; Luda Diatchenko Journal: Cell Rep Date: 2017-05-30 Impact factor: 9.423
Authors: Jeffrey Hsu; Shamone Gore-Panter; Gregory Tchou; Laurie Castel; Beth Lovano; Christine S Moravec; Gosta B Pettersson; Eric E Roselli; A Marc Gillinov; Kenneth R McCurry; Nicholas G Smedira; John Barnard; David R Van Wagoner; Mina K Chung; Jonathan D Smith Journal: Circ Genom Precis Med Date: 2018-03
Authors: Haiyang Guo; Musaddeque Ahmed; Fan Zhang; Cindy Q Yao; SiDe Li; Yi Liang; Junjie Hua; Fraser Soares; Yifei Sun; Jens Langstein; Yuchen Li; Christine Poon; Swneke D Bailey; Kinjal Desai; Teng Fei; Qiyuan Li; Dorota H Sendorek; Michael Fraser; John R Prensner; Trevor J Pugh; Mark Pomerantz; Robert G Bristow; Mathieu Lupien; Felix Y Feng; Paul C Boutros; Matthew L Freedman; Martin J Walsh; Housheng Hansen He Journal: Nat Genet Date: 2016-08-15 Impact factor: 38.330