| Literature DB >> 29717146 |
Karolina Sikorska1, Emmanuel Lesaffre2, Patrick J F Groenen3, Fernando Rivadeneira4, Paul H C Eilers5.
Abstract
Genome-wide association studies (GWAS) with longitudinal phenotypes provide opportunities to identify genetic variations associated with changes in human traits over time. Mixed models are used to correct for the correlated nature of longitudinal data. GWA studies are notorious for their computational challenges, which are considerable when mixed models for thousands of individuals are fitted to millions of SNPs. We present a new algorithm that speeds up a genome-wide analysis of longitudinal data by several orders of magnitude. It solves the equivalent penalized least squares problem efficiently, computing variances in an initial step. Factorizations and transformations are used to avoid inversion of large matrices. Because the system of equations is bordered, we can re-use components, which can be precomputed for the mixed model without a SNP. Two SNP effects (main and its interaction with time) are obtained. Our method completes the analysis a thousand times faster than the R package lme4, providing an almost identical solution for the coefficients and p-values. We provide an R implementation of our algorithm.Entities:
Mesh:
Year: 2018 PMID: 29717146 PMCID: PMC5931565 DOI: 10.1038/s41598-018-24578-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Speed-up compared to the lmer function in R. Results based on the simulated data for 1000 SNPs, 4 time points and 3 covariates. Performed on a 64-bit Windows running on a laptop with CPU @ 2.3 GHz and 6 GB RAM.
Figure 2Simulation study. Accuracy of the coefficients computed by GALLOP compared to lmer.
Figure 3Simulation study. Accuracy of the p-values computed by GALLOP compared to lmer.
Figure 4BMD data. Accuracy of the p-values for the GALLOP and CTS.
Figure 5Time of the genome-wide analysis of the BMD data, 97384 SNPs from chromosome 22. Time spent on data access and time spent on computations are separated.