Gabriel E Hoffman1, Jason G Mezey2, Eric E Schadt2. 1. Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA. 2. Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA.
Abstract
UNLABELLED: The linear mixed model is the state-of-the-art method to account for the confounding effects of kinship and population structure in genome-wide association studies (GWAS). Current implementations test the effect of one or more genetic markers while including prespecified covariates such as sex. Here we develop an efficient implementation of the linear mixed model that allows composite hypothesis tests to consider genotype interactions with variables such as other genotypes, environment, sex or ancestry. Our R package, lrgpr, allows interactive model fitting and examination of regression diagnostics to facilitate exploratory data analysis in the context of the linear mixed model. By leveraging parallel and out-of-core computing for datasets too large to fit in main memory, lrgpr is applicable to large GWAS datasets and next-generation sequencing data. AVAILABILITY AND IMPLEMENTATION: lrgpr is an R package available from lrgpr.r-forge.r-project.org.
UNLABELLED: The linear mixed model is the state-of-the-art method to account for the confounding effects of kinship and population structure in genome-wide association studies (GWAS). Current implementations test the effect of one or more genetic markers while including prespecified covariates such as sex. Here we develop an efficient implementation of the linear mixed model that allows composite hypothesis tests to consider genotype interactions with variables such as other genotypes, environment, sex or ancestry. Our R package, lrgpr, allows interactive model fitting and examination of regression diagnostics to facilitate exploratory data analysis in the context of the linear mixed model. By leveraging parallel and out-of-core computing for datasets too large to fit in main memory, lrgpr is applicable to large GWAS datasets and next-generation sequencing data. AVAILABILITY AND IMPLEMENTATION: lrgpr is an R package available from lrgpr.r-forge.r-project.org.
Authors: Jennifer Listgarten; Christoph Lippert; Carl M Kadie; Robert I Davidson; Eleazar Eskin; David Heckerman Journal: Nat Methods Date: 2012-05-30 Impact factor: 28.547
Authors: Hyun Min Kang; Jae Hoon Sul; Susan K Service; Noah A Zaitlen; Sit-Yee Kong; Nelson B Freimer; Chiara Sabatti; Eleazar Eskin Journal: Nat Genet Date: 2010-03-07 Impact factor: 38.330
Authors: Christoph Lippert; Jennifer Listgarten; Ying Liu; Carl M Kadie; Robert I Davidson; David Heckerman Journal: Nat Methods Date: 2011-09-04 Impact factor: 28.547
Authors: Gulnara R Svishcheva; Tatiana I Axenovich; Nadezhda M Belonogova; Cornelia M van Duijn; Yurii S Aulchenko Journal: Nat Genet Date: 2012-09-16 Impact factor: 38.330
Authors: Jennifer Listgarten; Christoph Lippert; Eun Yong Kang; Jing Xiang; Carl M Kadie; David Heckerman Journal: Bioinformatics Date: 2013-04-18 Impact factor: 6.937
Authors: Jacqueline M Otto; Ian R Gizer; Chris Bizon; Kirk C Wilhelmsen; Cindy L Ehlers Journal: Drug Alcohol Depend Date: 2016-08-10 Impact factor: 4.492