Qi Yan1, Erick Forno2, Juan C Celedón2,3, Wei Chen2,3,4, Daniel E Weeks3,4. 1. Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, 10032. 2. Division of Pediatric Pulmonary Medicine, UPMC Children's Hospital of Pittsburgh, Pittsburgh, PA, 15224. 3. Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, 15261, USA. 4. Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
Abstract
MOTIVATION: Allele specific differences in molecular traits can be obtained from next generation sequencing data and could potentially improve testing power, but such information is usually overlooked in association studies. Furthermore, the variation of molecular quantitative traits (e.g., gene expression) could result from the interaction effect of genotypes and phenotypes, but it is challenging to identify such interaction signals in complex disease studies in humans due to small genetic effect sizes and/or small sample sizes. RESULTS: We develop a novel statistical method, the combined haplotype interaction test (CHIT), which tests for association between molecular quantitative traits and phenotype-genotype interactions by modeling the total read counts and allele-specific reads in a target region. CHIT can be used as a supplementary analysis to the regular linear interaction regression. In our simulations, CHIT obtains non-inflated type I error rates, and it has higher power than a standard interaction quantitative trait locus approach based on linear regression models. Finally, we illustrate CHIT by testing associations between gene expression obtained by RNA-seq and the interaction of SNPs and atopy status from a study of childhood asthma in Puerto Ricans, and results demonstrate that CHIT could be more powerful than a standard linear interaction expression quantitative trait loci (eQTL) approach. AVAILABILITY: The CHIT algorithm has been implemented in Python. The source code and documentation are available and can be downloaded from https://github.com/QiYanPitt/CHIT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Allele specific differences in molecular traits can be obtained from next generation sequencing data and could potentially improve testing power, but such information is usually overlooked in association studies. Furthermore, the variation of molecular quantitative traits (e.g., gene expression) could result from the interaction effect of genotypes and phenotypes, but it is challenging to identify such interaction signals in complex disease studies in humans due to small genetic effect sizes and/or small sample sizes. RESULTS: We develop a novel statistical method, the combined haplotype interaction test (CHIT), which tests for association between molecular quantitative traits and phenotype-genotype interactions by modeling the total read counts and allele-specific reads in a target region. CHIT can be used as a supplementary analysis to the regular linear interaction regression. In our simulations, CHIT obtains non-inflated type I error rates, and it has higher power than a standard interaction quantitative trait locus approach based on linear regression models. Finally, we illustrate CHIT by testing associations between gene expression obtained by RNA-seq and the interaction of SNPs and atopy status from a study of childhood asthma in Puerto Ricans, and results demonstrate that CHIT could be more powerful than a standard linear interaction expression quantitative trait loci (eQTL) approach. AVAILABILITY: The CHIT algorithm has been implemented in Python. The source code and documentation are available and can be downloaded from https://github.com/QiYanPitt/CHIT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras Journal: Bioinformatics Date: 2012-10-25 Impact factor: 6.937
Authors: Sayantan Das; Lukas Forer; Sebastian Schönherr; Carlo Sidore; Adam E Locke; Alan Kwong; Scott I Vrieze; Emily Y Chew; Shawn Levy; Matt McGue; David Schlessinger; Dwight Stambolian; Po-Ru Loh; William G Iacono; Anand Swaroop; Laura J Scott; Francesco Cucca; Florian Kronenberg; Michael Boehnke; Gonçalo R Abecasis; Christian Fuchsberger Journal: Nat Genet Date: 2016-08-29 Impact factor: 38.330
Authors: David A Knowles; Joe R Davis; Hilary Edgington; Anil Raj; Marie-Julie Favé; Xiaowei Zhu; James B Potash; Myrna M Weissman; Jianxin Shi; Douglas F Levinson; Philip Awadalla; Sara Mostafavi; Stephen B Montgomery; Alexis Battle Journal: Nat Methods Date: 2017-05-22 Impact factor: 28.547
Authors: Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard Journal: Bioinformatics Date: 2009-10-06 Impact factor: 6.937
Authors: Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin Journal: Nat Genet Date: 2016-08-22 Impact factor: 38.330