MOTIVATION: Identification of expression Quantitative Trait Loci (eQTL), the genetic loci that contribute to heritable variation in gene expression, can be obstructed by factors that produce variation in expression profiles if these factors are unmeasured or hidden from direct analysis.

METHODS: We have developed a method for Hidden Expression Factor analysis (HEFT) that identifies individual and pleiotropic effects of eQTL in the presence of hidden factors. The HEFT model is a combined multivariate regression and factor analysis, where the complete likelihood of the model is used to derive a ridge estimator for simultaneous factor learning and detection of eQTL. HEFT requires no pre-estimation of hidden factor effects, provides P-values, and is fast: an eQTL analysis of thousands of expression variables against hundreds of thousands of single nucleotide polymorphisms completes in a few hours on a standard 8-core 2.6 GHz desktop.

RESULTS: By analyzing simulated data, we demonstrate that HEFT can correct for an unknown number of hidden factors and significantly outperforms all related hidden factor methods for eQTL analysis when there are eQTL with univariate and multivariate (pleiotropic) effects. To demonstrate a real-world application, we applied HEFT to identify eQTL affecting gene expression in the human lung for a study that included presumptive hidden factors. HEFT identified all of the cis-eQTL found by other hidden factor methods and 91 additional cis-eQTL. HEFT also identified a number of eQTL with direct relevance to lung disease that could not be found without a hidden factor analysis, including cis-eQTL for GTF2H1 and MTRR, genes that have been independently associated with lung cancer.

AVAILABILITY: Software is available at http://mezeylab.cb.bscb.cornell.edu/Software.aspx.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
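ILLUSTRATION: The obstruction described under MOTIVATION can be made concrete with a small simulation. The sketch below is not HEFT: HEFT learns factors and eQTL effects jointly through a ridge estimator, whereas this example uses a simpler two-step stand-in (estimate one factor as the top principal component of the SNP-residualized expression matrix, then include it as a covariate). All names (`simulate`, `snp_t_stats`), the sample size, the effect size of 0.5, the constant factor loading of 3.0, and the uniform 0/1/2 genotype coding are assumptions chosen for illustration only.

```python
import numpy as np

def ols(X, y):
    """Ordinary least squares: coefficients and their standard errors."""
    n, p = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    beta = xtx_inv @ X.T @ y
    resid = y - X @ beta
    sigma2 = resid @ resid / (n - p)          # residual variance estimate
    se = np.sqrt(sigma2 * np.diag(xtx_inv))
    return beta, se

def simulate(n=500, n_genes=50, seed=0):
    """Toy data: one SNP affects gene 0; one hidden factor affects every gene."""
    rng = np.random.default_rng(seed)
    snp = rng.integers(0, 3, size=n).astype(float)  # assumed 0/1/2 coding
    hidden = rng.normal(size=n)                     # unmeasured factor
    loadings = np.full(n_genes, 3.0)                # assumed factor loadings
    effects = np.zeros(n_genes)
    effects[0] = 0.5                                # assumed eQTL effect
    noise = rng.normal(size=(n, n_genes))
    Y = np.outer(snp, effects) + np.outer(hidden, loadings) + noise
    return snp, Y

def snp_t_stats(snp, Y, gene=0):
    """t-statistic of the SNP for one gene, without and with a factor covariate."""
    n = len(snp)
    X = np.column_stack([np.ones(n), snp])

    # Naive test: the hidden factor inflates the residual variance.
    beta, se = ols(X, Y[:, gene])
    t_naive = beta[1] / se[1]

    # Two-step stand-in: remove the SNP fit from all genes, then take the
    # top principal component of the residuals as the estimated factor.
    B = np.linalg.lstsq(X, Y, rcond=None)[0]
    R = Y - X @ B
    U, s, _ = np.linalg.svd(R, full_matrices=False)
    factor = U[:, 0] * s[0]

    beta_f, se_f = ols(np.column_stack([X, factor]), Y[:, gene])
    t_adj = beta_f[1] / se_f[1]
    return t_naive, t_adj

if __name__ == "__main__":
    snp, Y = simulate()
    t_naive, t_adj = snp_t_stats(snp, Y)
    print(f"t without factor: {t_naive:.2f}, t with estimated factor: {t_adj:.2f}")
```

Because the estimated factor is built from residuals that are orthogonal to the SNP, the SNP coefficient itself does not change in this sketch; only its standard error shrinks, which is why the test statistic grows once the factor is accounted for.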