Xin Zhou1, Xiaodong Cai1. 1. Department of Electrical and Computer Engineering, University of Miami, Florida 33146, USA.
Abstract
MOTIVATION: Genetic variations of expression quantitative trait loci (eQTLs) play a critical role in influencing complex traits and diseases development. Two main factors that affect the statistical power of detecting eQTLs are: 1) relatively small size of samples available, and 2) heavy burden of multiple testing due to a very large number of variants to be tested. The later issue is particularly severe when one tries to identify trans-eQTLs that are far away from the genes they influence. If one can exploit co-expressed genes jointly in eQTL-mapping, effective sample size can be increased. Furthermore, using the structure of the gene regulatory network (GRN) may help to identify trans-eQTLs without increasing multiple testing burden. RESULTS: In this paper, we employ the structure equation model (SEM) to model both GRN and effect of eQTLs on gene expression, and then develop a novel algorithm, named sparse SEM for eQTL mapping (SSEMQ), to conduct joint eQTL mapping and GRN inference. The SEM can exploit co-expressed genes jointly in eQTL mapping and also use GRN to determine trans-eQTLs. Computer simulations demonstrate that our SSEMQ significantly outperforms nine existing eQTL mapping methods. SSEMQ is further employed to analyze two real datasets of human breast and whole blood tissues, yielding a number of cis- and trans-eQTLs. AVAILABILITY: R package ssemQr is available at https://github.com/Ivis4ml/ssemQr.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Genetic variations of expression quantitative trait loci (eQTLs) play a critical role in influencing complex traits and diseases development. Two main factors that affect the statistical power of detecting eQTLs are: 1) relatively small size of samples available, and 2) heavy burden of multiple testing due to a very large number of variants to be tested. The later issue is particularly severe when one tries to identify trans-eQTLs that are far away from the genes they influence. If one can exploit co-expressed genes jointly in eQTL-mapping, effective sample size can be increased. Furthermore, using the structure of the gene regulatory network (GRN) may help to identify trans-eQTLs without increasing multiple testing burden. RESULTS: In this paper, we employ the structure equation model (SEM) to model both GRN and effect of eQTLs on gene expression, and then develop a novel algorithm, named sparse SEM for eQTL mapping (SSEMQ), to conduct joint eQTL mapping and GRN inference. The SEM can exploit co-expressed genes jointly in eQTL mapping and also use GRN to determine trans-eQTLs. Computer simulations demonstrate that our SSEMQ significantly outperforms nine existing eQTL mapping methods. SSEMQ is further employed to analyze two real datasets of human breast and whole blood tissues, yielding a number of cis- and trans-eQTLs. AVAILABILITY: R package ssemQr is available at https://github.com/Ivis4ml/ssemQr.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Maud Fagny; Joseph N Paulson; Marieke L Kuijjer; Abhijeet R Sonawane; Cho-Yi Chen; Camila M Lopes-Ramos; Kimberly Glass; John Quackenbush; John Platig Journal: Proc Natl Acad Sci U S A Date: 2017-08-29 Impact factor: 11.205
Authors: A Nissan; D Jager; M Roystacher; D Prus; T Peretz; I Eisenberg; H R Freund; M Scanlan; G Ritter; L J Old; S Mitrani-Rosenbaum Journal: Br J Cancer Date: 2006-03-13 Impact factor: 7.640
Authors: Andrea Mathe; Michelle Wong-Brown; Brianna Morten; John F Forbes; Stephen G Braye; Kelly A Avery-Kiejda; Rodney J Scott Journal: Sci Rep Date: 2015-11-05 Impact factor: 4.379