Literature DB >> 22547833

High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis.

Z John Daye1, Jinbo Chen, Hongzhe Li.   

Abstract

We consider the problem of high-dimensional regression under non-constant error variances. Despite being a common phenomenon in biological applications, heteroscedasticity has, so far, been largely ignored in high-dimensional analysis of genomic data sets. We propose a new methodology that allows non-constant error variances for high-dimensional estimation and model selection. Our method incorporates heteroscedasticity by simultaneously modeling both the mean and variance components via a novel doubly regularized approach. Extensive Monte Carlo simulations indicate that our proposed procedure can result in better estimation and variable selection than existing methods when heteroscedasticity arises from the presence of predictors explaining error variances and outliers. Further, we demonstrate the presence of heteroscedasticity in and apply our method to an expression quantitative trait loci (eQTLs) study of 112 yeast segregants. The new procedure can automatically account for heteroscedasticity in identifying the eQTLs that are associated with gene expression variations and lead to smaller prediction errors. These results demonstrate the importance of considering heteroscedasticity in eQTL data analysis.

Entities:  

Year:  2011        PMID: 22547833      PMCID: PMC3218221          DOI: 10.1111/j.1541-0420.2011.01652.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  7 in total

1.  Predicting survival from microarray data--a comparative study.

Authors:  H M Bøvelstad; S Nygård; H L Størvold; M Aldrin; Ø Borgan; A Frigessi; O C Lingjaerde
Journal:  Bioinformatics       Date:  2007-06-06       Impact factor: 6.937

2.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

3.  The landscape of genetic complexity across 5,700 gene expression traits in yeast.

Authors:  Rachel B Brem; Leonid Kruglyak
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-19       Impact factor: 11.205

4.  A system for enhancing genome-wide coexpression dynamics study.

Authors:  Ker-Chau Li; Ching-Ti Liu; Wei Sun; Shinsheng Yuan; Tianwei Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-18       Impact factor: 11.205

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

6.  Simultaneous estimation and variable selection in median regression using Lasso-type penalty.

Authors:  Jinfeng Xu; Zhiliang Ying
Journal:  Ann Inst Stat Math       Date:  2010-06-01       Impact factor: 1.267

7.  Learning a prior on regulatory potential from eQTL data.

Authors:  Su-In Lee; Aimée M Dudley; David Drubin; Pamela A Silver; Nevan J Krogan; Dana Pe'er; Daphne Koller
Journal:  PLoS Genet       Date:  2009-01-30       Impact factor: 5.917

  7 in total
  9 in total

1.  Additive, epistatic, and environmental effects through the lens of expression variability QTL in a twin cohort.

Authors:  Gang Wang; Ence Yang; Candice L Brinkmeyer-Langford; James J Cai
Journal:  Genetics       Date:  2013-12-02       Impact factor: 4.562

2.  Epistasis and destabilizing mutations shape gene expression variability in humans via distinct modes of action.

Authors:  Ence Yang; Gang Wang; Jizhou Yang; Beiyan Zhou; Yanan Tian; James J Cai
Journal:  Hum Mol Genet       Date:  2016-11-15       Impact factor: 6.150

3.  Genetic variants contribute to gene expression variability in humans.

Authors:  Amanda M Hulse; James J Cai
Journal:  Genetics       Date:  2012-11-12       Impact factor: 4.562

4.  Robust Gaussian graphical modeling via l1 penalization.

Authors:  Hokeun Sun; Hongzhe Li
Journal:  Biometrics       Date:  2012-09-28       Impact factor: 2.571

5.  Gene expression variability and the analysis of large-scale RNA-seq studies with the MDSeq.

Authors:  Di Ran; Z John Daye
Journal:  Nucleic Acids Res       Date:  2017-07-27       Impact factor: 16.971

6.  Bayesian Hidden Markov Models for Dependent Large-Scale Multiple Testing.

Authors:  Xia Wang; Ali Shojaie; Jian Zou
Journal:  Comput Stat Data Anal       Date:  2019-01-29       Impact factor: 1.681

7.  Statistical analysis reveals co-expression patterns of many pairs of genes in yeast are jointly regulated by interacting loci.

Authors:  Lin Wang; Wei Zheng; Hongyu Zhao; Minghua Deng
Journal:  PLoS Genet       Date:  2013-03-28       Impact factor: 5.917

8.  Applying genome-wide gene-based expression quantitative trait locus mapping to study population ancestry and pharmacogenetics.

Authors:  Hsin-Chou Yang; Chien-Wei Lin; Chia-Wei Chen; James J Chen
Journal:  BMC Genomics       Date:  2014-04-29       Impact factor: 3.969

9.  Set-Based Rare Variant Expression Quantitative Trait Loci in Blood and Brain from Alzheimer Disease Study Participants.

Authors:  Devanshi Patel; Xiaoling Zhang; John J Farrell; Kathryn L Lunetta; Lindsay A Farrer
Journal:  Genes (Basel)       Date:  2021-03-15       Impact factor: 4.141

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.