Literature DB >> 24436727

BAYESIAN SHRINKAGE METHODS FOR PARTIALLY OBSERVED DATA WITH MANY PREDICTORS.

Philip S Boonstra1, Bhramar Mukherjee1, Jeremy Mg Taylor1.   

Abstract

Motivated by the increasing use of and rapid changes in array technologies, we consider the prediction problem of fitting a linear regression relating a continuous outcome Y to a large number of covariates X , eg measurements from current, state-of-the-art technology. For most of the samples, only the outcome Y and surrogate covariates, W , are available. These surrogates may be data from prior studies using older technologies. Owing to the dimension of the problem and the large fraction of missing information, a critical issue is appropriate shrinkage of model parameters for an optimal bias-variance tradeoff. We discuss a variety of fully Bayesian and Empirical Bayes algorithms which account for uncertainty in the missing data and adaptively shrink parameter estimates for superior prediction. These methods are evaluated via a comprehensive simulation study. In addition, we apply our methods to a lung cancer dataset, predicting survival time (Y) using qRT-PCR ( X ) and microarray ( W ) measurements.

Entities:  

Keywords:  High-dimensional data; Markov chain Monte Carlo; measurement error; missing data; shrinkage

Year:  2013        PMID: 24436727      PMCID: PMC3891514          DOI: 10.1214/13-AOAS668

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  7 in total

1.  Assessment and comparison of prognostic classification schemes for survival data.

Authors:  E Graf; C Schmoor; W Sauerbrei; M Schumacher
Journal:  Stat Med       Date:  1999 Sep 15-30       Impact factor: 2.373

2.  Empirical Bayes Gibbs sampling.

Authors:  G Casella
Journal:  Biostatistics       Date:  2001-12       Impact factor: 5.899

3.  Stochastic relaxation, gibbs distributions, and the bayesian restoration of images.

Authors:  S Geman; D Geman
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1984-06       Impact factor: 6.226

4.  Bayesian LASSO for quantitative trait loci mapping.

Authors:  Nengjun Yi; Shizhong Xu
Journal:  Genetics       Date:  2008-05-27       Impact factor: 4.562

5.  Incorporating auxiliary information for improved prediction in high-dimensional datasets: an ensemble of shrinkage approaches.

Authors:  Philip S Boonstra; Jeremy M G Taylor; Bhramar Mukherjee
Journal:  Biostatistics       Date:  2012-10-19       Impact factor: 5.899

6.  Development and validation of a quantitative real-time polymerase chain reaction classifier for lung cancer prognosis.

Authors:  Guoan Chen; Sinae Kim; Jeremy M G Taylor; Zhuwen Wang; Oliver Lee; Nithya Ramnath; Rishindra M Reddy; Jules Lin; Andrew C Chang; Mark B Orringer; David G Beer
Journal:  J Thorac Oncol       Date:  2011-09       Impact factor: 15.609

7.  Covariance-regularized regression and classification for high-dimensional problems.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2009-02-20       Impact factor: 4.488

  7 in total
  2 in total

1.  Data-adaptive Shrinkage via the Hyperpenalized EM Algorithm.

Authors:  Philip S Boonstra; Jeremy M G Taylor; Bhramar Mukherjee
Journal:  Stat Biosci       Date:  2015-06-03

2.  Evaluating biomarkers for treatment selection from reproducibility studies.

Authors:  Xiao Song; Kevin K Dobbin
Journal:  Biostatistics       Date:  2022-01-13       Impact factor: 5.899

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.