Literature DB >> 23482517

Consistent high-dimensional Bayesian variable selection via penalized credible regions.

Howard D Bondell1, Brian J Reich.   

Abstract

For high-dimensional data, particularly when the number of predictors greatly exceeds the sample size, selection of relevant predictors for regression is a challenging problem. Methods such as sure screening, forward selection, or penalized regressions are commonly used. Bayesian variable selection methods place prior distributions on the parameters along with a prior over model space, or equivalently, a mixture prior on the parameters having mass at zero. Since exhaustive enumeration is not feasible, posterior model probabilities are often obtained via long MCMC runs. The chosen model can depend heavily on various choices for priors and also posterior thresholds. Alternatively, we propose a conjugate prior only on the full model parameters and use sparse solutions within posterior credible regions to perform selection. These posterior credible regions often have closed-form representations, and it is shown that these sparse solutions can be computed via existing algorithms. The approach is shown to outperform common methods in the high-dimensional setting, particularly under correlation. By searching for a sparse solution within a joint credible region, consistent model selection is established. Furthermore, it is shown that, under certain conditions, the use of marginal credible intervals can give consistent selection up to the case where the dimension grows exponentially in the sample size. The proposed approach successfully accomplishes variable selection in the high-dimensional setting, while avoiding pitfalls that plague typical Bayesian variable selection methods.

Entities:  

Keywords:  Bayesian variable selection; Consistency; Credible region; LASSO; Stochastic search

Year:  2012        PMID: 23482517      PMCID: PMC3587767          DOI: 10.1080/01621459.2012.716344

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  5 in total

1.  Fixed and random effects selection in linear and logistic models.

Authors:  Satkartar K Kinney; David B Dunson
Journal:  Biometrics       Date:  2007-04-02       Impact factor: 2.571

2.  Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Authors:  Howard D Bondell; Brian J Reich
Journal:  Biometrics       Date:  2007-06-30       Impact factor: 2.571

3.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

4.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

5.  Combined expression trait correlations and expression quantitative trait locus mapping.

Authors:  Hong Lan; Meng Chen; Jessica B Flowers; Brian S Yandell; Donnie S Stapleton; Christine M Mata; Eric Ton-Keen Mui; Matthew T Flowers; Kathryn L Schueler; Kenneth F Manly; Robert W Williams; Christina Kendziorski; Alan D Attie
Journal:  PLoS Genet       Date:  2006-01-20       Impact factor: 5.917

  5 in total
  9 in total

1.  Gene Network Reconstruction using Global-Local Shrinkage Priors.

Authors:  Gwenaël G R Leday; Mathisca C M de Gunst; Gino B Kpogbezan; Aad W van der Vaart; Wessel N van Wieringen; Mark A van de Wiel
Journal:  Ann Appl Stat       Date:  2017-03       Impact factor: 2.083

2.  Endogeneity in High Dimensions.

Authors:  Jianqing Fan; Yuan Liao
Journal:  Ann Stat       Date:  2014-06-01       Impact factor: 4.028

3.  Bayesian Factor Analysis for Inference on Interactions.

Authors:  Federico Ferrari; David B Dunson
Journal:  J Am Stat Assoc       Date:  2020-04-20       Impact factor: 5.033

4.  Flexible co-data learning for high-dimensional prediction.

Authors:  Mirrelijn M van Nee; Lodewyk F A Wessels; Mark A van de Wiel
Journal:  Stat Med       Date:  2021-08-26       Impact factor: 2.497

5.  Scalable Bayesian Variable Selection Using Nonlocal Prior Densities in Ultrahigh-dimensional Settings.

Authors:  Minsuk Shin; Anirban Bhattacharya; Valen E Johnson
Journal:  Stat Sin       Date:  2018-04       Impact factor: 1.261

6.  Bayesian sparse multiple regression for simultaneous rank reduction and variable selection.

Authors:  Antik Chakraborty; Anirban Bhattacharya; Bani K Mallick
Journal:  Biometrika       Date:  2019-11-23       Impact factor: 2.445

7.  Bayesian variable selection with graphical structure learning: Applications in integrative genomics.

Authors:  Suprateek Kundu; Yichen Cheng; Minsuk Shin; Ganiraju Manyam; Bani K Mallick; Veerabhadran Baladandayuthapani
Journal:  PLoS One       Date:  2018-07-30       Impact factor: 3.240

8.  High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking.

Authors:  Fan Wang; Sach Mukherjee; Sylvia Richardson; Steven M Hill
Journal:  Stat Comput       Date:  2019-12-19       Impact factor: 2.559

9.  Bayesian differential analysis of gene regulatory networks exploiting genetic perturbations.

Authors:  Yan Li; Dayou Liu; Tengfei Li; Yungang Zhu
Journal:  BMC Bioinformatics       Date:  2020-01-09       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.