Literature DB >> 21686315

An Efficient Stochastic Search for Bayesian Variable Selection with High-Dimensional Correlated Predictors.

Deukwoo Kwon1, Maria Teresa Landi, Marina Vannucci, Haleem J Issaq, Darue Prieto, Ruth M Pfeiffer.   

Abstract

We present a Bayesian variable selection method for the setting in which the number of independent variables or predictors in a particular dataset is much larger than the available sample size. While most existing methods allow some degree of correlations among predictors but do not consider these correlations for variable selection, our method accounts for correlations among the predictors in variable selection. Our correlation-based stochastic search (CBS) method, the hybrid-CBS algorithm, extends a popular search algorithm for high-dimensional data, the stochastic search variable selection (SSVS) method. Similar to SSVS, we search the space of all possible models using variable addition, deletion or swap moves. However, our moves through the model space are designed to accommodate correlations among the variables. We describe our approach for continuous, binary, ordinal, and count outcome data. The impact of choices of prior distributions and hyper-parameters is assessed in simulation studies. We also examined performance of variable selection and prediction as the correlation structure of the predictors varies. We found that the hybrid-CBS resulted in lower prediction errors and better identified the true outcome associated predictors than SSVS when predictors were moderately to highly correlated. We illustrate the method on data from a proteomic profiling study of melanoma, a skin cancer.

Entities:  

Year:  2011        PMID: 21686315      PMCID: PMC3113479          DOI: 10.1016/j.csda.2011.04.019

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  7 in total

1.  Bayesian variable selection in multinomial probit models to identify molecular signatures of disease stage.

Authors:  Naijun Sha; Marina Vannucci; Mahlet G Tadesse; Philip J Brown; Ilaria Dragoni; Nick Davies; Tracy C Roberts; Andrea Contestabile; Mike Salmon; Chris Buckley; Francesco Falciani
Journal:  Biometrics       Date:  2004-09       Impact factor: 2.571

2.  A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics.

Authors:  Juliane Schäfer; Korbinian Strimmer
Journal:  Stat Appl Genet Mol Biol       Date:  2005-11-14

3.  Bayesian variable selection for the analysis of microarray data with censored outcomes.

Authors:  Naijun Sha; Mahlet G Tadesse; Marina Vannucci
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

4.  MC1R, ASIP, and DNA repair in sporadic and familial melanoma in a Mediterranean population.

Authors:  Maria Teresa Landi; Peter A Kanetsky; Shirley Tsang; Bert Gold; David Munroe; Timothy Rebbeck; Jennifer Swoyer; Monica Ter-Minassian; Mohammad Hedayati; Lawrence Grossman; Alisa M Goldstein; Donato Calista; Ruth M Pfeiffer
Journal:  J Natl Cancer Inst       Date:  2005-07-06       Impact factor: 13.506

5.  A novel wavelet-based thresholding method for the pre-processing of mass spectrometry data that accounts for heterogeneous noise.

Authors:  Deukwoo Kwon; Marina Vannucci; Joon Jin Song; Jaesik Jeong; Ruth M Pfeiffer
Journal:  Proteomics       Date:  2008-08       Impact factor: 3.984

6.  Identifying biomarkers from mass spectrometry data with ordinal outcome.

Authors:  Deukwoo Kwon; Mahlet G Tadesse; Naijun Sha; Ruth M Pfeiffer; Marina Vannucci
Journal:  Cancer Inform       Date:  2007-02-05

7.  Gene selection in arthritis classification with large-scale microarray expression profiles.

Authors:  Naijun Sha; Marina Vannucci; Philip J Brown; Michael K Trower; Gillian Amphlett; Francesco Falciani
Journal:  Comp Funct Genomics       Date:  2003
  7 in total
  5 in total

1.  Association between prenatal exposure to multiple insecticides and child body weight and body composition in the VHEMBE South African birth cohort.

Authors:  Eric Coker; Jonathan Chevrier; Stephen Rauch; Asa Bradman; Muvhulawa Obida; Madelein Crause; Riana Bornman; Brenda Eskenazi
Journal:  Environ Int       Date:  2018-02-06       Impact factor: 9.621

2.  GPU-powered Shotgun Stochastic Search for Dirichlet process mixtures of Gaussian Graphical Models.

Authors:  Chiranjit Mukherjee; Abel Rodriguez
Journal:  J Comput Graph Stat       Date:  2016-08-05       Impact factor: 2.302

3.  INTEGRATIVE STATISTICAL METHODS FOR EXPOSURE MIXTURES AND HEALTH.

Authors:  Brian J Reich; Yawen Guan; Denis Fourches; Joshua L Warren; Stefanie E Sarnat; Howard H Chang
Journal:  Ann Appl Stat       Date:  2020-12-19       Impact factor: 2.083

4.  GUESS-ing polygenic associations with multiple phenotypes using a GPU-based evolutionary stochastic search algorithm.

Authors:  Leonardo Bottolo; Marc Chadeau-Hyam; David I Hastie; Tanja Zeller; Benoit Liquet; Paul Newcombe; Loic Yengo; Philipp S Wild; Arne Schillert; Andreas Ziegler; Sune F Nielsen; Adam S Butterworth; Weang Kee Ho; Raphaële Castagné; Thomas Munzel; David Tregouet; Mario Falchi; François Cambien; Børge G Nordestgaard; Fredéric Fumeron; Anne Tybjærg-Hansen; Philippe Froguel; John Danesh; Enrico Petretto; Stefan Blankenberg; Laurence Tiret; Sylvia Richardson
Journal:  PLoS Genet       Date:  2013-08-08       Impact factor: 5.917

5.  Obstructive sleep apnoea, positive airway pressure treatment and postoperative delirium: protocol for a retrospective observational study.

Authors:  Christopher R King; Krisztina E Escallier; Yo-El S Ju; Nan Lin; Ben Julian Palanca; Sherry Lynn McKinnon; Michael Simon Avidan
Journal:  BMJ Open       Date:  2019-08-26       Impact factor: 2.692

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.