Literature DB >> 29738602

Scalable Bayesian variable selection for structured high-dimensional data.

Changgee Chang1, Suprateek Kundu2, Qi Long1.   

Abstract

Variable selection for structured covariates lying on an underlying known graph is a problem motivated by practical applications, and has been a topic of increasing interest. However, most of the existing methods may not be scalable to high-dimensional settings involving tens of thousands of variables lying on known pathways such as the case in genomics studies. We propose an adaptive Bayesian shrinkage approach which incorporates prior network information by smoothing the shrinkage parameters for connected variables in the graph, so that the corresponding coefficients have a similar degree of shrinkage. We fit our model via a computationally efficient expectation maximization algorithm which scalable to high-dimensional settings ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>p</mml:mi> <mml:mo>∼</mml:mo> <mml:mn>100</mml:mn> <mml:mo>,</mml:mo> <mml:mn>000</mml:mn></mml:math> ). Theoretical properties for fixed as well as increasing dimensions are established, even when the number of variables increases faster than the sample size. We demonstrate the advantages of our approach in terms of variable selection, prediction, and computational scalability via a simulation study, and apply the method to a cancer genomics study.
© 2018, The International Biometric Society.

Entities:  

Keywords:  Adaptive Bayesian shrinkage; EM algorithm; Oracle property; Selection consistency; Structured high-dimensional variable selection

Mesh:

Year:  2018        PMID: 29738602      PMCID: PMC6222001          DOI: 10.1111/biom.12882

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  16 in total

1.  Variable selection for discriminant analysis with Markov random field priors for the analysis of microarray data.

Authors:  Francesco C Stingo; Marina Vannucci
Journal:  Bioinformatics       Date:  2010-12-14       Impact factor: 6.937

2.  Identification of RANBP16 and RANBP17 as novel interaction partners for the bHLH transcription factor E12.

Authors:  Jun-Ho Lee; Shengli Zhou; Cynthia M Smas
Journal:  J Cell Biochem       Date:  2010-09-01       Impact factor: 4.429

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

4.  On path restoration for censored outcomes.

Authors:  Brent A Johnson; Qi Long; Matthias Chung
Journal:  Biometrics       Date:  2011-04-02       Impact factor: 2.571

5.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.

Authors:  Roel G W Verhaak; Katherine A Hoadley; Elizabeth Purdom; Victoria Wang; Yuan Qi; Matthew D Wilkerson; C Ryan Miller; Li Ding; Todd Golub; Jill P Mesirov; Gabriele Alexe; Michael Lawrence; Michael O'Kelly; Pablo Tamayo; Barbara A Weir; Stacey Gabriel; Wendy Winckler; Supriya Gupta; Lakshmi Jakkula; Heidi S Feiler; J Graeme Hodgson; C David James; Jann N Sarkaria; Cameron Brennan; Ari Kahn; Paul T Spellman; Richard K Wilson; Terence P Speed; Joe W Gray; Matthew Meyerson; Gad Getz; Charles M Perou; D Neil Hayes
Journal:  Cancer Cell       Date:  2010-01-19       Impact factor: 31.743

6.  GENERALIZED DOUBLE PARETO SHRINKAGE.

Authors:  Artin Armagan; David B Dunson; Jaeyong Lee
Journal:  Stat Sin       Date:  2013-01-01       Impact factor: 1.261

7.  INCORPORATING BIOLOGICAL INFORMATION INTO LINEAR MODELS: A BAYESIAN APPROACH TO THE SELECTION OF PATHWAYS AND GENES.

Authors:  Francesco C Stingo; Yian A Chen; Mahlet G Tadesse; Marina Vannucci
Journal:  Ann Appl Stat       Date:  2011-09-01       Impact factor: 2.083

8.  Comprehensive association analysis of the vitamin D pathway genes, VDR, CYP27B1, and CYP24A1, in prostate cancer.

Authors:  Crystal N Holick; Janet L Stanford; Erika M Kwon; Elaine A Ostrander; Sergey Nejentsev; Ulrike Peters
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2007-10       Impact factor: 4.254

9.  Disturbing miR-182 and -381 inhibits BRD7 transcription and glioma growth by directly targeting LRRC4.

Authors:  Hailin Tang; Zeyou Wang; Qing Liu; Xiaoping Liu; Minghua Wu; Guiyuan Li
Journal:  PLoS One       Date:  2014-01-03       Impact factor: 3.240

10.  Overlapping Group Logistic Regression with Applications to Genetic Pathway Selection.

Authors:  Yaohui Zeng; Patrick Breheny
Journal:  Cancer Inform       Date:  2016-09-15
View more
  8 in total

1.  Integrative analysis of genetical genomics data incorporating network structures.

Authors:  Bin Gao; Xu Liu; Hongzhe Li; Yuehua Cui
Journal:  Biometrics       Date:  2019-04-29       Impact factor: 2.571

2.  Bayesian generalized biclustering analysis via adaptive structured shrinkage.

Authors:  Ziyi Li; Changgee Chang; Suprateek Kundu; Qi Long
Journal:  Biostatistics       Date:  2020-07-01       Impact factor: 5.899

3.  Knowledge-Guided Biclustering via Sparse Variational EM Algorithm.

Authors:  Changgee Chang; Jihwan Oh; Eun Jeong Min; Qi Long
Journal:  10th IEEE Int Conf Big Knowl (2019)       Date:  2019-12-30

4.  GRIA: Graphical Regularization for Integrative Analysis.

Authors:  Changgee Chang; Jihwan Oh; Qi Long
Journal:  Proc SIAM Int Conf Data Min       Date:  2020

5.  Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features.

Authors:  Wenli Sun; Changgee Chang; Qi Long
Journal:  Proc IEEE Int Conf Big Data       Date:  2020-02-24

6.  Graph-guided Bayesian SVM with Adaptive Structured Shrinkage Prior for High-dimensional Data.

Authors:  Wenli Sun; Changgee Chang; Qi Long
Journal:  Proc IEEE Int Conf Big Data       Date:  2021-12

7.  Integrative Bayesian analysis of brain functional networks incorporating anatomical knowledge.

Authors:  Ixavier A Higgins; Suprateek Kundu; Ying Guo
Journal:  Neuroimage       Date:  2018-07-11       Impact factor: 6.556

8.  Bayesian sparse heritability analysis with high-dimensional neuroimaging phenotypes.

Authors:  Yize Zhao; Tengfei Li; Hongtu Zhu
Journal:  Biostatistics       Date:  2022-04-13       Impact factor: 5.279

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.