Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Scalable Bayesian variable selection for structured high-dimensional data.

Literature DB >> 29738602

Scalable Bayesian variable selection for structured high-dimensional data.

Changgee Chang¹, Suprateek Kundu², Qi Long¹.

Abstract

Variable selection for structured covariates lying on an underlying known graph is a problem motivated by practical applications, and has been a topic of increasing interest. However, most of the existing methods may not be scalable to high-dimensional settings involving tens of thousands of variables lying on known pathways such as the case in genomics studies. We propose an adaptive Bayesian shrinkage approach which incorporates prior network information by smoothing the shrinkage parameters for connected variables in the graph, so that the corresponding coefficients have a similar degree of shrinkage. We fit our model via a computationally efficient expectation maximization algorithm which scalable to high-dimensional settings ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>p</mml:mi> <mml:mo>∼</mml:mo> <mml:mn>100</mml:mn> <mml:mo>,</mml:mo> <mml:mn>000</mml:mn></mml:math> ). Theoretical properties for fixed as well as increasing dimensions are established, even when the number of variables increases faster than the sample size. We demonstrate the advantages of our approach in terms of variable selection, prediction, and computational scalability via a simulation study, and apply the method to a cancer genomics study.

Entities: Chemical Disease Gene Species

Keywords: Adaptive Bayesian shrinkage; EM algorithm; Oracle property; Selection consistency; Structured high-dimensional variable selection

Mesh：

Year: 2018 PMID： 29738602 PMCID： PMC6222001 DOI： 10.1111/biom.12882

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

16 in total

1. Variable selection for discriminant analysis with Markov random field priors for the analysis of microarray data.

Authors: Francesco C Stingo; Marina Vannucci
Journal: Bioinformatics Date: 2010-12-14 Impact factor: 6.937

2. Identification of RANBP16 and RANBP17 as novel interaction partners for the bHLH transcription factor E12.

Authors: Jun-Ho Lee; Shengli Zhou; Cynthia M Smas
Journal: J Cell Biochem Date: 2010-09-01 Impact factor: 4.429

3. Network-constrained regularization and variable selection for analysis of genomic data.

Authors: Caiyan Li; Hongzhe Li
Journal: Bioinformatics Date: 2008-03-01 Impact factor: 6.937

4. On path restoration for censored outcomes.

Authors: Brent A Johnson; Qi Long; Matthias Chung
Journal: Biometrics Date: 2011-04-02 Impact factor: 2.571

5. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.

Authors: Roel G W Verhaak; Katherine A Hoadley; Elizabeth Purdom; Victoria Wang; Yuan Qi; Matthew D Wilkerson; C Ryan Miller; Li Ding; Todd Golub; Jill P Mesirov; Gabriele Alexe; Michael Lawrence; Michael O'Kelly; Pablo Tamayo; Barbara A Weir; Stacey Gabriel; Wendy Winckler; Supriya Gupta; Lakshmi Jakkula; Heidi S Feiler; J Graeme Hodgson; C David James; Jann N Sarkaria; Cameron Brennan; Ari Kahn; Paul T Spellman; Richard K Wilson; Terence P Speed; Joe W Gray; Matthew Meyerson; Gad Getz; Charles M Perou; D Neil Hayes
Journal: Cancer Cell Date: 2010-01-19 Impact factor: 31.743

6. GENERALIZED DOUBLE PARETO SHRINKAGE.

Authors: Artin Armagan; David B Dunson; Jaeyong Lee
Journal: Stat Sin Date: 2013-01-01 Impact factor: 1.261

7. INCORPORATING BIOLOGICAL INFORMATION INTO LINEAR MODELS: A BAYESIAN APPROACH TO THE SELECTION OF PATHWAYS AND GENES.

Authors: Francesco C Stingo; Yian A Chen; Mahlet G Tadesse; Marina Vannucci
Journal: Ann Appl Stat Date: 2011-09-01 Impact factor: 2.083

8. Comprehensive association analysis of the vitamin D pathway genes, VDR, CYP27B1, and CYP24A1, in prostate cancer.

Authors: Crystal N Holick; Janet L Stanford; Erika M Kwon; Elaine A Ostrander; Sergey Nejentsev; Ulrike Peters
Journal: Cancer Epidemiol Biomarkers Prev Date: 2007-10 Impact factor: 4.254

9. Disturbing miR-182 and -381 inhibits BRD7 transcription and glioma growth by directly targeting LRRC4.

Authors: Hailin Tang; Zeyou Wang; Qing Liu; Xiaoping Liu; Minghua Wu; Guiyuan Li
Journal: PLoS One Date: 2014-01-03 Impact factor: 3.240

10. Overlapping Group Logistic Regression with Applications to Genetic Pathway Selection.

Authors: Yaohui Zeng; Patrick Breheny
Journal: Cancer Inform Date: 2016-09-15

8 in total

1. Integrative analysis of genetical genomics data incorporating network structures.

Authors: Bin Gao; Xu Liu; Hongzhe Li; Yuehua Cui
Journal: Biometrics Date: 2019-04-29 Impact factor: 2.571

2. Bayesian generalized biclustering analysis via adaptive structured shrinkage.

Authors: Ziyi Li; Changgee Chang; Suprateek Kundu; Qi Long
Journal: Biostatistics Date: 2020-07-01 Impact factor: 5.899

3. Knowledge-Guided Biclustering via Sparse Variational EM Algorithm.

Authors: Changgee Chang; Jihwan Oh; Eun Jeong Min; Qi Long
Journal: 10th IEEE Int Conf Big Knowl (2019) Date: 2019-12-30

4. GRIA: Graphical Regularization for Integrative Analysis.

Authors: Changgee Chang; Jihwan Oh; Qi Long
Journal: Proc SIAM Int Conf Data Min Date: 2020

5. Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features.

Authors: Wenli Sun; Changgee Chang; Qi Long
Journal: Proc IEEE Int Conf Big Data Date: 2020-02-24