| Literature DB >> 33286578 |
Abstract
High-dimensional variable selection is an important research topic in modern statistics. While methods using nonlocal priors have been thoroughly studied for variable selection in linear regression, the crucial high-dimensional model selection properties for nonlocal priors in generalized linear models have not been investigated. In this paper, we consider a hierarchical generalized linear regression model with the product moment nonlocal prior over coefficients and examine its properties. Under standard regularity assumptions, we establish strong model selection consistency in a high-dimensional setting, where the number of covariates is allowed to increase at a sub-exponential rate with the sample size. The Laplace approximation is implemented for computing the posterior probabilities and the shotgun stochastic search procedure is suggested for exploring the posterior space. The proposed method is validated through simulation studies and illustrated by a real data example on functional activity analysis in fMRI study for predicting Parkinson's disease.Entities:
Keywords: high-dimensional; nonlocal prior; strong selection consistency
Year: 2020 PMID: 33286578 PMCID: PMC7517378 DOI: 10.3390/e22080807
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
The summary statistics for Design 1 (Dense model design) are represented for each setting of the true regression coefficients under the first isotropic covariance case. Different setting means different choice of the true coefficient .
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 1 | 1 | 1 | 1 | 0.02 |
| Spike and Slab | 1 | 0.38 | 1 | 0.60 | 0.21 |
| Lasso | 0.67 | 1 | 0.96 | 0.80 | 0.17 |
| EBLasso | 1 | 0.38 | 1 | 0.60 | 0.22 |
| SCAD | 0.57 | 1 | 0.93 | 0.73 | 0.14 |
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 0.73 | 1 | 0.97 | 0.84 | 0.18 |
| Spike and Slab | 1 | 0.13 | 1 | 0.34 | 0.23 |
| Lasso | 0.54 | 0.88 | 0.93 | 0.65 | 0.15 |
| EBLasso | 1 | 0.63 | 1 | 0.78 | 0.22 |
| SCAD | 0.47 | 0.88 | 0.91 | 0.60 | 0.13 |
The summary statistics for Design 1 (Dense model design) are represented for each setting of the true regression coefficients under the second autoregressive covariance case. Different setting means different choice of the true coefficient .
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 0.89 | 1 | 0.99 | 0.94 | 0.13 |
| Spike and Slab | 0.71 | 0.63 | 0.98 | 0.64 | 0.20 |
| Lasso | 0.70 | 0.88 | 0.98 | 0.76 | 0.16 |
| EBLasso | 1 | 0.50 | 1 | 0.69 | 0.23 |
| SCAD | 0.67 | 0.75 | 0.97 | 0.68 | 0.17 |
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 0.88 | 0.88 | 0.99 | 0.86 | 0.14 |
| Spike and Slab | 0.83 | 0.63 | 0.99 | 0.70 | 0.13 |
| Lasso | 0.63 | 0.88 | 0.96 | 0.72 | 0.14 |
| EBLasso | 1 | 0.38 | 1 | 0.60 | 0.22 |
| SCAD | 0.47 | 0.88 | 0.91 | 0.60 | 0.13 |
The summary statistics for Design 2 (High-dimensional design) are represented for each setting of the true regression coefficients under the first isotropic covariance case. Different setting means different choice of the true coefficient .
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 1 | 1 | 1 | 1 | 0.08 |
| Spike and Slab | 0.75 | 0.75 | 0.99 | 0.74 | 0.09 |
| Lasso | 0.80 | 1 | 0.99 | 0.89 | 0.14 |
| EBLasso | 1 | 0.75 | 1 | 0.86 | 0.21 |
| SCAD | 0.67 | 1 | 0.99 | 0.81 | 0.12 |
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 1 | 1 | 1 | 1 | 0.10 |
| Spike and Slab | 0.75 | 0.75 | 0.99 | 0.74 | 0.11 |
| Lasso | 0.67 | 1 | 0.99 | 0.81 | 0.14 |
| EBLasso | 1 | 0.75 | 1 | 0.86 | 0.23 |
| SCAD | 0.44 | 1 | 0.97 | 0.66 | 0.12 |
The summary statistics for Design 2 (High-dimensional design) are represented for each setting of the true regression coefficients under the second autoregressive covariance case. Different setting means different choice of the true coefficient .
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 1 | 0.75 | 1 | 0.86 | 0.11 |
| Spike and Slab | 1 | 0.50 | 1 | 0.71 | 0.10 |
| Lasso | 0.57 | 1 | 0.98 | 0.75 | 0.10 |
| EBLasso | 1 | 0.50 | 1 | 0.70 | 0.18 |
| SCAD | 0.44 | 1 | 0.97 | 0.66 | 0.12 |
|
| |||||
|
|
|
|
|
| |
| Nonlocal | 1 | 0.75 | 1 | 0.86 | 0.15 |
| Spike and Slab | 0.50 | 0.50 | 0.99 | 0.49 | 0.14 |
| Lasso | 0.44 | 1 | 0.97 | 0.66 | 0.13 |
| EBLasso | 1 | 0.50 | 1 | 0.70 | 0.21 |
| SCAD | 0.40 | 1 | 0.96 | 0.62 | 0.14 |
Figure 1Histograms of selected radiomic features for PD and HC subjects with darker color representing overlapping values. Purple: PD group; Green: HC group.
The summary statistics for prediction performance on the testing set for all methods.
| Precision | Sensitivity | Specificity | MCC | MSPE | |
|---|---|---|---|---|---|
| Nonlocal | 0.77 | 0.83 | 0.73 | 0.56 | 0.21 |
| Spike and Slab | 0.53 | 0.75 | 0.27 | 0.40 | 0.29 |
| Lasso | 0.67 | 1 | 0.45 | 0.55 | 0.18 |
| EBLasso | 0.57 | 1 | 0.18 | 0.32 | 0.28 |
| SCAD | 0.58 | 1 | 0.37 | 0.41 | 0.19 |