| Literature DB >> 35751586 |
Michael Komodromos1, Eric O Aboagye2, Marina Evangelou1, Sarah Filippi1, Kolyan Ray1.
Abstract
MOTIVATION: Few Bayesian methods for analyzing high-dimensional sparse survival data provide scalable variable selection, effect estimation and uncertainty quantification. Such methods often either sacrifice uncertainty quantification by computing maximum a posteriori estimates, or quantify the uncertainty at high (unscalable) computational expense.Entities:
Year: 2022 PMID: 35751586 PMCID: PMC9364383 DOI: 10.1093/bioinformatics/btac416
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Comparison of variational to MCMC posterior taking and , presented is the median and quantiles
| Setting |
| Method |
|
| TPR | FDR | AUC | Runtime |
|---|---|---|---|---|---|---|---|---|
|
| 0.25 | SVB | 0.368 (0.21, 0.70) | 1.000 (0.52, 1.86) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 18.5 s (13.5 s , 25.6 s) |
| MCMC | 0.412 (0.20, 0.75) | 1.017 (0.48, 2.01) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 1 h 24 m (1 h 7 m , 1 h 50 m) | ||
| 0.4 | SVB | 0.428 (0.23, 0.89) | 1.138 (0.63, 2.45) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (0.95, 1.00) | 21.9 s (14.5 s , 30.5 s) | |
| MCMC | 0.506 (0.26, 0.98) | 1.300 (0.69, 2.74) | 1.000 (0.80, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 1 h 28 m (1 h 25 m , 1 h 30 m) | ||
|
| 0.25 | SVB | 0.376 (0.20, 0.73) | 1.031 (0.58, 2.07) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 18.9 s (14.4 s , 25.4 s) |
| MCMC | 0.405 (0.21, 0.81) | 1.059 (0.58, 2.18) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 1 h 14 m (1 h 6 m , 1 h 17 m) | ||
| 0.4 | SVB | 0.472 (0.23, 1.08) | 1.176 (0.61, 2.96) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (0.95, 1.00) | 24.0 s (17.3 s , 33.1 s) | |
| MCMC | 0.520 (0.25, 1.08) | 1.319 (0.62, 2.91) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 1 h 38 m (1 h 25 m , 2 h 4 m) | ||
|
| 0.25 | SVB | 0.392 (0.18, 1.40) | 1.079 (0.53, 3.28) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.09) | 1.000 (0.95, 1.00) | 29.2 s (16.9 s , 44.9 s) |
| MCMC | 0.418 (0.21, 1.01) | 1.092 (0.54, 2.58) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | 1 h 45 m (1 h 24 m , 1 h 49 m) | ||
| 0.4 | SVB | 0.470 (0.24, 1.57) | 1.263 (0.63, 4.16) | 1.000 (0.80, 1.00) | 0.000 (0.00, 0.10) | 1.000 (0.95, 1.00) | 21.7 s (13.7 s , 33.2 s) | |
| MCMC | 0.508 (0.23, 1.26) | 1.236 (0.61, 3.45) | 1.000 (0.80, 1.00) | 0.000 (0.00, 0.09) | 1.000 (1.00, 1.00) | 1 h 36 m (1 h 30 m , 1 h 45 m) | ||
|
| 0.25 | SVB | 0.393 (0.18, 1.12) | 1.067 (0.50, 2.54) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.10) | 1.000 (0.95, 1.00) | 17.0 s (9.2 s , 24.9 s) |
| MCMC | 0.382 (0.17, 0.95) | 1.007 (0.44, 2.47) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.10) | 1.000 (1.00, 1.00) | 1 h 5 m (1 h 3 m , 1 h 8 m) | ||
| 0.4 | SVB | 0.425 (0.18, 1.38) | 1.171 (0.50, 2.85) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.10) | 1.000 (0.95, 1.00) | 25.8 s (14.8 s , 39.9 s) | |
| MCMC | 0.486 (0.21, 1.13) | 1.158 (0.53, 3.17) | 1.000 (0.80, 1.00) | 0.000 (0.00, 0.00) | 1.000 (0.95, 1.00) | 1 h 38 m (1 h 14 m , 1 h 46 m) |
Note: Simulations were ran on Intel® Xeon® E5-2680 v4 2.40 GHz CPUs.
Coverage and set size for variational and MCMC posterior
| Set. |
| Meth. | Cov. | Set size | Cov. | Set size |
|---|---|---|---|---|---|---|
| 1 | 0.25 | SVB | 0.770 (0.202) | 0.320 (0.013) | 1.000 (0.000) | 0.000 (0.000) |
| MCMC | 0.928 (0.138) | 0.506 (0.039) | 1.000 (0.000) | 0.000 (0.000) | ||
| 0.4 | SVB | 0.774 (0.208) | 0.355 (0.021) | 1.000 (0.000) | 0.000 (0.000) | |
| MCMC | 0.914 (0.127) | 0.570 (0.054) | 1.000 (0.000) | 0.000 (0.000) | ||
| 2 | 0.25 | SVB | 0.703 (0.227) | 0.306 (0.028) | 1.000 (0.001) | 0.000 (0.000) |
| MCMC | 0.904 (0.161) | 0.522 (0.053) | 1.000 (0.000) | 0.000 (0.000) | ||
| 0.4 | SVB | 0.683 (0.262) | 0.333 (0.039) | 1.000 (0.001) | 0.000 (0.000) | |
| MCMC | 0.845 (0.218) | 0.567 (0.101) | 1.000 (0.000) | 0.000 (0.000) | ||
| 3 | 0.25 | SVB | 0.626 (0.288) | 0.251 (0.020) | 1.000 (0.000) | 0.000 (0.000) |
| MCMC | 0.903 (0.140) | 0.482 (0.047) | 1.000 (0.000) | 0.000 (0.000) | ||
| 0.4 | SVB | 0.619 (0.278) | 0.276 (0.028) | 1.000 (0.000) | 0.000 (0.000) | |
| MCMC | 0.873 (0.197) | 0.540 (0.078) | 1.000 (0.000) | 0.000 (0.000) | ||
| 4 | 0.25 | SVB | 0.672 (0.224) | 0.252 (0.021) | 1.000 (0.000) | 0.000 (0.000) |
| MCMC | 0.921 (0.144) | 0.483 (0.047) | 1.000 (0.000) | 0.000 (0.000) | ||
| 0.4 | SVB | 0.660 (0.249) | 0.277 (0.025) | 1.000 (0.001) | 0.000 (0.000) | |
| MCMC | 0.906 (0.156) | 0.547 (0.059) | 1.000 (0.000) | 0.000 (0.000) |
Note: Presented are means and std. dev.
Comparison of Bayesian variable selection methods, taking and , presented is the median and quantiles
| Setting |
| Method |
|
| TPR | FDR | AUC |
|---|---|---|---|---|---|---|---|
|
| 0.25 | SVB | 0.378 (0.26, 0.89) | 1.747 (1.16, 4.17) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) |
| BhGLM | 1.206 (0.79, 1.78) | 9.590 (7.22, 12.88) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
| BVSNLP | 0.456 (0.33, 0.96) | 2.007 (1.41, 4.57) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.03) | 1.000 (1.00, 1.00) | ||
| 0.4 | SVB | 0.449 (0.31, 0.99) | 2.056 (1.37, 4.87) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | |
| BhGLM | 0.807 (0.53, 1.35) | 6.458 (4.52, 9.31) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
| BVSNLP | 0.518 (0.35, 1.44) | 2.231 (1.52, 6.85) | 1.000 (0.96, 1.00) | 0.000 (0.00, 0.03) | 1.000 (0.99, 1.00) | ||
|
| 0.25 | SVB | 0.405 (0.29, 0.80) | 1.823 (1.28, 3.78) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) |
| BhGLM | 0.596 (0.45, 1.04) | 4.494 (3.51, 6.89) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
| BVSNLP | 0.475 (0.33, 0.90) | 2.130 (1.47, 4.01) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
| 0.4 | SVB | 0.491 (0.33, 1.05) | 2.208 (1.45, 5.03) | 1.000 (0.97, 1.00) | 0.000 (0.00, 0.03) | 1.000 (1.00, 1.00) | |
| BhGLM | 0.551 (0.44, 0.86) | 3.716 (2.98, 5.36) | 1.000 (0.97, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
| BVSNLP | 0.515 (0.37, 1.47) | 2.238 (1.54, 6.71) | 1.000 (1.00, 1.00) | 0.000 (0.00, 0.00) | 1.000 (1.00, 1.00) | ||
|
| 0.25 | SVB | 1.040 (0.30, 3.17) | 3.881 (1.36, 15.37) | 0.967 (0.83, 1.00) | 0.000 (0.00, 0.14) | 0.983 (0.92, 1.00) |
| BhGLM | 0.590 (0.36, 1.57) | 3.279 (2.23, 6.73) | 1.000 (0.93, 1.00) | 0.000 (0.00, 0.03) | 1.000 (0.97, 1.00) | ||
| BVSNLP | 3.107 (1.74, 9.73) | 12.262 (6.88, 47.67) | 0.967 (0.53, 1.00) | 0.000 (0.00, 0.53) | 0.983 (0.78, 1.00) | ||
| 0.4 | SVB | 1.379 (0.36, 3.47) | 5.728 (1.55, 17.47) | 0.933 (0.77, 1.00) | 0.033 (0.00, 0.13) | 0.967 (0.88, 1.00) | |
| BhGLM | 0.796 (0.41, 2.18) | 4.035 (2.25, 10.92) | 0.967 (0.87, 1.00) | 0.000 (0.00, 0.07) | 1.000 (0.95, 1.00) | ||
| BVSNLP | 3.867 (1.98, 11.44) | 15.874 (7.99, 51.07) | 0.967 (0.20, 1.00) | 0.033 (0.00, 0.69) | 0.983 (0.65, 1.00) | ||
|
| 0.25 | SVB | 0.603 (0.29, 2.02) | 2.298 (1.21, 8.84) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.08) | 1.000 (0.95, 1.00) |
| BhGLM | 0.503 (0.35, 1.36) | 3.141 (2.25, 5.59) | 1.000 (0.93, 1.00) | 0.000 (0.00, 0.03) | 1.000 (0.97, 1.00) | ||
| BVSNLP | 2.946 (1.96, 8.72) | 11.426 (6.98, 36.46) | 1.000 (0.90, 1.00) | 0.000 (0.00, 0.07) | 1.000 (0.95, 1.00) | ||
| 0.4 | SVB | 1.092 (0.32, 2.83) | 3.878 (1.40, 14.06) | 0.967 (0.83, 1.00) | 0.000 (0.00, 0.08) | 0.983 (0.92, 1.00) | |
| BhGLM | 0.674 (0.40, 1.64) | 3.610 (2.28, 7.72) | 1.000 (0.93, 1.00) | 0.000 (0.00, 0.04) | 1.000 (0.97, 1.00) | ||
| BVSNLP | 3.163 (2.14, 10.53) | 12.227 (8.14, 45.64) | 1.000 (0.73, 1.00) | 0.000 (0.00, 0.32) | 1.000 (0.87, 1.00) |
Fig. 1.Ovarian cancer dataset model convergence diagnostics for λ = 1
Gene names and selection proportions for ovarian cancer dataset
| PI3 | PPP3CA | CCR7 | SDF2L1 | D4S234E | VSIG4 | DAP | IL7R |
|---|---|---|---|---|---|---|---|
| 0.7 | 0.379 | 0.293 | 0.286 | 0.229 | 0.171 | 0.136 | 0.136 |
| TBP | ACSL3 | SLAMF7 | UBD | IL2RG | GALNT10 | FLJ20323 | RNF128 |
| 0.121 | 0.114 | 0.1 | 0.1 | 0.064 | 0.057 | 0.05 | 0.05 |
Gene names and selection proportions for the breast cancer dataset
| ARHGAP28 | NEK2 | ABCC5 | GREM1 | DUSP4 | ITGA5 | CCL2 | IGFBP7 |
|---|---|---|---|---|---|---|---|
| 0.386 | 0.25 | 0.2 | 0.2 | 0.193 | 0.193 | 0.164 | 0.143 |
| NFE2L3 | TRPC1 | PKMYT1 | DDX31 | EMILIN1 | SSPN | ABO | HSPC072 |
| 0.114 | 0.114 | 0.1 | 0.086 | 0.086 | 0.086 | 0.079 | 0.079 |
Fig. 2.(A) Kaplan–Meier curves for patients in low- and high-risk groups. (B) Comparison of patients in the low- and high-risk groups (ordered by )—within each cell the (variational) posterior probability patient in row i is at greater risk than patient in column j is computed. Samples are taken from the second validation fold and the fit with is used