| Literature DB >> 31886488 |
Magnus M Münch1, Carel F W Peeters2, Aad W Van Der Vaart3, Mark A Van De Wiel4.
Abstract
In high-dimensional data settings, additional information on the features is often available. Examples of such external information in omics research are: (i) $p$-values from a previous study and (ii) omics annotation. The inclusion of this information in the analysis may enhance classification performance and feature selection but is not straightforward. We propose a group-regularized (logistic) elastic net regression method, where each penalty parameter corresponds to a group of features based on the external information. The method, termed gren, makes use of the Bayesian formulation of logistic elastic net regression to estimate both the model and penalty parameters in an approximate empirical-variational Bayes framework. Simulations and applications to three cancer genomics studies and one Alzheimer metabolomics study show that, if the partitioning of the features is informative, classification performance, and feature selection are indeed enhanced.Entities:
Keywords: Empirical Bayes; High-dimensional data; Prediction; Variational Bayes
Mesh:
Year: 2021 PMID: 31886488 PMCID: PMC8596493 DOI: 10.1093/biostatistics/kxz062
Source DB: PubMed Journal: Biostatistics ISSN: 1465-4644 Impact factor: 5.899