Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Ensemble Linear Subspace Analysis of High-Dimensional Data.

Literature DB >> 33803346

Ensemble Linear Subspace Analysis of High-Dimensional Data.

S Ejaz Ahmed¹, Saeid Amiri², Kjell Doksum³.

Abstract

Regression models provide prediction frameworks for multivariate mutual information analysis that uses information concepts when choosing covariates (also called features) that are important for analysis and prediction. We consider a high dimensional regression framework where the number of covariates (p) exceed the sample size (n). Recent work in high dimensional regression analysis has embraced an ensemble subspace approach that consists of selecting random subsets of covariates with fewer than p covariates, doing statistical analysis on each subset, and then merging the results from the subsets. We examine conditions under which penalty methods such as Lasso perform better when used in the ensemble approach by computing mean squared prediction errors for simulations and a real data example. Linear models with both random and fixed designs are considered. We examine two versions of penalty methods: one where the tuning parameter is selected by cross-validation; and one where the final predictor is a trimmed average of individual predictors corresponding to the members of a set of fixed tuning parameters. We find that the ensemble approach improves on penalty methods for several important real data and model scenarios. The improvement occurs when covariates are strongly associated with the response, when the complexity of the model is high. In such cases, the trimmed average version of ensemble Lasso is often the best predictor.

Entities: Disease Gene Species

Keywords: Lasso; elastic net; ensembling; high-dimensional data; penalty methods; prediction; random subspaces

Year: 2021 PMID： 33803346 PMCID： PMC7998555 DOI： 10.3390/e23030324

Source DB: PubMed Journal: Entropy (Basel) ISSN： 1099-4300 Impact factor: 2.524

1 in total

1. Transfer-Learning-Based Approach for the Diagnosis of Lung Diseases from Chest X-ray Images.

Authors: Rong Fan; Shengrong Bu
Journal: Entropy (Basel) Date: 2022-02-22 Impact factor: 2.524

1 in total