
Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Howard D Bondell, Brian J Reich.

Abstract

Variable selection can be challenging, particularly in situations with a large number of predictors with possibly high correlations, such as gene expression data. In this article, a new method called the OSCAR (octagonal shrinkage and clustering algorithm for regression) is proposed to simultaneously select variables while grouping them into predictive clusters. In addition to improving prediction accuracy and interpretation, these resulting groups can then be investigated further to discover what contributes to the group having a similar behavior. The technique is based on penalized least squares with a geometrically intuitive penalty function that shrinks some coefficients to exactly zero. Additionally, this penalty yields exact equality of some coefficients, encouraging correlated predictors that have a similar effect on the response to form predictive clusters represented by a single coefficient. The proposed procedure is shown to compare favorably to the existing shrinkage and variable selection techniques in terms of both prediction error and model complexity, while yielding the additional grouping information.
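The abstract describes the penalty only in words. As a minimal sketch, assuming the form usually given for OSCAR (an L1 norm plus a weight c times the sum of pairwise maxima of the absolute coefficients, which this record itself does not state), the penalty can be evaluated as below. The function names and the parameter name c are illustrative and are not taken from the record. The second function uses an equivalent weighted sum over the sorted absolute coefficients, which hints at why both zeros and ties among coefficients are favored: the constraint region has vertices where coefficients vanish and where their magnitudes coincide.

```python
from itertools import combinations


def oscar_penalty(beta, c):
    """Assumed OSCAR penalty: sum_j |b_j| + c * sum_{j<k} max(|b_j|, |b_k|)."""
    abs_beta = [abs(b) for b in beta]
    l1_part = sum(abs_beta)
    # Pairwise L-infinity part: the larger magnitude of every coefficient pair.
    pairwise_part = sum(max(a, b) for a, b in combinations(abs_beta, 2))
    return l1_part + c * pairwise_part


def oscar_penalty_sorted(beta, c):
    """Equivalent form: weighted sum of the absolute coefficients in ascending order.

    The value at 0-based position `rank` is the maximum in exactly `rank` pairs,
    so it receives weight c * rank + 1.
    """
    abs_sorted = sorted(abs(b) for b in beta)
    return sum((c * rank + 1) * a for rank, a in enumerate(abs_sorted))
```

Calling both functions on the same coefficient vector, for example oscar_penalty([2.0, -2.0, 0.0, 0.5], 0.5) and oscar_penalty_sorted([2.0, -2.0, 0.0, 0.5], 0.5), should give the same value. Setting c to 0 reduces the penalty to the plain L1 (lasso) norm. This evaluates the penalty only; it is not a solver for the penalized least squares problem.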

Year:  2007        PMID: 17608783      PMCID: PMC2605279          DOI: 10.1111/j.1541-0420.2007.00843.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  1 in total

1.  Simultaneous gene clustering and subset selection for sample classification via MDL.

Authors:  Rebecka Jörnsten; Bin Yu
Journal:  Bioinformatics       Date:  2003-06-12       Impact factor: 6.937

  48 in total

1.  Sparse regression and marginal testing using cluster prototypes.

Authors:  Stephen Reid; Robert Tibshirani
Journal:  Biostatistics       Date:  2015-11-27       Impact factor: 5.899

2.  Scalable Bayesian variable selection for structured high-dimensional data.

Authors:  Changgee Chang; Suprateek Kundu; Qi Long
Journal:  Biometrics       Date:  2018-05-08       Impact factor: 2.571

3.  Simultaneous supervised clustering and feature selection over a graph.

Authors:  Xiaotong Shen; Hsin-Cheng Huang; Wei Pan
Journal:  Biometrika       Date:  2012-10-18       Impact factor: 2.445

4.  Adaptive regularization using the entire solution surface.

Authors:  S Wu; X Shen; C J Geyer
Journal:  Biometrika       Date:  2009-09       Impact factor: 2.445

5.  Feature Grouping and Selection Over an Undirected Graph.

Authors:  Sen Yang; Lei Yuan; Ying-Cheng Lai; Xiaotong Shen; Peter Wonka; Jieping Ye
Journal:  KDD       Date:  2012

6.  Adaptive Estimation with Partially Overlapping Models.

Authors:  Sunyoung Shin; Jason Fine; Yufeng Liu
Journal:  Stat Sin       Date:  2016-01       Impact factor: 1.261

7.  Interquantile Shrinkage and Variable Selection in Quantile Regression.

Authors:  Liewen Jiang; Howard D Bondell; Huixia Judy Wang
Journal:  Comput Stat Data Anal       Date:  2014-01-01       Impact factor: 1.681

8.  Factor selection and structural identification in the interaction ANOVA model.

Authors:  Justin B Post; Howard D Bondell
Journal:  Biometrics       Date:  2013-01-17       Impact factor: 2.571

9.  The Cluster Elastic Net for High-Dimensional Regression With Unknown Variable Grouping.

Authors:  Daniela M Witten; Ali Shojaie; Fan Zhang
Journal:  Technometrics       Date:  2014-02-20

10.  Simultaneous grouping pursuit and feature selection over an undirected graph.

Authors:  Yunzhang Zhu; Xiaotong Shen; Wei Pan
Journal:  J Am Stat Assoc       Date:  2013-01-01       Impact factor: 5.033
