Literature DB >> 23772171

Consistent Group Identification and Variable Selection in Regression with Correlated Predictors.

Dhruv B Sharma, Howard D Bondell, Hao Helen Zhang.   

Abstract

Statistical procedures for variable selection have become integral elements in any analysis. Successful procedures are characterized by high predictive accuracy, yielding interpretable models while retaining computational efficiency. Penalized methods that perform coefficient shrinkage have been shown to be successful in many cases. Models with correlated predictors are particularly challenging to tackle. We propose a penalization procedure that performs variable selection while clustering groups of predictors automatically. The oracle properties of this procedure including consistency in group identification are also studied. The proposed method compares favorably with existing selection approaches in both prediction accuracy and model discovery, while retaining its computational efficiency. Supplemental material are available online.

Entities:  

Keywords:  Coefficient shrinkage; Correlation; Group identification; Oracle properties; Penalization; Supervised clustering; Variable selection

Year:  2013        PMID: 23772171      PMCID: PMC3678393          DOI: 10.1080/15533174.2012.707849

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   2.302


  7 in total

1.  Averaged gene expressions for regression.

Authors:  Mee Young Park; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2006-05-11       Impact factor: 5.899

2.  Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR.

Authors:  Howard D Bondell; Brian J Reich
Journal:  Biometrics       Date:  2007-06-30       Impact factor: 2.571

3.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

4.  Simultaneous factor selection and collapsing levels in ANOVA.

Authors:  Howard D Bondell; Brian J Reich
Journal:  Biometrics       Date:  2008-05-28       Impact factor: 2.571

5.  ON THE ADAPTIVE ELASTIC-NET WITH A DIVERGING NUMBER OF PARAMETERS.

Authors:  Hui Zou; Hao Helen Zhang
Journal:  Ann Stat       Date:  2009       Impact factor: 4.028

6.  Supervised harvesting of expression trees.

Authors:  T Hastie; R Tibshirani; D Botstein; P Brown
Journal:  Genome Biol       Date:  2001-01-10       Impact factor: 13.583

7.  A multivariate regression approach to association analysis of a quantitative trait network.

Authors:  Seyoung Kim; Kyung-Ah Sohn; Eric P Xing
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

  7 in total
  2 in total

1.  Deciduous forest responses to temperature, precipitation, and drought imply complex climate change impacts.

Authors:  Yingying Xie; Xiaojing Wang; John A Silander
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-19       Impact factor: 11.205

2.  The Cluster Elastic Net for High-Dimensional Regression With Unknown Variable Grouping.

Authors:  Daniela M Witten; Ali Shojaie; Fan Zhang
Journal:  Technometrics       Date:  2014-02-20
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.