Literature DB >> 25506256

Strong rules for discarding predictors in lasso-type problems.

Robert Tibshirani1, Jacob Bien1, Jerome Friedman1, Trevor Hastie1, Noah Simon1, Jonathan Taylor1, Ryan J Tibshirani1.   

Abstract

We consider rules for discarding predictors in lasso regression and related problems, for computational efficiency. El Ghaoui and his colleagues have propose 'SAFE' rules, based on univariate inner products between each predictor and the outcome, which guarantee that a coefficient will be 0 in the solution vector. This provides a reduction in the number of variables that need to be entered into the optimization. We propose strong rules that are very simple and yet screen out far more predictors than the SAFE rules. This great practical improvement comes at a price: the strong rules are not foolproof and can mistakenly discard active predictors, i.e. predictors that have non-zero coefficients in the solution. We therefore combine them with simple checks of the Karush-Kuhn-Tucker conditions to ensure that the exact solution to the convex problem is delivered. Of course, any (approximate) screening method can be combined with the Karush-Kuhn-Tucker, conditions to ensure the exact solution; the strength of the strong rules lies in the fact that, in practice, they discard a very large number of the inactive predictors and almost never commit mistakes. We also derive conditions under which they are foolproof. Strong rules provide substantial savings in computational time for a variety of statistical optimization problems.

Entities:  

Year:  2012        PMID: 25506256      PMCID: PMC4262615          DOI: 10.1111/j.1467-9868.2011.01004.x

Source DB:  PubMed          Journal:  J R Stat Soc Series B Stat Methodol        ISSN: 1369-7412            Impact factor:   4.488


  2 in total

1.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

2.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

  2 in total
  108 in total

1.  Quality of life, pain, and psychological factors in patients undergoing surgery for primary tumors of the spine.

Authors:  Francesca Luzzati; Emanuele Maria Giusti; Gennaro Maria Scotto; Giuseppe Perrucchini; Luca Cannavò; Gianluca Castelnuovo; Andrea Colonna Cottini
Journal:  Support Care Cancer       Date:  2019-07-01       Impact factor: 3.603

2.  High-resolution linkage map for two honeybee chromosomes: the hotspot quest.

Authors:  Florence Mougel; Marie-Anne Poursat; Nicolas Beaume; Dominique Vautrin; Michel Solignac
Journal:  Mol Genet Genomics       Date:  2013-10-27       Impact factor: 3.291

Review 3.  Personalized evidence based medicine: predictive approaches to heterogeneous treatment effects.

Authors:  David M Kent; Ewout Steyerberg; David van Klaveren
Journal:  BMJ       Date:  2018-12-10

4.  Learning interactions via hierarchical group-lasso regularization.

Authors:  Michael Lim; Trevor Hastie
Journal:  J Comput Graph Stat       Date:  2015-09-16       Impact factor: 2.302

5.  Development and validation of Risk Equations for Complications Of type 2 Diabetes (RECODe) using individual participant data from randomised trials.

Authors:  Sanjay Basu; Jeremy B Sussman; Seth A Berkowitz; Rodney A Hayward; John S Yudkin
Journal:  Lancet Diabetes Endocrinol       Date:  2017-08-10       Impact factor: 32.069

6.  Making the Most of Clumping and Thresholding for Polygenic Scores.

Authors:  Florian Privé; Bjarni J Vilhjálmsson; Hugues Aschard; Michael G B Blum
Journal:  Am J Hum Genet       Date:  2019-11-21       Impact factor: 11.025

7.  Delirium Severity Trajectories and Outcomes in ICU Patients. Defining a Dynamic Symptom Phenotype.

Authors:  Heidi Lindroth; Babar A Khan; Janet S Carpenter; Sujuan Gao; Anthony J Perkins; Sikandar H Khan; Sophia Wang; Richard N Jones; Malaz A Boustani
Journal:  Ann Am Thorac Soc       Date:  2020-09

8.  Predictors of 6-month health utility outcomes in survivors of acute respiratory distress syndrome.

Authors:  Samuel M Brown; Emily Wilson; Angela P Presson; Chong Zhang; Victor D Dinglas; Tom Greene; Ramona O Hopkins; Dale M Needham
Journal:  Thorax       Date:  2016-07-20       Impact factor: 9.139

9.  Object-oriented regression for building predictive models with high dimensional omics data from translational studies.

Authors:  Lue Ping Zhao; Hamid Bolouri
Journal:  J Biomed Inform       Date:  2016-03-10       Impact factor: 6.317

10.  Regularization Paths for Conditional Logistic Regression: The clogitL1 Package.

Authors:  Stephen Reid; Rob Tibshirani
Journal:  J Stat Softw       Date:  2014-07       Impact factor: 6.440

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.