Literature DB >> 36092461

LassoNet: Neural Networks with Feature Sparsity.

Ismael Lemhadri1, Feng Ruan1, Robert Tibshirani1.   

Abstract

Much work has been done recently to make neural networks more interpretable, and one approach is to arrange for the network to use only a subset of the available features. In linear models, Lasso (or ℓ 1-regularized) regression assigns zero weights to the most irrelevant or redundant features, and is widely used in data science. However the Lasso only applies to linear models. Here we introduce LassoNet, a neural network framework with global feature selection. Our approach achieves feature sparsity by allowing a feature to participate in a hidden unit only if its linear representative is active. Unlike other approaches to feature selection for neural nets, our method uses a modified objective function with constraints, and so integrates feature selection with the parameter learning directly. As a result, it delivers an entire regularization path of solutions with a range of feature sparsity. In experiments with real and simulated data, LassoNet significantly outperforms state-of-the-art methods for feature selection and regression. The LassoNet method uses projected proximal gradient descent, and generalizes directly to deep networks. It can be implemented by adding just a few lines of code to a standard neural network.

Entities:  

Year:  2021        PMID: 36092461      PMCID: PMC9453696     

Source DB:  PubMed          Journal:  Proc Mach Learn Res


  8 in total

1.  An experimental comparison of feature selection methods on two-class biomedical datasets.

Authors:  P Drotár; J Gazda; Z Smékal
Journal:  Comput Biol Med       Date:  2015-08-24       Impact factor: 4.589

2.  High-dimensional feature selection by feature-wise kernelized Lasso.

Authors:  Makoto Yamada; Wittawat Jitkrittum; Leonid Sigal; Eric P Xing; Masashi Sugiyama
Journal:  Neural Comput       Date:  2013-10-08       Impact factor: 2.026

3.  Learning interactions via hierarchical group-lasso regularization.

Authors:  Michael Lim; Trevor Hastie
Journal:  J Comput Graph Stat       Date:  2015-09-16       Impact factor: 2.302

Review 4.  Proteomic applications for the early detection of cancer.

Authors:  Julia D Wulfkuhle; Lance A Liotta; Emanuel F Petricoin
Journal:  Nat Rev Cancer       Date:  2003-04       Impact factor: 60.716

5.  Spectral Regularization Algorithms for Learning Large Incomplete Matrices.

Authors:  Rahul Mazumder; Trevor Hastie; Robert Tibshirani
Journal:  J Mach Learn Res       Date:  2010-03-01       Impact factor: 3.654

6.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

7.  A SIGNIFICANCE TEST FOR THE LASSO.

Authors:  Richard Lockhart; Jonathan Taylor; Ryan J Tibshirani; Robert Tibshirani
Journal:  Ann Stat       Date:  2014-04       Impact factor: 4.028

8.  Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome.

Authors:  Clara Higuera; Katheleen J Gardiner; Krzysztof J Cios
Journal:  PLoS One       Date:  2015-06-25       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.