Literature DB >> 33252656

Fractional ridge regression: a fast, interpretable reparameterization of ridge regression.

Ariel Rokem1, Kendrick Kay2.   

Abstract

BACKGROUND: Ridge regression is a regularization technique that penalizes the L2-norm of the coefficients in linear regression. One of the challenges of using ridge regression is the need to set a hyperparameter (α) that controls the amount of regularization. Cross-validation is typically used to select the best α from a set of candidates. However, efficient and appropriate selection of α can be challenging. This becomes prohibitive when large amounts of data are analyzed. Because the selected α depends on the scale of the data and correlations across predictors, it is also not straightforwardly interpretable.
RESULTS: The present work addresses these challenges through a novel approach to ridge regression. We propose to reparameterize ridge regression in terms of the ratio γ between the L2-norms of the regularized and unregularized coefficients. We provide an algorithm that efficiently implements this approach, called fractional ridge regression, as well as open-source software implementations in Python and matlab (https://github.com/nrdg/fracridge). We show that the proposed method is fast and scalable for large-scale data problems. In brain imaging data, we demonstrate that this approach delivers results that are straightforward to interpret and compare across models and datasets.
CONCLUSION: Fractional ridge regression has several benefits: the solutions obtained for different γ are guaranteed to vary, guarding against wasted calculations; and automatically span the relevant range of regularization, avoiding the need for arduous manual exploration. These properties make fractional ridge regression particularly suitable for analysis of large complex datasets.
© The Author(s) 2020. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  brain imaging; general linear model; hyperparameters; open-source software

Year:  2020        PMID: 33252656      PMCID: PMC7702219          DOI: 10.1093/gigascience/giaa133

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  6 in total

1.  Efficient quadratic regularization for expression arrays.

Authors:  Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2004-07       Impact factor: 5.899

Review 2.  Computational neuroimaging and population receptive fields.

Authors:  Brian A Wandell; Jonathan Winawer
Journal:  Trends Cogn Sci       Date:  2015-04-04       Impact factor: 20.229

3.  Definitions, methods, and applications in interpretable machine learning.

Authors:  W James Murdoch; Chandan Singh; Karl Kumbier; Reza Abbasi-Asl; Bin Yu
Journal:  Proc Natl Acad Sci U S A       Date:  2019-10-16       Impact factor: 11.205

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

Review 5.  Array programming with NumPy.

Authors:  Charles R Harris; K Jarrod Millman; Stéfan J van der Walt; Ralf Gommers; Pauli Virtanen; David Cournapeau; Eric Wieser; Julian Taylor; Sebastian Berg; Nathaniel J Smith; Robert Kern; Matti Picus; Stephan Hoyer; Marten H van Kerkwijk; Matthew Brett; Allan Haldane; Jaime Fernández Del Río; Mark Wiebe; Pearu Peterson; Pierre Gérard-Marchant; Kevin Sheppard; Tyler Reddy; Warren Weckesser; Hameer Abbasi; Christoph Gohlke; Travis E Oliphant
Journal:  Nature       Date:  2020-09-16       Impact factor: 49.962

Review 6.  SciPy 1.0: fundamental algorithms for scientific computing in Python.

Authors:  Pauli Virtanen; Ralf Gommers; Travis E Oliphant; Matt Haberland; Tyler Reddy; David Cournapeau; Evgeni Burovski; Pearu Peterson; Warren Weckesser; Jonathan Bright; Stéfan J van der Walt; Matthew Brett; Joshua Wilson; K Jarrod Millman; Nikolay Mayorov; Andrew R J Nelson; Eric Jones; Robert Kern; Eric Larson; C J Carey; İlhan Polat; Yu Feng; Eric W Moore; Jake VanderPlas; Denis Laxalde; Josef Perktold; Robert Cimrman; Ian Henriksen; E A Quintero; Charles R Harris; Anne M Archibald; Antônio H Ribeiro; Fabian Pedregosa; Paul van Mulbregt
Journal:  Nat Methods       Date:  2020-02-03       Impact factor: 28.547

  6 in total
  1 in total

1.  NeuroGen: Activation optimized image synthesis for discovery neuroscience.

Authors:  Zijin Gu; Keith Wakefield Jamison; Meenakshi Khosla; Emily J Allen; Yihan Wu; Ghislain St-Yves; Thomas Naselaris; Kendrick Kay; Mert R Sabuncu; Amy Kuceyeski
Journal:  Neuroimage       Date:  2021-12-20       Impact factor: 6.556

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.