Literature DB >> 10953243

Gradient-based optimization of hyperparameters.

Y Bengio1.   

Abstract

Many machine learning algorithms can be formulated as the minimization of a training criterion that involves a hyperparameter. This hyperparameter is usually chosen by trial and error with a model selection criterion. In this article we present a methodology to optimize several hyperparameters, based on the computation of the gradient of a model selection criterion with respect to the hyperparameters. In the case of a quadratic training criterion, the gradient of the selection criterion with respect to the hyperparameters is efficiently computed by backpropagating through a Cholesky decomposition. In the more general case, we show that the implicit function theorem can be used to derive a formula for the hyperparameter gradient involving second derivatives of the training criterion.

Entities:  

Mesh:

Year:  2000        PMID: 10953243     DOI: 10.1162/089976600300015187

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  9 in total

Review 1.  Machine learning and radiology.

Authors:  Shijun Wang; Ronald M Summers
Journal:  Med Image Anal       Date:  2012-02-23       Impact factor: 8.545

2.  Review of ML and AutoML Solutions to Forecast Time-Series Data.

Authors:  Ahmad Alsharef; Karan Aggarwal; Manoj Kumar; Ashutosh Mishra
Journal:  Arch Comput Methods Eng       Date:  2022-06-01       Impact factor: 8.171

3.  Visual Recognition Software for Binary Classification and Its Application to Spruce Pollen Identification.

Authors:  David K Tcheng; Ashwin K Nayak; Charless C Fowlkes; Surangi W Punyasena
Journal:  PLoS One       Date:  2016-02-11       Impact factor: 3.240

4.  Automated spectroscopic modelling with optimised convolutional neural networks.

Authors:  Zefang Shen; R A Viscarra Rossel
Journal:  Sci Rep       Date:  2021-01-08       Impact factor: 4.379

5.  Artificial Intelligence to Get Insights of Multi-Drug Resistance Risk Factors during the First 48 Hours from ICU Admission.

Authors:  Inmaculada Mora-Jiménez; Jorge Tarancón-Rey; Joaquín Álvarez-Rodríguez; Cristina Soguero-Ruiz
Journal:  Antibiotics (Basel)       Date:  2021-02-27

6.  Brain-inspired global-local learning incorporated with neuromorphic computing.

Authors:  Yujie Wu; Rong Zhao; Jun Zhu; Feng Chen; Mingkun Xu; Guoqi Li; Sen Song; Lei Deng; Guanrui Wang; Hao Zheng; Songchen Ma; Jing Pei; Youhui Zhang; Mingguo Zhao; Luping Shi
Journal:  Nat Commun       Date:  2022-01-10       Impact factor: 14.919

7.  Learning the Effect of Registration Hyperparameters with HyperMorph.

Authors:  Andrew Hoopes; Malte Hoffmann; Douglas N Greve; Bruce Fischl; John Guttag; Adrian V Dalca
Journal:  J Mach Learn Biomed Imaging       Date:  2022-04-07

8.  Patient-specific hyperparameter learning for optimization-based CT image reconstruction.

Authors:  Jingyan Xu; Frederic Noo
Journal:  Phys Med Biol       Date:  2021-09-20       Impact factor: 4.174

9.  MLBCD: a machine learning tool for big clinical data.

Authors:  Gang Luo
Journal:  Health Inf Sci Syst       Date:  2015-09-28
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.