Literature DB >> 16212768

Learning curves for stochastic gradient descent in linear feedforward networks.

Justin Werfel1, Xiaohui Xie, H Sebastian Seung.   

Abstract

Gradient-following learning methods can encounter problems of implementation in many applications, and stochastic variants are sometimes used to overcome these difficulties. We analyze three online training methods used with a linear perceptron: direct gradient descent, node perturbation, and weight perturbation. Learning speed is defined as the rate of exponential decay in the learning curves. When the scalar parameter that controls the size of weight updates is chosen to maximize learning speed, node perturbation is slower than direct gradient descent by a factor equal to the number of output units; weight perturbation is slower still by an additional factor equal to the number of input units. Parallel perturbation allows faster learning than sequential perturbation, by a factor that does not depend on network size. We also characterize how uncertainty in quantities used in the stochastic updates affects the learning curves. This study suggests that in practice, weight perturbation may be slow for large networks, and node perturbation can have performance comparable to that of direct gradient descent when there are few output units. However, these statements depend on the specifics of the learning problem, such as the input distribution and the target function, and are not universally applicable.

Mesh:

Year:  2005        PMID: 16212768     DOI: 10.1162/089976605774320539

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  20 in total

1.  Toward an Integration of Deep Learning and Neuroscience.

Authors:  Adam H Marblestone; Greg Wayne; Konrad P Kording
Journal:  Front Comput Neurosci       Date:  2016-09-14       Impact factor: 2.380

2.  Spike-based decision learning of Nash equilibria in two-player games.

Authors:  Johannes Friedrich; Walter Senn
Journal:  PLoS Comput Biol       Date:  2012-09-27       Impact factor: 4.475

3.  A computational model of use-dependent motor recovery following a stroke: optimizing corticospinal activations via reinforcement learning can explain residual capacity and other strength recovery dynamics.

Authors:  David J Reinkensmeyer; Emmanuel Guigon; Marc A Maier
Journal:  Neural Netw       Date:  2012-02-13

4.  Vocal exploration is locally regulated during song learning.

Authors:  Primoz Ravbar; Dina Lipkind; Lucas C Parra; Ofer Tchernichovski
Journal:  J Neurosci       Date:  2012-03-07       Impact factor: 6.167

Review 5.  Backpropagation and the brain.

Authors:  Timothy P Lillicrap; Adam Santoro; Luke Marris; Colin J Akerman; Geoffrey Hinton
Journal:  Nat Rev Neurosci       Date:  2020-04-17       Impact factor: 34.870

Review 6.  How learning unfolds in the brain: toward an optimization view.

Authors:  Jay A Hennig; Emily R Oby; Darby M Losey; Aaron P Batista; Byron M Yu; Steven M Chase
Journal:  Neuron       Date:  2021-10-13       Impact factor: 17.173

7.  A smart, practical, deep learning-based clinical decision support tool for patients in the prostate-specific antigen gray zone: model development and validation.

Authors:  Sang Hun Song; Hwanik Kim; Jung Kwon Kim; Hakmin Lee; Jong Jin Oh; Sang-Chul Lee; Seong Jin Jeong; Sung Kyu Hong; Junghoon Lee; Sangjun Yoo; Min-Soo Choo; Min Chul Cho; Hwancheol Son; Hyeon Jeong; Jungyo Suh; Seok-Soo Byun
Journal:  J Am Med Inform Assoc       Date:  2022-10-07       Impact factor: 7.942

8.  Maximization of learning speed in the motor cortex due to neuronal redundancy.

Authors:  Ken Takiyama; Masato Okada
Journal:  PLoS Comput Biol       Date:  2012-01-12       Impact factor: 4.475

9.  Gradient estimation in dendritic reinforcement learning.

Authors:  Mathieu Schiess; Robert Urbanczik; Walter Senn
Journal:  J Math Neurosci       Date:  2012-02-15       Impact factor: 1.300

10.  Burst-dependent synaptic plasticity can coordinate learning in hierarchical circuits.

Authors:  Alexandre Payeur; Jordan Guerguiev; Blake A Richards; Richard Naud; Friedemann Zenke
Journal:  Nat Neurosci       Date:  2021-05-13       Impact factor: 28.771

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.