Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Learning curves for stochastic gradient descent in linear feedforward networks.

Literature DB >> 16212768

Learning curves for stochastic gradient descent in linear feedforward networks.

Justin Werfel¹, Xiaohui Xie, H Sebastian Seung.

Abstract

Gradient-following learning methods can encounter problems of implementation in many applications, and stochastic variants are sometimes used to overcome these difficulties. We analyze three online training methods used with a linear perceptron: direct gradient descent, node perturbation, and weight perturbation. Learning speed is defined as the rate of exponential decay in the learning curves. When the scalar parameter that controls the size of weight updates is chosen to maximize learning speed, node perturbation is slower than direct gradient descent by a factor equal to the number of output units; weight perturbation is slower still by an additional factor equal to the number of input units. Parallel perturbation allows faster learning than sequential perturbation, by a factor that does not depend on network size. We also characterize how uncertainty in quantities used in the stochastic updates affects the learning curves. This study suggests that in practice, weight perturbation may be slow for large networks, and node perturbation can have performance comparable to that of direct gradient descent when there are few output units. However, these statements depend on the specifics of the learning problem, such as the input distribution and the target function, and are not universally applicable.

Mesh：

Year: 2005 PMID： 16212768 DOI： 10.1162/089976605774320539

Source DB: PubMed Journal: Neural Comput ISSN： 0899-7667 Impact factor: 2.026

Keyword Cloud
Cited

20 in total

1. Toward an Integration of Deep Learning and Neuroscience.

Authors: Adam H Marblestone; Greg Wayne; Konrad P Kording
Journal: Front Comput Neurosci Date: 2016-09-14 Impact factor: 2.380

2. Spike-based decision learning of Nash equilibria in two-player games.

Authors: Johannes Friedrich; Walter Senn
Journal: PLoS Comput Biol Date: 2012-09-27 Impact factor: 4.475

3. A computational model of use-dependent motor recovery following a stroke: optimizing corticospinal activations via reinforcement learning can explain residual capacity and other strength recovery dynamics.

Authors: David J Reinkensmeyer; Emmanuel Guigon; Marc A Maier
Journal: Neural Netw Date: 2012-02-13

4. Vocal exploration is locally regulated during song learning.

Authors: Primoz Ravbar; Dina Lipkind; Lucas C Parra; Ofer Tchernichovski
Journal: J Neurosci Date: 2012-03-07 Impact factor: 6.167

Review 5. Backpropagation and the brain.

Authors: Timothy P Lillicrap; Adam Santoro; Luke Marris; Colin J Akerman; Geoffrey Hinton
Journal: Nat Rev Neurosci Date: 2020-04-17 Impact factor: 34.870

Review 6. How learning unfolds in the brain: toward an optimization view.

Authors: Jay A Hennig; Emily R Oby; Darby M Losey; Aaron P Batista; Byron M Yu; Steven M Chase
Journal: Neuron Date: 2021-10-13 Impact factor: 17.173

7. A smart, practical, deep learning-based clinical decision support tool for patients in the prostate-specific antigen gray zone: model development and validation.

Authors: Sang Hun Song; Hwanik Kim; Jung Kwon Kim; Hakmin Lee; Jong Jin Oh; Sang-Chul Lee; Seong Jin Jeong; Sung Kyu Hong; Junghoon Lee; Sangjun Yoo; Min-Soo Choo; Min Chul Cho; Hwancheol Son; Hyeon Jeong; Jungyo Suh; Seok-Soo Byun
Journal: J Am Med Inform Assoc Date: 2022-10-07 Impact factor: 7.942