Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison.

Literature DB >> 18196266

Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison.

Christoph Kolodziejski¹, Bernd Porr, Florentin Wörgötter.

Abstract

A confusingly wide variety of temporally asymmetric learning rules exists related to reinforcement learning and/or to spike-timing dependent plasticity, many of which look exceedingly similar, while displaying strongly different behavior. These rules often find their use in control tasks, for example in robotics and for this rigorous convergence and numerical stability is required. The goal of this article is to review these rules and compare them to provide a better overview over their different properties. Two main classes will be discussed: temporal difference (TD) rules and correlation based (differential hebbian) rules and some transition cases. In general we will focus on neuronal implementations with changeable synaptic weights and a time-continuous representation of activity. In a machine learning (non-neuronal) context, for TD-learning a solid mathematical theory has existed since several years. This can partly be transferred to a neuronal framework, too. On the other hand, only now a more complete theory has also emerged for differential Hebb rules. In general rules differ by their convergence conditions and their numerical stability, which can lead to very undesirable behavior, when wanting to apply them. For TD, convergence can be enforced with a certain output condition assuring that the delta-error drops on average to zero (output control). Correlation based rules, on the other hand, converge when one input drops to zero (input control). Temporally asymmetric learning rules treat situations where incoming stimuli follow each other in time. Thus, it is necessary to remember the first stimulus to be able to relate it to the later occurring second one. To this end different types of so-called eligibility traces are being used by these two different types of rules. This aspect leads again to different properties of TD and differential Hebbian learning as discussed here. Thus, this paper, while also presenting several novel mathematical results, is mainly meant to provide a road map through the different neuronally emulated temporal asymmetrical learning rules and their behavior to provide some guidance for possible applications.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2008 PMID： 18196266 PMCID： PMC2798052 DOI： 10.1007/s00422-007-0209-6

Source DB: PubMed Journal: Biol Cybern ISSN： 0340-1200 Impact factor: 2.086

33 in total

1. Computational consequences of temporally asymmetric learning rules: I. Differential hebbian learning.

Authors: P D Roberts
Journal: J Comput Neurosci Date: 1999 Nov-Dec Impact factor: 1.621

Review 2. Synaptic modification by correlated activity: Hebb's postulate revisited.

Authors: G Bi ; M Poo
Journal: Annu Rev Neurosci Date: 2001 Impact factor: 12.449

3. Temporal difference model reproduces anticipatory neural activity.

Authors: R E Suri; W Schultz
Journal: Neural Comput Date: 2001-04 Impact factor: 2.026

4. Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity.

Authors: A Arleo; W Gerstner
Journal: Biol Cybern Date: 2000-09 Impact factor: 2.086

5. Modeling functions of striatal dopamine modulation in learning and planning.

Authors: R E Suri; J Bargas; M A Arbib
Journal: Neuroscience Date: 2001 Impact factor: 3.590

6. Isotropic sequence order learning.

Authors: Bernd Porr; Florentin Wörgötter
Journal: Neural Comput Date: 2003-04 Impact factor: 2.026

7. ISO learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm.

Authors: Bernd Porr; Christian von Ferber; Florentin Wörgötter
Journal: Neural Comput Date: 2003-04 Impact factor: 2.026

Review 8. TD models of reward predictive responses in dopamine neurons.

Authors: Roland E Suri
Journal: Neural Netw Date: 2002 Jun-Jul

9. Mossy fibre synaptic NMDA receptors trigger non-Hebbian long-term potentiation at entorhino-CA3 synapses in the rat.

Authors: Masako Tsukamoto; Takuya Yasui; Maki K Yamada; Nobuyoshi Nishiyama; Norio Matsuki; Yuji Ikegaya
Journal: J Physiol Date: 2003-02-01 Impact factor: 5.182

10. Adaptive, fast walking in a biped robot under neuronal control and learning.

Authors: Poramate Manoonpong; Tao Geng; Tomas Kulvicius; Bernd Porr; Florentin Wörgötter
Journal: PLoS Comput Biol Date: 2007-07 Impact factor: 4.475

3 in total

1. Closed-Form Treatment of the Interactions between Neuronal Activity and Timing-Dependent Plasticity in Networks of Linear Neurons.

Authors: Christoph Kolodziejski; Christian Tetzlaff; Florentin Wörgötter
Journal: Front Comput Neurosci Date: 2010-10-27 Impact factor: 2.380

2. Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control.

Authors: Sakyasingha Dasgupta; Florentin Wörgötter; Poramate Manoonpong
Journal: Front Neural Circuits Date: 2014-10-28 Impact factor: 3.492

3. Synaptic plasticity in a recurrent neural network for versatile and adaptive behaviors of a walking robot.

Authors: Eduard Grinke; Christian Tetzlaff; Florentin Wörgötter; Poramate Manoonpong
Journal: Front Neurorobot Date: 2015-10-13 Impact factor: 2.650

3 in total