Literature DB >> 18196266

Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison.

Christoph Kolodziejski1, Bernd Porr, Florentin Wörgötter.   

Abstract

A confusingly wide variety of temporally asymmetric learning rules exists related to reinforcement learning and/or to spike-timing dependent plasticity, many of which look exceedingly similar, while displaying strongly different behavior. These rules often find their use in control tasks, for example in robotics and for this rigorous convergence and numerical stability is required. The goal of this article is to review these rules and compare them to provide a better overview over their different properties. Two main classes will be discussed: temporal difference (TD) rules and correlation based (differential hebbian) rules and some transition cases. In general we will focus on neuronal implementations with changeable synaptic weights and a time-continuous representation of activity. In a machine learning (non-neuronal) context, for TD-learning a solid mathematical theory has existed since several years. This can partly be transferred to a neuronal framework, too. On the other hand, only now a more complete theory has also emerged for differential Hebb rules. In general rules differ by their convergence conditions and their numerical stability, which can lead to very undesirable behavior, when wanting to apply them. For TD, convergence can be enforced with a certain output condition assuring that the delta-error drops on average to zero (output control). Correlation based rules, on the other hand, converge when one input drops to zero (input control). Temporally asymmetric learning rules treat situations where incoming stimuli follow each other in time. Thus, it is necessary to remember the first stimulus to be able to relate it to the later occurring second one. To this end different types of so-called eligibility traces are being used by these two different types of rules. This aspect leads again to different properties of TD and differential Hebbian learning as discussed here. Thus, this paper, while also presenting several novel mathematical results, is mainly meant to provide a road map through the different neuronally emulated temporal asymmetrical learning rules and their behavior to provide some guidance for possible applications.

Entities:  

Mesh:

Year:  2008        PMID: 18196266      PMCID: PMC2798052          DOI: 10.1007/s00422-007-0209-6

Source DB:  PubMed          Journal:  Biol Cybern        ISSN: 0340-1200            Impact factor:   2.086


  33 in total

1.  Computational consequences of temporally asymmetric learning rules: I. Differential hebbian learning.

Authors:  P D Roberts
Journal:  J Comput Neurosci       Date:  1999 Nov-Dec       Impact factor: 1.621

Review 2.  Synaptic modification by correlated activity: Hebb's postulate revisited.

Authors:  G Bi ; M Poo
Journal:  Annu Rev Neurosci       Date:  2001       Impact factor: 12.449

3.  Temporal difference model reproduces anticipatory neural activity.

Authors:  R E Suri; W Schultz
Journal:  Neural Comput       Date:  2001-04       Impact factor: 2.026

4.  Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity.

Authors:  A Arleo; W Gerstner
Journal:  Biol Cybern       Date:  2000-09       Impact factor: 2.086

5.  Modeling functions of striatal dopamine modulation in learning and planning.

Authors:  R E Suri; J Bargas; M A Arbib
Journal:  Neuroscience       Date:  2001       Impact factor: 3.590

6.  Isotropic sequence order learning.

Authors:  Bernd Porr; Florentin Wörgötter
Journal:  Neural Comput       Date:  2003-04       Impact factor: 2.026

7.  ISO learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm.

Authors:  Bernd Porr; Christian von Ferber; Florentin Wörgötter
Journal:  Neural Comput       Date:  2003-04       Impact factor: 2.026

Review 8.  TD models of reward predictive responses in dopamine neurons.

Authors:  Roland E Suri
Journal:  Neural Netw       Date:  2002 Jun-Jul

9.  Mossy fibre synaptic NMDA receptors trigger non-Hebbian long-term potentiation at entorhino-CA3 synapses in the rat.

Authors:  Masako Tsukamoto; Takuya Yasui; Maki K Yamada; Nobuyoshi Nishiyama; Norio Matsuki; Yuji Ikegaya
Journal:  J Physiol       Date:  2003-02-01       Impact factor: 5.182

10.  Adaptive, fast walking in a biped robot under neuronal control and learning.

Authors:  Poramate Manoonpong; Tao Geng; Tomas Kulvicius; Bernd Porr; Florentin Wörgötter
Journal:  PLoS Comput Biol       Date:  2007-07       Impact factor: 4.475

View more
  3 in total

1.  Closed-Form Treatment of the Interactions between Neuronal Activity and Timing-Dependent Plasticity in Networks of Linear Neurons.

Authors:  Christoph Kolodziejski; Christian Tetzlaff; Florentin Wörgötter
Journal:  Front Comput Neurosci       Date:  2010-10-27       Impact factor: 2.380

2.  Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control.

Authors:  Sakyasingha Dasgupta; Florentin Wörgötter; Poramate Manoonpong
Journal:  Front Neural Circuits       Date:  2014-10-28       Impact factor: 3.492

3.  Synaptic plasticity in a recurrent neural network for versatile and adaptive behaviors of a walking robot.

Authors:  Eduard Grinke; Christian Tetzlaff; Florentin Wörgötter; Poramate Manoonpong
Journal:  Front Neurorobot       Date:  2015-10-13       Impact factor: 2.650

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.