Literature DB >> 22927003

Evaluating the TD model of classical conditioning.

Elliot A Ludvig1, Richard S Sutton, E James Kehoe.   

Abstract

The temporal-difference (TD) algorithm from reinforcement learning provides a simple method for incrementally learning predictions of upcoming events. Applied to classical conditioning, TD models suppose that animals learn a real-time prediction of the unconditioned stimulus (US) on the basis of all available conditioned stimuli (CSs). In the TD model, similar to other error-correction models, learning is driven by prediction errors--the difference between the change in US prediction and the actual US. With the TD model, however, learning occurs continuously from moment to moment and is not artificially constrained to occur in trials. Accordingly, a key feature of any TD model is the assumption about the representation of a CS on a moment-to-moment basis. Here, we evaluate the performance of the TD model with a heretofore unexplored range of classical conditioning tasks. To do so, we consider three stimulus representations that vary in their degree of temporal generalization and evaluate how the representation influences the performance of the TD model on these conditioning tasks.

Entities:  

Mesh:

Year:  2012        PMID: 22927003     DOI: 10.3758/s13420-012-0082-6

Source DB:  PubMed          Journal:  Learn Behav        ISSN: 1543-4494            Impact factor:   1.986


  36 in total

1.  Extinction revisited: similarities between extinction and reductions in US intensity in classical conditioning of the rabbit's nictitating membrane response.

Authors:  E James Kehoe; Natasha E White
Journal:  Anim Learn Behav       Date:  2002-05

2.  Temporal-difference prediction errors and Pavlovian fear conditioning: role of NMDA and opioid receptors.

Authors:  Sindy Cole; Gavan P McNally
Journal:  Behav Neurosci       Date:  2007-10       Impact factor: 1.912

3.  Stimulus representation and the timing of reward-prediction errors in models of the dopamine system.

Authors:  Elliot A Ludvig; Richard S Sutton; E James Kehoe
Journal:  Neural Comput       Date:  2008-12       Impact factor: 2.026

4.  CS-US temporal relations in blocking.

Authors:  Jeffrey C Amundson; Ralph R Miller
Journal:  Learn Behav       Date:  2008-05       Impact factor: 1.986

5.  Overshadowing and stimulus duration.

Authors:  Dómhnall J Jennings; Charlotte Bonardi; Kimberly Kirkpatrick
Journal:  J Exp Psychol Anim Behav Process       Date:  2007-10

Review 6.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

7.  Classical conditioning of the rabbit's nictitating membrane response at backward, simultaneous, and forward CS-US intervals.

Authors:  M C Smith; S R Coleman; I Gormezano
Journal:  J Comp Physiol Psychol       Date:  1969-10

8.  Hippocampal "time cells" bridge the gap in memory for discontiguous events.

Authors:  Christopher J MacDonald; Kyle Q Lepage; Uri T Eden; Howard Eichenbaum
Journal:  Neuron       Date:  2011-08-25       Impact factor: 17.173

9.  Isolation of an internal clock.

Authors:  S Roberts
Journal:  J Exp Psychol Anim Behav Process       Date:  1981-07

10.  Scalar timing varies with response magnitude in classical conditioning of the nictitating membrane response of the rabbit (Oryctolagus cuniculus).

Authors:  E James Kehoe; Kirk N Olsen; Elliot A Ludvig; Richard S Sutton
Journal:  Behav Neurosci       Date:  2009-02       Impact factor: 1.912

View more
  30 in total

1.  Trial frequency effects in human temporal bisection: implications for theories of timing.

Authors:  Jeremie Jozefowiez; Cody W Polack; Armando Machado; Ralph R Miller
Journal:  Behav Processes       Date:  2013-09-09       Impact factor: 1.777

2.  Selective maintenance of value information helps resolve the exploration/exploitation dilemma.

Authors:  Michael N Hallquist; Alexandre Y Dombrovski
Journal:  Cognition       Date:  2018-11-28

3.  Bidirectional short-term plasticity during single-trial learning of cerebellar-driven eyelid movements in mice.

Authors:  Farzaneh Najafi; Javier F Medina
Journal:  Neurobiol Learn Mem       Date:  2019-10-11       Impact factor: 2.877

4.  Superior ambiguous occasion setting with visual than temporal feature stimuli.

Authors:  Andrew R Delamater; Rifka C Derman; Justin A Harris
Journal:  J Exp Psychol Anim Learn Cogn       Date:  2017-01       Impact factor: 2.478

Review 5.  Learning task-state representations.

Authors:  Yael Niv
Journal:  Nat Neurosci       Date:  2019-09-24       Impact factor: 24.884

6.  Reinforcement learning with Marr.

Authors:  Yael Niv; Angela Langdon
Journal:  Curr Opin Behav Sci       Date:  2016-10

7.  Intertrial unconditioned stimuli differentially impact trace conditioning.

Authors:  Douglas A Williams; Travis P Todd; Chrissy M Chubala; Elliot A Ludvig
Journal:  Learn Behav       Date:  2017-03       Impact factor: 1.986

Review 8.  Believing in dopamine.

Authors:  Samuel J Gershman; Naoshige Uchida
Journal:  Nat Rev Neurosci       Date:  2019-09-30       Impact factor: 34.870

9.  An adaptive drift-diffusion model of interval timing dynamics.

Authors:  Andre Luzardo; Elliot A Ludvig; François Rivest
Journal:  Behav Processes       Date:  2013-02-18       Impact factor: 1.777

Review 10.  Interactions of timing and prediction error learning.

Authors:  Kimberly Kirkpatrick
Journal:  Behav Processes       Date:  2013-08-17       Impact factor: 1.777

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.