Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Evaluating the TD model of classical conditioning.

Literature DB >> 22927003

Evaluating the TD model of classical conditioning.

Elliot A Ludvig¹, Richard S Sutton, E James Kehoe.

Abstract

The temporal-difference (TD) algorithm from reinforcement learning provides a simple method for incrementally learning predictions of upcoming events. Applied to classical conditioning, TD models suppose that animals learn a real-time prediction of the unconditioned stimulus (US) on the basis of all available conditioned stimuli (CSs). In the TD model, similar to other error-correction models, learning is driven by prediction errors--the difference between the change in US prediction and the actual US. With the TD model, however, learning occurs continuously from moment to moment and is not artificially constrained to occur in trials. Accordingly, a key feature of any TD model is the assumption about the representation of a CS on a moment-to-moment basis. Here, we evaluate the performance of the TD model with a heretofore unexplored range of classical conditioning tasks. To do so, we consider three stimulus representations that vary in their degree of temporal generalization and evaluate how the representation influences the performance of the TD model on these conditioning tasks.

Entities: Chemical

Mesh：

Year: 2012 PMID： 22927003 DOI： 10.3758/s13420-012-0082-6

Source DB: PubMed Journal: Learn Behav ISSN： 1543-4494 Impact factor: 1.986

36 in total

1. Extinction revisited: similarities between extinction and reductions in US intensity in classical conditioning of the rabbit's nictitating membrane response.

Authors: E James Kehoe; Natasha E White
Journal: Anim Learn Behav Date: 2002-05

2. Temporal-difference prediction errors and Pavlovian fear conditioning: role of NMDA and opioid receptors.

Authors: Sindy Cole; Gavan P McNally
Journal: Behav Neurosci Date: 2007-10 Impact factor: 1.912

3. Stimulus representation and the timing of reward-prediction errors in models of the dopamine system.

Authors: Elliot A Ludvig; Richard S Sutton; E James Kehoe
Journal: Neural Comput Date: 2008-12 Impact factor: 2.026

4. CS-US temporal relations in blocking.

Authors: Jeffrey C Amundson; Ralph R Miller
Journal: Learn Behav Date: 2008-05 Impact factor: 1.986

5. Overshadowing and stimulus duration.

Authors: Dómhnall J Jennings; Charlotte Bonardi; Kimberly Kirkpatrick
Journal: J Exp Psychol Anim Behav Process Date: 2007-10

Review 6. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

7. Classical conditioning of the rabbit's nictitating membrane response at backward, simultaneous, and forward CS-US intervals.

Authors: M C Smith; S R Coleman; I Gormezano
Journal: J Comp Physiol Psychol Date: 1969-10

8. Hippocampal "time cells" bridge the gap in memory for discontiguous events.

Authors: Christopher J MacDonald; Kyle Q Lepage; Uri T Eden; Howard Eichenbaum
Journal: Neuron Date: 2011-08-25 Impact factor: 17.173

9. Isolation of an internal clock.

Authors: S Roberts
Journal: J Exp Psychol Anim Behav Process Date: 1981-07

10. Scalar timing varies with response magnitude in classical conditioning of the nictitating membrane response of the rabbit (Oryctolagus cuniculus).

Authors: E James Kehoe; Kirk N Olsen; Elliot A Ludvig; Richard S Sutton
Journal: Behav Neurosci Date: 2009-02 Impact factor: 1.912

30 in total

1. Trial frequency effects in human temporal bisection: implications for theories of timing.

Authors: Jeremie Jozefowiez; Cody W Polack; Armando Machado; Ralph R Miller
Journal: Behav Processes Date: 2013-09-09 Impact factor: 1.777

2. Selective maintenance of value information helps resolve the exploration/exploitation dilemma.

Authors: Michael N Hallquist; Alexandre Y Dombrovski
Journal: Cognition Date: 2018-11-28

3. Bidirectional short-term plasticity during single-trial learning of cerebellar-driven eyelid movements in mice.

Authors: Farzaneh Najafi; Javier F Medina
Journal: Neurobiol Learn Mem Date: 2019-10-11 Impact factor: 2.877

4. Superior ambiguous occasion setting with visual than temporal feature stimuli.

Authors: Andrew R Delamater; Rifka C Derman; Justin A Harris
Journal: J Exp Psychol Anim Learn Cogn Date: 2017-01 Impact factor: 2.478

Review 5. Learning task-state representations.

Authors: Yael Niv
Journal: Nat Neurosci Date: 2019-09-24 Impact factor: 24.884

6. Reinforcement learning with Marr.

Authors: Yael Niv; Angela Langdon
Journal: Curr Opin Behav Sci Date: 2016-10

7. Intertrial unconditioned stimuli differentially impact trace conditioning.

Authors: Douglas A Williams; Travis P Todd; Chrissy M Chubala; Elliot A Ludvig
Journal: Learn Behav Date: 2017-03 Impact factor: 1.986

Review 8. Believing in dopamine.

Authors: Samuel J Gershman; Naoshige Uchida
Journal: Nat Rev Neurosci Date: 2019-09-30 Impact factor: 34.870

9. An adaptive drift-diffusion model of interval timing dynamics.

Authors: Andre Luzardo; Elliot A Ludvig; François Rivest
Journal: Behav Processes Date: 2013-02-18 Impact factor: 1.777

Review 10. Interactions of timing and prediction error learning.

Authors: Kimberly Kirkpatrick
Journal: Behav Processes Date: 2013-08-17 Impact factor: 1.777