
Representation and timing in theories of the dopamine system.

Nathaniel D Daw, Aaron C Courville, David S Touretzky.

Abstract

Although the responses of dopamine neurons in the primate midbrain are well characterized as carrying a temporal difference (TD) error signal for reward prediction, existing theories do not offer a credible account of how the brain keeps track of past sensory events that may be relevant to predicting future reward. Empirically, these shortcomings of previous theories are particularly evident in their account of experiments in which animals were exposed to variation in the timing of events. The original theories mispredicted the results of such experiments due to their use of a representational device called a tapped delay line. Here we propose that a richer understanding of history representation and a better account of these experiments can be given by considering TD algorithms for a formal setting that incorporates two features not originally considered in theories of the dopaminergic response: partial observability (a distinction between the animal's sensory experience and the true underlying state of the world) and semi-Markov dynamics (an explicit account of variation in the intervals between events). The new theory situates the dopaminergic system in a richer functional and anatomical context, since it assumes (in accord with recent computational theories of cortex) that problems of partial observability and stimulus history are solved in sensory cortex using statistical modeling and inference and that the TD system predicts reward using the results of this inference rather than raw sensory data. It also accounts for a range of experimental data, including the experiments involving programmed temporal variability and other previously unmodeled dopaminergic response phenomena, which we suggest are related to subjective noise in animals' interval timing. Finally, it offers new experimental predictions and a rich theoretical framework for designing future experiments.
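The abstract's starting point is that dopamine responses behave like a temporal-difference (TD) error for reward prediction. A minimal sketch of that idea, for readers unfamiliar with TD learning: the states, reward schedule, discount factor, and learning rate below are illustrative choices, not taken from the paper (which concerns richer partially observable, semi-Markov settings).

```python
# Minimal TD(0) sketch of the reward-prediction-error account of dopamine.
# Toy 3-step trial (cue -> delay -> reward); all parameters are illustrative.
gamma = 0.95   # discount factor (assumed for illustration)
alpha = 0.1    # learning rate (assumed for illustration)
V = [0.0, 0.0, 0.0]        # value estimates for the three trial states
rewards = [0.0, 0.0, 1.0]  # reward delivered only at the final state

for trial in range(200):
    for t in range(3):
        v_next = V[t + 1] if t + 1 < 3 else 0.0
        # TD error: the putative dopaminergic teaching signal
        delta = rewards[t] + gamma * v_next - V[t]
        V[t] += alpha * delta
```

With training, the TD error at reward delivery shrinks toward zero while value propagates back to the predictive cue, mirroring the classic shift of dopamine responses from reward to cue. The paper's contribution is to replace the raw state index `t` here with an inferred belief state under partial observability and semi-Markov timing.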


Year:  2006        PMID: 16764517     DOI: 10.1162/neco.2006.18.7.1637

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  66 in total

1.  A pallidus-habenula-dopamine pathway signals inferred stimulus values.

Authors:  Ethan S Bromberg-Martin; Masayuki Matsumoto; Simon Hong; Okihide Hikosaka
Journal:  J Neurophysiol       Date:  2010-06-10       Impact factor: 2.714

2.  A Neural Circuit Mechanism for Encoding Aversive Stimuli in the Mesolimbic Dopamine System.

Authors:  Johannes W de Jong; Seyedeh Atiyeh Afjei; Iskra Pollak Dorocic; James R Peck; Christine Liu; Christina K Kim; Lin Tian; Karl Deisseroth; Stephan Lammel
Journal:  Neuron       Date:  2018-11-29       Impact factor: 17.173

3. [Review] Decision theory, reinforcement learning, and the brain.

Authors:  Peter Dayan; Nathaniel D Daw
Journal:  Cogn Affect Behav Neurosci       Date:  2008-12       Impact factor: 3.282

4.  The Medial Prefrontal Cortex Shapes Dopamine Reward Prediction Errors under State Uncertainty.

Authors:  Clara Kwon Starkweather; Samuel J Gershman; Naoshige Uchida
Journal:  Neuron       Date:  2018-04-12       Impact factor: 17.173

5.  Two-factor theory, the actor-critic model, and conditioned avoidance.

Authors:  Tiago V Maia
Journal:  Learn Behav       Date:  2010-02       Impact factor: 1.986

6. [Review] Reinforcement learning, conditioning, and the brain: Successes and challenges.

Authors:  Tiago V Maia
Journal:  Cogn Affect Behav Neurosci       Date:  2009-12       Impact factor: 3.282

7.  Alternative time representation in dopamine models.

Authors:  François Rivest; John F Kalaska; Yoshua Bengio
Journal:  J Comput Neurosci       Date:  2009-10-22       Impact factor: 1.621

8.  Computational models of reinforcement learning: the role of dopamine as a reward signal.

Authors:  R D Samson; M J Frank; Jean-Marc Fellous
Journal:  Cogn Neurodyn       Date:  2010-03-21       Impact factor: 5.082

9.  Learning to represent reward structure: a key to adapting to complex environments.

Authors:  Hiroyuki Nakahara; Okihide Hikosaka
Journal:  Neurosci Res       Date:  2012-10-13       Impact factor: 3.304

10.  Temporal-difference reinforcement learning with distributed representations.

Authors:  Zeb Kurth-Nelson; A David Redish
Journal:  PLoS One       Date:  2009-10-20       Impact factor: 3.240

