Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The emergence of saliency and novelty responses from Reinforcement Learning principles.

Literature DB >> 18938058

The emergence of saliency and novelty responses from Reinforcement Learning principles.

Abstract

Recent attempts to map reward-based learning models, like Reinforcement Learning [Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An introduction. Cambridge, MA: MIT Press], to the brain are based on the observation that phasic increases and decreases in the spiking of dopamine-releasing neurons signal differences between predicted and received reward [Gillies, A., & Arbuthnott, G. (2000). Computational models of the basal ganglia. Movement Disorders, 15(5), 762-770; Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80(1), 1-27]. However, this reward-prediction error is only one of several signals communicated by that phasic activity; another involves an increase in dopaminergic spiking, reflecting the appearance of salient but unpredicted non-reward stimuli [Doya, K. (2002). Metalearning and neuromodulation. Neural Networks, 15(4-6), 495-506; Horvitz, J. C. (2000). Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience, 96(4), 651-656; Redgrave, P., & Gurney, K. (2006). The short-latency dopamine signal: A role in discovering novel actions? Nature Reviews Neuroscience, 7(12), 967-975], especially when an organism subsequently orients towards the stimulus [Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80(1), 1-27]. To explain these findings, Kakade and Dayan [Kakade, S., & Dayan, P. (2002). Dopamine: Generalization and bonuses. Neural Networks, 15(4-6), 549-559.] and others have posited that novel, unexpected stimuli are intrinsically rewarding. The simulation reported in this article demonstrates that this assumption is not necessary because the effect it is intended to capture emerges from the reward-prediction learning mechanisms of Reinforcement Learning. Thus, Reinforcement Learning principles can be used to understand not just reward-related activity of the dopaminergic neurons of the basal ganglia, but also some of their apparently non-reward-related activity.

Entities: Chemical Disease Species

Mesh：

Year: 2008 PMID： 18938058 PMCID： PMC2629355 DOI： 10.1016/j.neunet.2008.09.004

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

16 in total

Review 1. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events.

Authors: J C Horvitz
Journal: Neuroscience Date: 2000 Impact factor: 3.590

Review 2. Is the short-latency dopamine response too short to signal reward error?

Authors: P Redgrave; T J Prescott; K Gurney
Journal: Trends Neurosci Date: 1999-04 Impact factor: 13.837

Review 3. Metalearning and neuromodulation.

Authors: Kenji Doya
Journal: Neural Netw Date: 2002 Jun-Jul

4. Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops.

Authors: Saori C Tanaka; Kenji Doya; Go Okada; Kazutaka Ueda; Yasumasa Okamoto; Shigeto Yamawaki
Journal: Nat Neurosci Date: 2004-07-04 Impact factor: 24.884

5. How visual stimuli activate dopaminergic neurons at short latency.

Authors: Eleanor Dommett; Véronique Coizet; Charles D Blaha; John Martindale; Véronique Lefebvre; Natalie Walton; John E W Mayhew; Paul G Overton; Peter Redgrave
Journal: Science Date: 2005-03-04 Impact factor: 47.728

Review 6. The short-latency dopamine signal: a role in discovering novel actions?

Authors: Peter Redgrave; Kevin Gurney
Journal: Nat Rev Neurosci Date: 2006-11-08 Impact factor: 34.870

7. Absolute coding of stimulus novelty in the human substantia nigra/VTA.

Authors: Nico Bunzeck; Emrah Düzel
Journal: Neuron Date: 2006-08-03 Impact factor: 17.173

Review 8. Dopamine: generalization and bonuses.

Authors: Sham Kakade; Peter Dayan
Journal: Neural Netw Date: 2002 Jun-Jul

9. Temporal prediction errors in a passive learning task activate human striatum.

Authors: Samuel M McClure; Gregory S Berns; P Read Montague
Journal: Neuron Date: 2003-04-24 Impact factor: 17.173

Review 10. Predictive reward signal of dopamine neurons.

Authors: W Schultz
Journal: J Neurophysiol Date: 1998-07 Impact factor: 2.714

9 in total

Review 1. A value-driven mechanism of attentional selection.

Authors: Brian A Anderson
Journal: J Vis Date: 2013-04-15 Impact factor: 2.240

2. Novelty enhances visual salience independently of reward in the parietal lobe.

Authors: Nicholas C Foley; David C Jangraw; Christopher Peck; Jacqueline Gottlieb
Journal: J Neurosci Date: 2014-06-04 Impact factor: 6.167

3. Valuable Orientations Capture Attention.

Authors: Patryk A Laurent; Michelle G Hall; Brian A Anderson; Steven Yantis
Journal: Vis cogn Date: 2015-01-01

4. Eye movements and imitation learning: intentional disruption of expectation.

Authors: Jessica Maryott; Abigail Noyce; Robert Sekuler
Journal: J Vis Date: 2011-01-06 Impact factor: 2.240

5. Computational perspectives on forebrain microcircuits implicated in reinforcement learning, action selection, and cognitive control.

Authors: Daniel Bullock; Can Ozan Tan; Yohan J John
Journal: Neural Netw Date: 2009-06-30

Review 6. Interoception, Trait Anxiety, and the Gut Microbiome: A Cognitive and Physiological Model.

Authors: Pascal Büttiker; Simon Weissenberger; Radek Ptacek; George B Stefano
Journal: Med Sci Monit Date: 2021-05-04

7. Learned value magnifies salience-based attentional capture.

Authors: Brian A Anderson; Patryk A Laurent; Steven Yantis
Journal: PLoS One Date: 2011-11-21 Impact factor: 3.240

Review 8. Applications of Artificial Intelligence Based on Medical Imaging in Glioma: Current State and Future Challenges.

Authors: Jiaona Xu; Yuting Meng; Kefan Qiu; Win Topatana; Shijie Li; Chao Wei; Tianwen Chen; Mingyu Chen; Zhongxiang Ding; Guozhong Niu
Journal: Front Oncol Date: 2022-07-27 Impact factor: 5.738

9. Climbing fibers encode a temporal-difference prediction error during cerebellar learning in mice.

Authors: Shogo Ohmae; Javier F Medina
Journal: Nat Neurosci Date: 2015-11-09 Impact factor: 24.884

9 in total