Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Goal-Directed and Habit-Like Modulations of Stimulus Processing during Reinforcement Learning.

Literature DB >> 28193692

Goal-Directed and Habit-Like Modulations of Stimulus Processing during Reinforcement Learning.

David Luque¹, Tom Beesley², Richard W Morris², Bradley N Jack², Oren Griffiths², Thomas J Whitford², Mike E Le Pelley².

Abstract

Recent research has shown that perceptual processing of stimuli previously associated with high-value rewards is automatically prioritized even when rewards are no longer available. It has been hypothesized that such reward-related modulation of stimulus salience is conceptually similar to an "attentional habit." Recording event-related potentials in humans during a reinforcement learning task, we show strong evidence in favor of this hypothesis. Resistance to outcome devaluation (the defining feature of a habit) was shown by the stimulus-locked P1 component, reflecting activity in the extrastriate visual cortex. Analysis at longer latencies revealed a positive component (corresponding to the P3b, from 550-700 ms) sensitive to outcome devaluation. Therefore, distinct spatiotemporal patterns of brain activity were observed corresponding to habitual and goal-directed processes. These results demonstrate that reinforcement learning engages both attentional habits and goal-directed processes in parallel. Consequences for brain and computational models of reinforcement learning are discussed.SIGNIFICANCE STATEMENT The human attentional network adapts to detect stimuli that predict important rewards. A recent hypothesis suggests that the visual cortex automatically prioritizes reward-related stimuli, driven by cached representations of reward value; that is, stimulus-response habits. Alternatively, the neural system may track the current value of the predicted outcome. Our results demonstrate for the first time that visual cortex activity is increased for reward-related stimuli even when the rewarding event is temporarily devalued. In contrast, longer-latency brain activity was specifically sensitive to transient changes in reward value. Therefore, we show that both habit-like attention and goal-directed processes occur in the same learning episode at different latencies. This result has important consequences for computational models of reinforcement learning.

Entities: Species

Keywords: attention; event-related potentials; goal-directed; habit; learning; reward

Mesh：

Year: 2017 PMID： 28193692 PMCID： PMC6596732 DOI： 10.1523/JNEUROSCI.3205-16.2017

Source DB: PubMed Journal: J Neurosci ISSN： 0270-6474 Impact factor: 6.167

Keyword Cloud
Cited

7 in total

1. Stress diminishes outcome but enhances response representations during instrumental learning.

Authors: Jacqueline Katharina Meier; Bernhard P Staresina; Lars Schwabe
Journal: Elife Date: 2022-07-18 Impact factor: 8.713

2. Previously Reward-Associated Stimuli Capture Spatial Attention in the Absence of Changes in the Corresponding Sensory Representations as Measured with MEG.

Authors: Lev Tankelevitch; Eelke Spaak; Matthew F S Rushworth; Mark G Stokes
Journal: J Neurosci Date: 2020-05-04 Impact factor: 6.167

3. Learned predictiveness acquired through experience prevails over the influence of conflicting verbal instructions in rapid selective attention.

Authors: Pedro L Cobos; Miguel A Vadillo; David Luque; Mike E Le Pelley
Journal: PLoS One Date: 2018-09-14 Impact factor: 3.240

Goal-Directed and Habit-Like Modulations of Stimulus Processing during Reinforcement Learning.

1. Stress diminishes outcome but enhances response representations during instrumental learning.

2. Previously Reward-Associated Stimuli Capture Spatial Attention in the Absence of Changes in the Corresponding Sensory Representations as Measured with MEG.

3. Learned predictiveness acquired through experience prevails over the influence of conflicting verbal instructions in rapid selective attention.

4. Cross-Modal Integration of Reward Value during Oculomotor Planning.

5. The role of prediction in learned predictiveness.

6. Dopaminergic Modulation of Goal-Directed Behavior in a Rodent Model of Attention-Deficit/Hyperactivity Disorder.

7. The influence of associative reward learning on motor inhibition.