Literature DB >> 22487034

Habits, action sequences and reinforcement learning.

Amir Dezfouli1, Bernard W Balleine.   

Abstract

It is now widely accepted that instrumental actions can be either goal-directed or habitual; whereas the former are rapidly acquired and regulated by their outcome, the latter are reflexive, elicited by antecedent stimuli rather than their consequences. Model-based reinforcement learning (RL) provides an elegant description of goal-directed action. Through exposure to states, actions and rewards, the agent rapidly constructs a model of the world and can choose an appropriate action based on quite abstract changes in environmental and evaluative demands. This model is powerful but has a problem explaining the development of habitual actions. To account for habits, theorists have argued that another action controller is required, called model-free RL, that does not form a model of the world but rather caches action values within states allowing a state to select an action based on its reward history rather than its consequences. Nevertheless, there are persistent problems with important predictions from the model; most notably the failure of model-free RL correctly to predict the insensitivity of habitual actions to changes in the action-reward contingency. Here, we suggest that introducing model-free RL in instrumental conditioning is unnecessary, and demonstrate that reconceptualizing habits as action sequences allows model-based RL to be applied to both goal-directed and habitual actions in a manner consistent with what real animals do. This approach has significant implications for the way habits are currently investigated and generates new experimental predictions.
© 2012 The Authors. European Journal of Neuroscience © 2012 Federation of European Neuroscience Societies and Blackwell Publishing Ltd.

Entities:  

Mesh:

Year:  2012        PMID: 22487034      PMCID: PMC3325518          DOI: 10.1111/j.1460-9568.2012.08050.x

Source DB:  PubMed          Journal:  Eur J Neurosci        ISSN: 0953-816X            Impact factor:   3.386


  84 in total

1.  The cognitive and neural architecture of sequence representation.

Authors:  Steven W Keele; Richard Ivry; Ulrich Mayr; Eliot Hazeltine; Herbert Heuer
Journal:  Psychol Rev       Date:  2003-04       Impact factor: 8.934

2.  Distinct basal ganglia territories are engaged in early and advanced motor sequence learning.

Authors:  Stéphane Lehéricy; Habib Benali; Pierre-François Van de Moortele; Mélanie Pélégrini-Issac; Tobias Waechter; Kamil Ugurbil; Julien Doyon
Journal:  Proc Natl Acad Sci U S A       Date:  2005-08-17       Impact factor: 11.205

Review 3.  The integrative function of the basal ganglia in instrumental conditioning.

Authors:  Bernard W Balleine; Mimi Liljeholm; Sean B Ostlund
Journal:  Behav Brain Res       Date:  2008-11-05       Impact factor: 3.332

Review 4.  A framework for studying the neurobiology of value-based decision making.

Authors:  Antonio Rangel; Colin Camerer; P Read Montague
Journal:  Nat Rev Neurosci       Date:  2008-06-11       Impact factor: 34.870

Review 5.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

6.  On the development of procedural knowledge.

Authors:  D B Willingham; M J Nissen; P Bullemer
Journal:  J Exp Psychol Learn Mem Cogn       Date:  1989-11       Impact factor: 3.051

7.  Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation.

Authors:  Alexis Faure; Ulrike Haberland; Françoise Condé; Nicole El Massioui
Journal:  J Neurosci       Date:  2005-03-16       Impact factor: 6.167

8.  A mechanistic account of striatal dopamine function in human cognition: psychopharmacological studies with cabergoline and haloperidol.

Authors:  Michael J Frank; Randall C O'Reilly
Journal:  Behav Neurosci       Date:  2006-06       Impact factor: 1.912

9.  Differential corticostriatal plasticity during fast and slow motor skill learning in mice.

Authors:  Rui M Costa; Dana Cohen; Miguel A L Nicolelis
Journal:  Curr Biol       Date:  2004-07-13       Impact factor: 10.834

Review 10.  A unified framework for addiction: vulnerabilities in the decision process.

Authors:  A David Redish; Steve Jensen; Adam Johnson
Journal:  Behav Brain Sci       Date:  2008-08       Impact factor: 21.357

View more
  91 in total

1.  Of goals and habits.

Authors:  Nathaniel D Daw
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-21       Impact factor: 11.205

2.  Habit Learning by Naive Macaques Is Marked by Response Sharpening of Striatal Neurons Representing the Cost and Outcome of Acquired Action Sequences.

Authors:  Theresa M Desrochers; Ken-ichi Amemori; Ann M Graybiel
Journal:  Neuron       Date:  2015-08-19       Impact factor: 17.173

Review 3.  Toward sophisticated basal ganglia neuromodulation: Review on basal ganglia deep brain stimulation.

Authors:  Claudio Da Cunha; Suelen L Boschen; Alexander Gómez-A; Erika K Ross; William S J Gibson; Hoon-Ki Min; Kendall H Lee; Charles D Blaha
Journal:  Neurosci Biobehav Rev       Date:  2015-02-12       Impact factor: 8.989

4.  Adaptive integration of habits into depth-limited planning defines a habitual-goal-directed spectrum.

Authors:  Mehdi Keramati; Peter Smittenaar; Raymond J Dolan; Peter Dayan
Journal:  Proc Natl Acad Sci U S A       Date:  2016-10-24       Impact factor: 11.205

5.  Habits without values.

Authors:  Kevin J Miller; Amitai Shenhav; Elliot A Ludvig
Journal:  Psychol Rev       Date:  2019-01-24       Impact factor: 8.934

Review 6.  Defining the place of habit in substance use disorders.

Authors:  Youna Vandaele; Patricia H Janak
Journal:  Prog Neuropsychopharmacol Biol Psychiatry       Date:  2017-06-27       Impact factor: 5.067

Review 7.  Rethinking avoidance: Toward a balanced approach to avoidance in treating anxiety disorders.

Authors:  Stefan G Hofmann; Aleena C Hay
Journal:  J Anxiety Disord       Date:  2018-03-09

Review 8.  Exploration versus exploitation in space, mind, and society.

Authors:  Thomas T Hills; Peter M Todd; David Lazer; A David Redish; Iain D Couzin
Journal:  Trends Cogn Sci       Date:  2014-12-03       Impact factor: 20.229

Review 9.  A fundamental role for context in instrumental learning and extinction.

Authors:  Mark E Bouton; Travis P Todd
Journal:  Behav Processes       Date:  2014-02-24       Impact factor: 1.777

Review 10.  Associative and sensorimotor cortico-basal ganglia circuit roles in effects of abused drugs.

Authors:  C M Gremel; D M Lovinger
Journal:  Genes Brain Behav       Date:  2016-08-26       Impact factor: 3.449

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.