
Optimal habits can develop spontaneously through sensitivity to local cost.

Theresa M Desrochers, Dezhe Z Jin, Noah D Goodman, Ann M Graybiel

Abstract

Habits and rituals are expressed universally across animal species. These behaviors are advantageous in allowing sequential behaviors to be performed without cognitive overload, and appear to rely on neural circuits that are relatively benign but vulnerable to takeover by extreme contexts, neuropsychiatric sequelae, and processes leading to addiction. Reinforcement learning (RL) is thought to underlie the formation of optimal habits. However, this theoretic formulation has principally been tested experimentally in simple stimulus-response tasks with relatively few available responses. We asked whether RL could also account for the emergence of habitual action sequences in realistically complex situations in which no repetitive stimulus-response links were present and in which many response options were present. We exposed naïve macaque monkeys to such experimental conditions by introducing a unique free saccade scan task. Despite the highly uncertain conditions and no instruction, the monkeys developed a succession of stereotypical, self-chosen saccade sequence patterns. Remarkably, these continued to morph for months, long after session-averaged reward and cost (eye movement distance) reached asymptote. Prima facie, these continued behavioral changes appeared to challenge RL. However, trial-by-trial analysis showed that pattern changes on adjacent trials were predicted by lowered cost, and RL simulations that reduced the cost reproduced the monkeys' behavior. Ultimately, the patterns settled into stereotypical saccade sequences that minimized the cost of obtaining the reward on average. These findings suggest that brain mechanisms underlying the emergence of habits, and perhaps unwanted repetitive behaviors in clinical disorders, could follow RL algorithms capturing extremely local explore/exploit tradeoffs.
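The idea that a learner driven only by local cost can settle into a low-cost stereotyped sequence can be illustrated with a toy simulation. The sketch below is not the authors' model. It assumes a hypothetical 2x2 grid of targets, a softmax choice rule, and a simple update that moves each transition's value toward the negative of its own saccade distance. All names and parameters (TARGETS, temperature, alpha, the trial count) are illustrative assumptions.

```python
import math
import random

# Hypothetical 2x2 grid of saccade targets (an assumption, not the study's layout).
TARGETS = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]


def distance(i, j):
    """Euclidean distance between targets i and j: the local movement cost."""
    return math.dist(TARGETS[i], TARGETS[j])


def choose_next(q, current, unvisited, temperature, rng):
    """Softmax choice among unvisited targets; lower temperature exploits more."""
    weights = [math.exp(q[current][j] / temperature) for j in unvisited]
    return rng.choices(unvisited, weights=weights, k=1)[0]


def run_trial(q, start, temperature, alpha, rng):
    """Scan every target once, updating each transition from its own cost."""
    current, total = start, 0.0
    unvisited = [j for j in range(len(TARGETS)) if j != start]
    while unvisited:
        nxt = choose_next(q, current, unvisited, temperature, rng)
        cost = distance(current, nxt)
        # Local update: move this transition's value toward its negative cost.
        q[current][nxt] += alpha * (-cost - q[current][nxt])
        total += cost
        unvisited.remove(nxt)
        current = nxt
    return total


def greedy_path(q, start):
    """Follow the highest-valued transition at every step (no exploration)."""
    path = [start]
    unvisited = [j for j in range(len(TARGETS)) if j != start]
    while unvisited:
        nxt = max(unvisited, key=lambda j: q[path[-1]][j])
        unvisited.remove(nxt)
        path.append(nxt)
    return path


def path_cost(path):
    """Total distance travelled along a sequence of target indices."""
    return sum(distance(a, b) for a, b in zip(path, path[1:]))


def train(n_trials=2000, temperature=0.3, alpha=0.2, seed=0):
    """Run many trials from random start targets and return the learned values."""
    rng = random.Random(seed)
    n = len(TARGETS)
    q = [[0.0] * n for _ in range(n)]
    for _ in range(n_trials):
        run_trial(q, rng.randrange(n), temperature, alpha, rng)
    return q


if __name__ == "__main__":
    q = train()
    path = greedy_path(q, start=0)
    print(path, path_cost(path))
```

In this toy setting the learned greedy sequence avoids the diagonal saccades and visits the four targets along three unit-length steps, which is the shortest possible scan of the grid. The real task had uncertain reward timing and larger response spaces, so this only conveys the flavor of cost-sensitive sequence learning.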

Year: 2010 | PMID: 20974967 | PMCID: PMC2996716 | DOI: 10.1073/pnas.1013470107

Source DB: PubMed | Journal: Proc Natl Acad Sci U S A | ISSN: 0027-8424 | Impact factor: 11.205

