Optimal habits can develop spontaneously through sensitivity to local cost.

Theresa M Desrochers, Dezhe Z Jin, Noah D Goodman, Ann M Graybiel.

Abstract

Habits and rituals are expressed universally across animal species. These behaviors are advantageous in allowing sequential behaviors to be performed without cognitive overload, and appear to rely on neural circuits that are relatively benign but vulnerable to takeover by extreme contexts, neuropsychiatric sequelae, and processes leading to addiction. Reinforcement learning (RL) is thought to underlie the formation of optimal habits. However, this theoretic formulation has principally been tested experimentally in simple stimulus-response tasks with relatively few available responses. We asked whether RL could also account for the emergence of habitual action sequences in realistically complex situations in which no repetitive stimulus-response links were present and in which many response options were present. We exposed naïve macaque monkeys to such experimental conditions by introducing a unique free saccade scan task. Despite the highly uncertain conditions and no instruction, the monkeys developed a succession of stereotypical, self-chosen saccade sequence patterns. Remarkably, these continued to morph for months, long after session-averaged reward and cost (eye movement distance) reached asymptote. Prima facie, these continued behavioral changes appeared to challenge RL. However, trial-by-trial analysis showed that pattern changes on adjacent trials were predicted by lowered cost, and RL simulations that reduced the cost reproduced the monkeys' behavior. Ultimately, the patterns settled into stereotypical saccade sequences that minimized the cost of obtaining the reward on average. These findings suggest that brain mechanisms underlying the emergence of habits, and perhaps unwanted repetitive behaviors in clinical disorders, could follow RL algorithms capturing extremely local explore/exploit tradeoffs.
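The idea that cost-sensitive RL can shift behavior toward a minimum-cost scan pattern can be illustrated with a toy sketch. This is not the authors' simulation: the 2x2 target layout, the fixed reward, the epsilon-greedy rule, and every name below (`TARGETS`, `path_cost`, `train`) are assumptions made for illustration only. Each candidate "pattern" is one order of visiting four targets in a closed loop, and its cost is the total eye movement distance.

```python
import itertools
import math
import random

# Hypothetical 2x2 target layout (an assumption, not the layout used in the study).
TARGETS = [(0, 0), (1, 0), (0, 1), (1, 1)]


def path_cost(order):
    """Total Euclidean distance of a closed scan visiting the targets in this order."""
    pts = [TARGETS[i] for i in order]
    return sum(math.dist(pts[i], pts[(i + 1) % len(pts)]) for i in range(len(pts)))


def train(trials=20000, alpha=0.1, epsilon=0.1, reward=10.0, seed=0):
    """Epsilon-greedy value learning over whole scan patterns.

    The payoff of a trial is a fixed reward minus the movement cost, so the
    only thing that separates patterns is how far the eyes must travel.
    """
    rng = random.Random(seed)
    patterns = list(itertools.permutations(range(len(TARGETS))))
    q = {p: 0.0 for p in patterns}  # learned value of each pattern
    for _ in range(trials):
        if rng.random() < epsilon:
            p = rng.choice(patterns)       # explore: try a random pattern
        else:
            p = max(patterns, key=q.get)   # exploit: repeat the best pattern so far
        payoff = reward - path_cost(p)
        q[p] += alpha * (payoff - q[p])    # incremental value update
    return q


q = train()
best = max(q, key=q.get)
print("preferred pattern:", best, "cost:", path_cost(best))
```

In this sketch the agent first locks onto an arbitrary pattern that crosses the diagonals, then drifts to a loop around the perimeter once exploration has sampled a cheaper pattern often enough, which loosely mirrors the abstract's description of patterns that keep changing before settling on a low-cost sequence.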

Year: 2010 PMID: 20974967 PMCID: PMC2996716 DOI: 10.1073/pnas.1013470107

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205

14 in total

1. A model of hippocampally dependent navigation, using the temporal difference learning rule.

Authors: D J Foster; R G Morris; P Dayan
Journal: Hippocampus Date: 2000 Impact factor: 3.899

2. Learning the parts of objects by non-negative matrix factorization.

Authors: D D Lee; H S Seung
Journal: Nature Date: 1999-10-21 Impact factor: 49.962

3. THE MECHANICS OF HUMAN SACCADIC EYE MOVEMENT.

Authors: D A ROBINSON
Journal: J Physiol Date: 1964-11 Impact factor: 5.182

4. Prefrontal cortex and decision making in a mixed-strategy game.

Authors: Dominic J Barraclough; Michelle L Conroy; Daeyeol Lee
Journal: Nat Neurosci Date: 2004-03-07 Impact factor: 24.884

5. Representation of action-specific reward values in the striatum.

Authors: Kazuyuki Samejima; Yasumasa Ueda; Kenji Doya; Minoru Kimura
Journal: Science Date: 2005-11-25 Impact factor: 47.728

6. Midbrain dopamine neurons encode a quantitative reward prediction error signal.

Authors: Hannah M Bayer; Paul W Glimcher
Journal: Neuron Date: 2005-07-07 Impact factor: 17.173

7. A neural substrate of prediction and reward. (Review)

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

8. Cortical substrates for exploratory decisions in humans.

Authors: Nathaniel D Daw; John P O'Doherty; Peter Dayan; Ben Seymour; Raymond J Dolan
Journal: Nature Date: 2006-06-15 Impact factor: 49.962

9. Bee foraging in uncertain environments using predictive hebbian learning.

Authors: P R Montague; P Dayan; C Person; T J Sejnowski
Journal: Nature Date: 1995-10-26 Impact factor: 49.962

10. The psychology of perserverative and stereotyped behaviour. (Review)

Authors: R M Ridley
Journal: Prog Neurobiol Date: 1994-10 Impact factor: 11.685

16 in total

1. Neuronal activity in the primate dorsomedial prefrontal cortex contributes to strategic selection of response tactics.

Authors: Yoshiya Matsuzaka; Tetsuya Akiyama; Jun Tanji; Hajime Mushiake
Journal: Proc Natl Acad Sci U S A Date: 2012-02-27 Impact factor: 11.205

2. Learning optimal strategies in complex environments.

Authors: Terrence J Sejnowski
Journal: Proc Natl Acad Sci U S A Date: 2010-11-15 Impact factor: 11.205

3. The striatum: where skills and habits meet. (Review)

Authors: Ann M Graybiel; Scott T Grafton
Journal: Cold Spring Harb Perspect Biol Date: 2015-08-03 Impact factor: 10.005

4. Habit Learning by Naive Macaques Is Marked by Response Sharpening of Striatal Neurons Representing the Cost and Outcome of Acquired Action Sequences.

Authors: Theresa M Desrochers; Ken-ichi Amemori; Ann M Graybiel
Journal: Neuron Date: 2015-08-19 Impact factor: 17.173

5. Sensory integration, sensory processing, and sensory modulation disorders: putative functional neuroanatomic underpinnings. (Review)

Authors: Leonard F Koziol; Deborah Ely Budding; Dana Chidekel
Journal: Cerebellum Date: 2011-12 Impact factor: 3.847

6. Neuronal Activity in the Posterior Cingulate Cortex Signals Environmental Information and Predicts Behavioral Variability during Trapline Foraging.

Authors: David L Barack; Michael L Platt
Journal: J Neurosci Date: 2021-02-03 Impact factor: 6.167