Literature DB >> 32060169

Primate Orbitofrontal Cortex Codes Information Relevant for Managing Explore-Exploit Tradeoffs.

Vincent D Costa1, Bruno B Averbeck2.   

Abstract

Reinforcement learning (RL) refers to the behavioral process of learning to obtain reward and avoid punishment. An important component of RL is managing explore-exploit tradeoffs, which refers to the problem of choosing between exploiting options with known values and exploring unfamiliar options. We examined correlates of this tradeoff, as well as other RL related variables, in orbitofrontal cortex (OFC) while three male monkeys performed a three-armed bandit learning task. During the task, novel choice options periodically replaced familiar options. The values of the novel options were unknown, and the monkeys had to explore them to see if they were better than other currently available options. The identity of the chosen stimulus and the reward outcome were strongly encoded in the responses of single OFC neurons. These two variables define the states and state transitions in our model that are relevant to decision-making. The chosen value of the option and the relative value of exploring that option were encoded at intermediate levels. We also found that OFC value coding was stimulus specific, as opposed to coding value independent of the identity of the option. The location of the option and the value of the current environment were encoded at low levels. Therefore, we found encoding of the variables relevant to learning and managing explore-exploit tradeoffs in OFC. These results are consistent with findings in the ventral striatum and amygdala and show that this monosynaptically connected network plays an important role in learning based on the immediate and future consequences of choices.SIGNIFICANCE STATEMENT Orbitofrontal cortex (OFC) has been implicated in representing the expected values of choices. Here we extend these results and show that OFC also encodes information relevant to managing explore-exploit tradeoffs. Specifically, OFC encodes an exploration bonus, which characterizes the relative value of exploring novel choice options. OFC also strongly encodes the identity of the chosen stimulus, and reward outcomes, which are necessary for computing the value of novel and familiar options.
Copyright © 2020 the authors.

Keywords:  decision-making; explore–exploit; monkey; orbitofrontal cortex; reinforcement learning

Mesh:

Year:  2020        PMID: 32060169      PMCID: PMC7083541          DOI: 10.1523/JNEUROSCI.2355-19.2020

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  52 in total

1.  Measures of Effect Size for Comparative Studies: Applications, Interpretations, and Limitations.

Authors: 
Journal:  Contemp Educ Psychol       Date:  2000-07

Review 2.  The orbitofrontal cortex and the computation of subjective value: consolidated concepts and new perspectives.

Authors:  Camillo Padoa-Schioppa; Xinying Cai
Journal:  Ann N Y Acad Sci       Date:  2011-12       Impact factor: 5.691

3.  Connectional networks within the orbital and medial prefrontal cortex of macaque monkeys.

Authors:  S T Carmichael; J L Price
Journal:  J Comp Neurol       Date:  1996-07-22       Impact factor: 3.215

Review 4.  Parallel organization of functionally segregated circuits linking basal ganglia and cortex.

Authors:  G E Alexander; M R DeLong; P L Strick
Journal:  Annu Rev Neurosci       Date:  1986       Impact factor: 12.449

5.  Amygdala Contributions to Stimulus-Reward Encoding in the Macaque Medial and Orbital Frontal Cortex during Learning.

Authors:  Peter H Rudebeck; Joshua A Ripple; Andrew R Mitz; Bruno B Averbeck; Elisabeth A Murray
Journal:  J Neurosci       Date:  2017-01-25       Impact factor: 6.167

6.  Humans use directed and random exploration to solve the explore-exploit dilemma.

Authors:  Robert C Wilson; Andra Geana; John M White; Elliot A Ludvig; Jonathan D Cohen
Journal:  J Exp Psychol Gen       Date:  2014-10-27

7.  The Bilateral Prefronto-striatal Pathway Is Necessary for Learning New Goal-Directed Actions.

Authors:  Genevra Hart; Laura A Bradfield; Sandra Y Fok; Billy Chieng; Bernard W Balleine
Journal:  Curr Biol       Date:  2018-06-28       Impact factor: 10.834

8.  A flexible software tool for temporally-precise behavioral control in Matlab.

Authors:  Wael F Asaad; Emad N Eskandar
Journal:  J Neurosci Methods       Date:  2008-07-25       Impact factor: 2.390

9.  Dopamine modulates novelty seeking behavior during decision making.

Authors:  Vincent D Costa; Valery L Tran; Janita Turchi; Bruno B Averbeck
Journal:  Behav Neurosci       Date:  2014-06-09       Impact factor: 1.912

10.  A causal role for right frontopolar cortex in directed, but not random, exploration.

Authors:  Wojciech K Zajkowski; Malgorzata Kossut; Robert C Wilson
Journal:  Elife       Date:  2017-09-15       Impact factor: 8.140

View more
  20 in total

1.  Reinforcement Learning during Adolescence in Rats.

Authors:  Neema Moin Afshar; Alex J Keip; Jane R Taylor; Daeyeol Lee; Stephanie M Groman
Journal:  J Neurosci       Date:  2020-06-29       Impact factor: 6.167

2.  Correlates of Auditory Decision-Making in Prefrontal, Auditory, and Basal Lateral Amygdala Cortical Areas.

Authors:  Julia L Napoli; Corrie R Camalier; Anna-Leigh Brown; Jessica Jacobs; Mortimer M Mishkin; Bruno B Averbeck
Journal:  J Neurosci       Date:  2020-12-10       Impact factor: 6.167

3.  Rules warp feature encoding in decision-making circuits.

Authors:  R Becket Ebitz; Jiaxin Cindy Tu; Benjamin Y Hayden
Journal:  PLoS Biol       Date:  2020-11-30       Impact factor: 8.029

4.  Organization of parietoprefrontal and temporoprefrontal networks in the macaque.

Authors:  Franco Giarrocco; Bruno B Averbeck
Journal:  J Neurophysiol       Date:  2021-08-11       Impact factor: 2.714

Review 5.  Hypothalamic Interactions with Large-Scale Neural Circuits Underlying Reinforcement Learning and Motivated Behavior.

Authors:  Bruno B Averbeck; Elisabeth A Murray
Journal:  Trends Neurosci       Date:  2020-08-03       Impact factor: 13.837

6.  Balancing exploration and exploitation with information and randomization.

Authors:  Robert C Wilson; Elizabeth Bonawitz; Vincent D Costa; R Becket Ebitz
Journal:  Curr Opin Behav Sci       Date:  2020-11-06

Review 7.  Interactions between ventrolateral prefrontal and anterior cingulate cortex during learning and behavioural change.

Authors:  Ilya E Monosov; Matthew F S Rushworth
Journal:  Neuropsychopharmacology       Date:  2021-07-07       Impact factor: 7.853

8.  The whole prefrontal cortex is premotor cortex.

Authors:  Justin M Fine; Benjamin Y Hayden
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-12-27       Impact factor: 6.237

9.  Spatial Representations in Rat Orbitofrontal Cortex.

Authors:  Andrew M Wikenheiser; Matthew P H Gardner; Lauren E Mueller; Geoffrey Schoenbaum
Journal:  J Neurosci       Date:  2021-07-01       Impact factor: 6.167

Review 10.  Reinforcement-learning in fronto-striatal circuits.

Authors:  Bruno Averbeck; John P O'Doherty
Journal:  Neuropsychopharmacology       Date:  2021-08-05       Impact factor: 7.853

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.