| Literature DB >> 35390278 |
Jeremy Hogeveen1, Teagan S Mullins2, John D Romero2, Elizabeth Eversole2, Kimberly Rogge-Obando3, Andrew R Mayer4, Vincent D Costa5.
Abstract
Flexible decision-making requires animals to forego immediate rewards (exploitation) and try novel choice options (exploration) to discover if they are preferable to familiar alternatives. Using the same task and a partially observable Markov decision process (POMDP) model to quantify the value of choices, we first determined that the computational basis for managing explore-exploit tradeoffs is conserved across monkeys and humans. We then used fMRI to identify where in the human brain the immediate value of exploitative choices and relative uncertainty about the value of exploratory choices were encoded. Consistent with prior neurophysiological evidence in monkeys, we observed divergent encoding of reward value and uncertainty in prefrontal and parietal regions, including frontopolar cortex, and parallel encoding of these computations in motivational regions including the amygdala, ventral striatum, and orbitofrontal cortex. These results clarify the interplay between prefrontal and motivational circuits that supports adaptive explore-exploit decisions in humans and nonhuman primates.Entities:
Keywords: amygdala; computational modeling; decision-making; exploration; explore-exploit dilemma; fMRI; frontopolar cortex; reinforcement learning; reward; striatum
Mesh:
Year: 2022 PMID: 35390278 PMCID: PMC9167768 DOI: 10.1016/j.neuron.2022.03.014
Source DB: PubMed Journal: Neuron ISSN: 0896-6273 Impact factor: 18.688