Literature DB >> 23834192

Navigating complex decision spaces: Problems and paradigms in sequential choice.

Matthew M Walsh1, John R Anderson2.   

Abstract

To behave adaptively, we must learn from the consequences of our actions. Doing so is difficult when the consequences of an action follow a delay. This introduces the problem of temporal credit assignment. When feedback follows a sequence of decisions, how should the individual assign credit to the intermediate actions that comprise the sequence? Research in reinforcement learning provides 2 general solutions to this problem: model-free reinforcement learning and model-based reinforcement learning. In this review, we examine connections between stimulus-response and cognitive learning theories, habitual and goal-directed control, and model-free and model-based reinforcement learning. We then consider a range of problems related to temporal credit assignment. These include second-order conditioning and secondary reinforcers, latent learning and detour behavior, partially observable Markov decision processes, actions with distributed outcomes, and hierarchical learning. We ask whether humans and animals, when faced with these problems, behave in a manner consistent with reinforcement learning techniques. Throughout, we seek to identify neural substrates of model-free and model-based reinforcement learning. The former class of techniques is understood in terms of the neurotransmitter dopamine and its effects in the basal ganglia. The latter is understood in terms of a distributed network of regions including the prefrontal cortex, medial temporal lobes, cerebellum, and basal ganglia. Not only do reinforcement learning techniques have a natural interpretation in terms of human and animal behavior but they also provide a useful framework for understanding neural reward valuation and action selection.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23834192      PMCID: PMC4309984          DOI: 10.1037/a0033455

Source DB:  PubMed          Journal:  Psychol Bull        ISSN: 0033-2909            Impact factor:   17.737


  157 in total

1.  Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning.

Authors:  Sean B Ostlund; Bernard W Balleine
Journal:  J Neurosci       Date:  2005-08-24       Impact factor: 6.167

2.  A comparison of abstract rules in the prefrontal cortex, premotor cortex, inferior temporal cortex, and striatum.

Authors:  Rahmat Muhammad; Jonathan D Wallis; Earl K Miller
Journal:  J Cogn Neurosci       Date:  2006-06       Impact factor: 3.225

3.  Solving the credit assignment problem: explicit and implicit learning of action sequences with probabilistic outcomes.

Authors:  Wai-Tat Fu; John R Anderson
Journal:  Psychol Res       Date:  2007-04-20

Review 4.  A framework for studying the neurobiology of value-based decision making.

Authors:  Antonio Rangel; Colin Camerer; P Read Montague
Journal:  Nat Rev Neurosci       Date:  2008-06-11       Impact factor: 34.870

5.  Serotonin affects association of aversive outcomes to past actions.

Authors:  Saori C Tanaka; Kazuhiro Shishida; Nicolas Schweighofer; Yasumasa Okamoto; Shigeto Yamawaki; Kenji Doya
Journal:  J Neurosci       Date:  2009-12-16       Impact factor: 6.167

6.  Anterior cingulate cortex, error detection, and the online monitoring of performance.

Authors:  C S Carter; T S Braver; D M Barch; M M Botvinick; D Noll; J D Cohen
Journal:  Science       Date:  1998-05-01       Impact factor: 47.728

Review 7.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

8.  Exploring a latent cause theory of classical conditioning.

Authors:  Samuel J Gershman; Yael Niv
Journal:  Learn Behav       Date:  2012-09       Impact factor: 1.986

9.  Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards.

Authors:  Matthew R Roesch; Donna J Calu; Geoffrey Schoenbaum
Journal:  Nat Neurosci       Date:  2007-11-18       Impact factor: 24.884

Review 10.  A unified framework for addiction: vulnerabilities in the decision process.

Authors:  A David Redish; Steve Jensen; Adam Johnson
Journal:  Behav Brain Sci       Date:  2008-08       Impact factor: 21.357

View more
  7 in total

1.  Toward a Functional View of the P Factor in Psychopathology.

Authors:  Charles S Carver; Sheri L Johnson; Kiara R Timpano
Journal:  Clin Psychol Sci       Date:  2017-06-11

Review 2.  Impulsive reactivity to emotion and vulnerability to psychopathology.

Authors:  Charles S Carver; Sheri L Johnson
Journal:  Am Psychol       Date:  2018-12

3.  Electrophysiological responses to feedback during the application of abstract rules.

Authors:  Matthew M Walsh; John R Anderson
Journal:  J Cogn Neurosci       Date:  2013-08-05       Impact factor: 3.225

4.  VTA neurons coordinate with the hippocampal reactivation of spatial experience.

Authors:  Stephen N Gomperts; Fabian Kloosterman; Matthew A Wilson
Journal:  Elife       Date:  2015-10-14       Impact factor: 8.140

5.  Modeling Search Behaviors during the Acquisition of Expertise in a Sequential Decision-Making Task.

Authors:  Cristóbal Moënne-Loccoz; Rodrigo C Vergara; Vladimir López; Domingo Mery; Diego Cosmelli
Journal:  Front Comput Neurosci       Date:  2017-09-08       Impact factor: 2.380

Review 6.  Dissociable functions of reward inference in the lateral prefrontal cortex and the striatum.

Authors:  Shingo Tanaka; Xiaochuan Pan; Mineki Oguchi; Jessica E Taylor; Masamichi Sakagami
Journal:  Front Psychol       Date:  2015-07-16

Review 7.  Solving the Credit Assignment Problem With the Prefrontal Cortex.

Authors:  Alexandra Stolyarova
Journal:  Front Neurosci       Date:  2018-03-27       Impact factor: 4.677

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.