Quentin J M Huys, Níall Lally, Paul Faulkner, Neir Eshel, Erich Seifritz, Samuel J Gershman, Peter Dayan, Jonathan P Roiser.
Abstract
Humans routinely formulate plans in domains so complex that even the most powerful computers are taxed. To do so, they seem to avail themselves of many strategies and heuristics that efficiently simplify, approximate, and hierarchically decompose hard tasks into simpler subtasks. Theoretical and cognitive research has revealed several such strategies; however, little is known about their establishment, interaction, and efficiency. Here, we use model-based behavioral analysis to provide a detailed examination of the performance of human subjects in a moderately deep planning task. We find that subjects exploit the structure of the domain to establish subgoals in a way that achieves a nearly maximal reduction in the cost of computing values of choices, but then combine partial searches with greedy local steps to solve subtasks, and maladaptively prune the decision trees of subtasks in a reflexive manner upon encountering salient losses. Subjects come idiosyncratically to favor particular sequences of actions to achieve subgoals, creating novel complex actions or "options."
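The "reflexive pruning" finding can be illustrated with a minimal sketch (not the authors' actual model; the tree, threshold, and function names here are hypothetical): a depth-limited tree search that discards any branch whose immediate outcome is a salient loss, even when the subtree behind that loss contains the best overall path.

```python
# Hypothetical illustration of reflexive pruning in tree search.
# A branch whose immediate reward falls at or below `prune_below`
# is never expanded, even if the path through it is optimal.

def best_value(tree, depth, prune_below=None):
    """tree: dict mapping action -> (reward, subtree); subtree may be {}."""
    if depth == 0 or not tree:
        return 0.0
    values = []
    for reward, subtree in tree.values():
        if prune_below is not None and reward <= prune_below:
            continue  # reflexively discard the branch after a salient loss
        values.append(reward + best_value(subtree, depth - 1, prune_below))
    return max(values) if values else 0.0

# A large loss (-70) guards the globally best path (-70 + 140 = 70).
tree = {
    "left":  (-70, {"a": (140, {})}),
    "right": (20,  {"b": (20,  {})}),
}

full = best_value(tree, depth=2)                     # exhaustive search: 70
pruned = best_value(tree, depth=2, prune_below=-50)  # pruned search: 40
```

Under exhaustive search the loss-guarded path wins (value 70), but the pruned search settles for the locally safer branch (value 40), mirroring the maladaptive behavior the abstract describes.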
Keywords: hierarchical reinforcement learning; memoization; planning; pruning
Year: 2015 PMID: 25675480 PMCID: PMC4364207 DOI: 10.1073/pnas.1414219112
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205