| Literature DB >> 19225599 |
Abstract
Complex cognitive tasks present a range of computational and algorithmic challenges for neural accounts of both learning and inference. In particular, it is extremely hard to solve them using the sort of simple policies that have been extensively studied as solutions to elementary Markov decision problems. There has thus been recent interest in architectures for the instantiation and even learning of policies that are formally more complicated than these, involving operations such as gated working memory. However, the focus of these ideas and methods has largely been on what might best be considered as automatized, routine or, in the sense of animal conditioning, habitual, performance. Thus, they have yet to provide a route towards understanding the workings of rule-based control, which is critical for cognitively sophisticated competence. Here, we review a recent suggestion for a uniform architecture for habitual and rule-based execution, discuss some of the habitual mechanisms that underpin the use of rules, and consider a statistical relationship between rules and habits.Entities:
Keywords: actions; habits; prefrontal cortex; rules; striatum
Year: 2008 PMID: 19225599 PMCID: PMC2622746 DOI: 10.3389/neuro.01.031.2008
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1The 12AX task (from O'Reilly and Frank, . (A) Subjects see a sequence of numbers and letters (in boxes), and have to respond by pressing one of two keys (L or R; correct responses are shown). They should choose L every time, except to respond R if the most recent digit is a 1 and X directly follows A, or the most recent digit is a 2 and Y directly follo-ws B. There can be many AX or BY subsequences following each 1 or 2, and inner loops always involve pairs of symbols. Different variants of the task impose slightly different constraints on the symbols in the non-one-back inner loops. (B) A finite state representation of one procedural solution to the task. S is the start state; the links are labelled with the observation and the action. The main states are annotated with a representation of the contents of working memory (in italics). ‘1:L’ is upside down indicating it labels the link from 2- to 1-; ‘2:L’ on the same arrow labels the link from 1- to 2-.