| Literature DB >> 23565100 |
John Fox1, Richard P Cooper, David W Glasspool.
Abstract
Decision-making behavior is studied in many very different fields, from medicine and economics to psychology and neuroscience, with major contributions from mathematics and statistics, computer science, AI, and other technical disciplines. However the conceptualization of what decision-making is and methods for studying it vary greatly and this has resulted in fragmentation of the field. A theory that can accommodate various perspectives may facilitate interdisciplinary working. We present such a theory in which decision-making is articulated as a set of canonical functions that are sufficiently general to accommodate diverse viewpoints, yet sufficiently precise that they can be instantiated in different ways for specific theoretical or practical purposes. The canons cover the whole decision cycle, from the framing of a decision based on the goals, beliefs, and background knowledge of the decision-maker to the formulation of decision options, establishing preferences over them, and making commitments. Commitments can lead to the initiation of new decisions and any step in the cycle can incorporate reasoning about previous decisions and the rationales for them, and lead to revising or abandoning existing commitments. The theory situates decision-making with respect to other high-level cognitive capabilities like problem solving, planning, and collaborative decision-making. The canonical approach is assessed in three domains: cognitive and neuropsychology, artificial intelligence, and decision engineering.Entities:
Keywords: autonomous agents; clinical decision-making; cognitive systems; decision-making; unified theories of cognition
Year: 2013 PMID: 23565100 PMCID: PMC3613596 DOI: 10.3389/fpsyg.2013.00150
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1The Soar information processing architecture.
Figure 2The Soar dynamic decision cycle.
Capabilities that are typical of agent systems described in the AI literature (Fox et al., .
| Perception | Observing and monitoring situations and events in the environment |
| Action | Executing actions that change or control the environment |
| Communication | Employing perception and action to interact with other agents |
| Reasoning | Making inferences on the basis of environmental data, beliefs, goals, knowledge, etc. |
| Problem solving | Searching for explanations of observations, plans which will achieve goals etc. |
| Decision-making | Choosing between alternative hypotheses or actions |
| Scheduling | Sequencing actions and plans flexibly in response to circumstances |
| Planning | Constructing a set or sequence of actions to achieve a goal |
| Learning | Remembering solutions to newly encountered problems for future reuse |
| Reactive behavior | Responding to situations and events in real time |
| Deliberative behavior | The application of cognitive capabilities in a purposive, coordinated way |
| Autonomy | Making plans, taking decisions, etc. without external programing or supervision |
Some of the mental/cognitive states that have been studied in AI.
| Beliefs | Specific information which an agent holds to be true at a particular moment in time |
| Desires | Specific goals which are currently influencing an agent’s behavior |
| Intentions | Specific commitments to actions or plans which an agent has decided to carry out |
| Knowledge | General theories, rules, functions etc as distinct from situation-specific beliefs, desires, and intentions |
Figure 3The domino agent framework, an enhanced BDI agent model.
Figure 4The “ontological ladder,” which formalizes knowledge as a hierarchy of increasingly complex and semantically rich conceptual structures.
Figure 5The Contention Scheduling/Supervisory System model of Norman and Shallice (.
Figure 6The Wisconsin Card Sorting Test. The four target cards are shown across the top row and four piles for sorted cards in the second row, the third of which is currently empty. The card to-be-sorted is at bottom. If the subject is sorting according to color or form, this card should be placed under the third target card, but if he/she is sorting according to number then it should be placed under the first target card.
Some relationships between the canonical functions and selected evidence from cognitive psychology and cognitive neuroscience.
| Signature | Summary |
|---|---|
| S1 (belief maintenance) | Beliefs may be supported by the environment (i.e., inferred from perceptual input) or inferred from long-term knowledge and other beliefs. Both must be actively maintained in working memory (e.g., by rehearsal) |
| S2 (raising goals) | Much behavior, with the possible exception of habitual behavior, can be understood as being purposive or goal-directed. In experimental psychology, high-level task goals are set by the experimenter, with subjects deriving lower-level goals for individual trials. Findings from experimental psychology and more generally indicate that goals provide local coherence of behavior |
| S3 (problem solving) | A variety of problem solving strategies or heuristics may be recruited to generate solutions for a given goal. This includes so-called “weak” methods which are general, knowledge-lean, heuristics such as hill-climbing and means-ends analysis, as well as knowledge-rich, task-specific strategies, acquired through experience |
| S4 (reasons for decisions) | Evolutionary arguments (e.g., Mercier and Sperber, |
| S5 (aggregation) | One neuropsychological hypothesis is that aggregation of the merit of arguments is based on somatic markers – emotionally biased valences associated with decision options acquired through positive and negative experience (Damasio, |
| S6 (commitment) | Commitment to a single decision candidate is required by theories such as Damasio’s somatic marker hypothesis. In the specific context of selecting one word from a set, commitment has been related to the inferior frontal gyrus (Shallice and Cooper, |
| S7 (plan enactment) | Plan enactment is most closely related to the function of task setting, held by many to be a function of left lateral prefrontal cortex (e.g., Shallice et al., |
| S8 (action) | The contention scheduling system provides an account of how intentions are mapped to actions, subject to available resources |
| S9 (monitoring) | A substantial body of evidence suggests that many cognitive processes create expectations that under normal operation are continuously monitored. Perceptual processes may also monitor the external environment for deviations from expected perceptual input. Shallice et al. ( |
| S10 (learning) | There are many forms of learning. One is learning to associate consequences with cognitive and motor actions. These consequences then become expectations which are used by monitoring. A second critical form is reinforcement learning, where positive or negative reward can increase or decrease the merit of a candidate in the context of a goal |
Figure 7Multi-agent network for cooperative decision-making (left), and information processing architecture for an autonomous agent (right).
Figure 8Sequence diagram for some of the interactions between agent C and agent S during the joint decision-making simulation.
The relation between the canon signatures and functions which are implemented in the multi-agent decision-making scenario.
| Signature | Summary |
|---|---|
| S1 (belief maintenance) | Any rule in the agent model can make inferences by applying knowledge to the current working memory state and add, delete or replace information in the working memory. Every item of data in working memory is tagged with the grounds for believing it (e.g., the goal and assumptions which justify it). It uses this to maintain a consistent overall belief state |
| S2 (raising goals) | Goals are a form of belief which are used to determine which knowledge and rules are potentially in play at any moment |
| S3 (problem solving) | Any kind of problem solving technique can be implemented in the COGENT programing system, with the solution then added to working memory |
| S4 (reasons for decisions) | A form of argumentation based on defeasible logic is used to generate and maintain arguments for competing solutions as the working memory belief state changes |
| S5 (aggregation) | In the multi-agent decision-making scenario a simple improper linear aggregation function is implemented (adding up pros and cons) though other aggregation functions can be implemented |
| S6 (commitment) | The multi-agent scenario includes two kinds of commitment, provisional (reversible), and firm (irreversible) |
| S7 (plan enactment) | Dialog plans are simple lists of communication actions that are executed in sequence but can be interrupted if a communication is received from another agent |
| S8 (action) | The main kinds of actions that are included in this demonstration are standard communication performatives from speech act theory and agent communication languages |
| S9 (monitoring) | The whole domino system is a kind of “monitor” in that every computational component can respond to any update to the working memory state at any time |
| S10 (learning) | Two simple learning mechanisms have been implemented. These monitor the working memory and when a decision process terminates these mechanisms (1) add rules to the agent’s episodic knowledge and (2) update frequency counters which can be used to update the agent’s confidence in competing decision options |
Figure 9Simple .
The relation between the canon signatures and task representations in the .
| Signature | Summary |
|---|---|
| S1 (belief maintenance) | Beliefs in |
| S2 (raising goals) | A |
| S3 (problem solving) | Current |
| S4 (reasons for decisions) | Reasons in |
| S5 (aggregation) | A decision assesses all the argument for and against each option to determine their net overall force, and establish an order of preference over the options. The prior confidence and strength of arguments can be taken into account in the aggregation process |
| S6 (commitment) | Each option in a |
| S7 (plan enactment) | A |
| S8 (action) | When a |
| S9 (monitoring) | There is no specific support for monitoring. However continuous monitoring can be implemented using general language features |
| S10 (learning) | The |