| Literature DB >> 26074797 |
Filip Gęsiarz1, Molly J Crockett1.
Abstract
Although prosocial behaviors have been widely studied across disciplines, the mechanisms underlying them are not fully understood. Evidence from psychology, biology and economics suggests that prosocial behaviors can be driven by a variety of seemingly opposing factors: altruism or egoism, intuition or deliberation, inborn instincts or learned dispositions, and utility derived from actions or their outcomes. Here we propose a framework inspired by research on reinforcement learning and decision making that links these processes and explains characteristics of prosocial behaviors in different contexts. More specifically, we suggest that prosocial behaviors inherit features of up to three decision-making systems employed to choose between self- and other- regarding acts: a goal-directed system that selects actions based on their predicted consequences, a habitual system that selects actions based on their reinforcement history, and a Pavlovian system that emits reflexive responses based on evolutionarily prescribed priors. This framework, initially described in the field of cognitive neuroscience and machine learning, provides insight into the potential neural circuits and computations shaping prosocial behaviors. Furthermore, it identifies specific conditions in which each of these three systems should dominate and promote other- or self- regarding behavior.Entities:
Keywords: Pavlovian; altruism; dictator game; model-based; model-free; prosocial behavior; reinforcement learning; warm-glow
Year: 2015 PMID: 26074797 PMCID: PMC4444832 DOI: 10.3389/fnbeh.2015.00135
Source DB: PubMed Journal: Front Behav Neurosci ISSN: 1662-5153 Impact factor: 3.558
Figure 1Stimulus-Response-Outcome contingency and corresponding decision-making systems. The Stimulus-Response-Outcome association is learned through mechanisms of instrumental conditioning, and the Stimulus-Outcome association through mechanisms of classical conditioning. The goal-directed system uses response-outcome associations to infer which actions will bring the best outcomes from the perspective of current goals. The habitual system uses stimulus-response associations to emit responses that produced the best outcomes in similar situations in the past. The Pavlovian system emits innate responses to outcomes that were significant in our evolutionary history or stimuli that were associated with these outcomes.
Properties of three decision-making systems.
| Goal-directed system | Habitual system | Pavlovian system |
|---|---|---|
| • Employs model-based planning algorithms | • Employs model-free learning algorithms | • Employs a priori programmed solutions |
| • Deliberate | • Automatic/Learned | • Automatic/Inborn |
| • Dominating at the beginning of learning | • Dominating in late stages of learning | • Can dominate at all stages of learning |
| • Dependent on working-memory | • Independent from working-memory | • Independent from working-memory |
| • Sensitive to sudden changes in motivational states | • Insensitive to sudden changes in motivational states | • Sensitive to sudden changes in motivational states |
| • Sensitive to consequences of actions | • Insensitive to consequences of actions | • Insensitive to consequences of actions |