| Literature DB >> 32322967 |
Jutta Peterburs1, Alena Frieling2, Christian Bellebaum2.
Abstract
Learning to execute a response to obtain a reward or to inhibit a response to avoid punishment is much easier than learning the reverse, which has been referred to as "Pavlovian" biases. Despite a growing body of research into similarities and differences between active and observational learning, it is as yet unclear if Pavlovian learning biases are specific for active task performance, i.e., learning from feedback provided for one's own actions, or if they persist also when learning by observing another person's actions and subsequent outcomes. The present study, therefore, investigated the influence of action and outcome valence in active and observational feedback learning. Healthy adult volunteers completed a go/nogo task that decoupled outcome valence (win/loss) and action (execution/inhibition) either actively or by observing a virtual co-player's responses and subsequent feedback. Moreover, in a more naturalistic follow-up experiment, pairs of subjects were tested with the same task, with one subject as active learner and the other as observational learner. The results revealed Pavlovian learning biases both in active and in observational learning, with learning of go responses facilitated in the context of reward obtainment, and learning of nogo responses facilitated in the context of loss avoidance. Although the neural correlates of active and observational feedback learning have been shown to differ to some extent, these findings suggest similar mechanisms to underlie both types of learning with respect to the influence of Pavlovian biases. Moreover, performance levels and result patterns were similar in those observational learners who had observed a virtual co-player and those who had completed the task together with an active learner, suggesting that inclusion of a virtual co-player in a computerized task provides an effective manipulation of agency.Entities:
Mesh:
Year: 2020 PMID: 32322967 PMCID: PMC8211594 DOI: 10.1007/s00426-020-01340-1
Source DB: PubMed Journal: Psychol Res ISSN: 0340-0727
Fig. 1Schematic illustration of the sequence and time course of stimulus presentation in a single learning block trial in the active (a) and observational versions (b) of the go/nogo task. This task was specifically designed to decouple outcome valence (win/loss) and action (go/nogo)
Fig. 2Mean performance accuracy according to action and outcome valence for active learners, yoked observers, and subjects who observed chance performance
Results of the analysis of effects for data from active learners and yoked observers (Experiment 1)
| Effects | P(incl) | P(incl|data) | BFInclusion |
|---|---|---|---|
| Block | 0.886 | 0.829 | 0.622 |
| Action | 0.886 | 1.000 | > 10,000 |
| Outcome valence | 0.886 | 1.000 | > 10,000 |
| Learning condition | 0.886 | 0.695 | 0.293 |
| Block * action | 0.503 | 0.120 | 0.135 |
| Block * outcome valence | 0.503 | 0.012 | 0.012 |
| Block * learning condition | 0.503 | 0.020 | 0.020 |
| Action * outcome valence | 0.503 | 1.000 | > 10,000 |
| Action * learning condition | 0.503 | 0.467 | 0.864 |
| Outcome valence * learning condition | 0.503 | 0.245 | 0.321 |
| Block * action * outcome valence | 0.120 | < 0.001 | < 0.001 |
| Block * action * learning condition | 0.120 | < 0.001 | < 0.001 |
| Block * outcome valence * learning condition | 0.120 | < 0.001 | < 0.001 |
| Action * outcome valence * learning condition | 0.120 | 0.029 | 0.220 |
| Block * action * outcome valence * learning condition | 0.006 | < 0.001 | < 0.001 |
This analysis averaged across all models containing a specific factor. The prior inclusion probability for a specific factor (P(incl)) is the summed prior probability of all models that include this factor. The posterior inclusion probability of a specific factor (P(incl|data)) is the summed posterior probability of all models that include this factor. The change from prior to posterior inclusion odds is provided as BFInclusion
Results of the analysis of effects for data from subjects who had observed chance performance and active learners from Experiment 1
| Effects | P(incl) | P(incl|data) | BF Inclusion |
|---|---|---|---|
| Block | 0.886 | 0.526 | 0.143 |
| Action | 0.886 | 1.000 | > 10,000 |
| Outcome valence | 0.886 | 1.000 | > 10,000 |
| Learning condition | 0.886 | 0.254 | 0.044 |
| Block * action | 0.503 | 0.047 | 0.048 |
| Block * outcome valence | 0.503 | 0.082 | 0.089 |
| Block * learning condition | 0.503 | 0.002 | 0.002 |
| Action * outcome valence | 0.503 | 1.000 | > 10,000 |
| Action * learning condition | 0.503 | 0.074 | 0.079 |
| Outcome valence * learning condition | 0.503 | 0.047 | 0.049 |
| Block * action * outcome valence | 0.120 | < 0.001 | 0.002 |
| Block * action * learning condition | 0.120 | < 0.001 | < 0.001 |
| Block * outcome valence * learning condition | 0.120 | < 0.001 | < 0.001 |
| Action * outcome valence * learning condition | 0.120 | < 0.001 | 0.019 |
| Block * action * outcome valence * learning condition | 0.006 | < 0.001 | < 0.001 |
The prior inclusion probability for a specific factor (P(incl)) is the summed prior probability of all models that include this factor. The posterior inclusion probability of a specific factor (P(incl|data)) is the summed posterior probability of all models that include this factor. The change from prior to posterior inclusion odds is provided as BFInclusion
Fig. 3Mean performance accuracy according to action and outcome valence for active learners and observers who completed the task simultaneously (Experiment 3)
Results of the analysis of effects for data from simultaneously performing subjects (Experiment 3)
| Effects | P(incl) | P(incl|data) | BF Inclusion |
|---|---|---|---|
| Block | 0.886 | 0.944 | 2.176 |
| Action | 0.886 | 1.000 | > 10,000 |
| Outcome valence | 0.886 | 1.000 | > 10,000 |
| Learning condition | 0.886 | 0.383 | 0.080 |
| Block * action | 0.503 | 0.063 | 0.066 |
| Block * outcome valence | 0.503 | 0.032 | 0.033 |
| Block * learning condition | 0.503 | 0.006 | 0.006 |
| Action * outcome valence | 0.503 | 1.000 | > 10,000 |
| Action * learning condition | 0.503 | 0.043 | 0.044 |
| Outcome valence * learning condition | 0.503 | 0.083 | 0.089 |
| Block * action * outcome valence | 0.120 | < 0.001 | < 0.001 |
| Block * action * learning condition | 0.120 | < 0.001 | < 0.001 |
| Block * outcome valence * learning condition | 0.120 | < 0.001 | < 0.001 |
| Action * outcome valence * learning condition | 0.120 | 0.003 | 0.019 |
| Block * action * outcome valence * learning condition | 0.006 | < 0.001 | < 0.001 |
The prior inclusion probability for a specific factor (P(incl)) is the summed prior probability of all models that include this factor. The posterior inclusion probability of a specific factor (P(incl|data)) is the summed posterior probability of all models that include this factor. The change from prior to posterior inclusion odds is provided as BFInclusion