| Literature DB >> 27811969 |
Prachi Mistry1, Mimi Liljeholm1.
Abstract
A critical aspect of flexible choice is that alternative actions yield distinct consequences: Only when available action alternatives produce distinct outcome states does discrimination and selection between actions allow an agent to flexibly obtain the currently most desired outcome. Here, we use instrumental divergence - the degree to which alternative actions differ with respect to their outcome probability distributions - as an index of flexible instrumental control, and assess the influence of this novel decision variable on choice preference. In Experiment 1, when other decision variables, such as expected value and outcome entropy, were held constant, we found a significant preference for high instrumental divergence. In Experiment 2, we used an "auto- vs. self-play" manipulation to eliminate outcome diversity as a source of behavioral preferences, and to contrast flexible instrumental control with the complete absence of voluntary choice. Our results suggest that flexible instrumental control over decision outcomes may have intrinsic value.Entities:
Year: 2016 PMID: 27811969 PMCID: PMC5095609 DOI: 10.1038/srep36295
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Probability distributions over three potential outcomes (O1, O2 & O3) for two available actions (A1 & A2) across which instrumental divergence – the difference between outcome probability distributions – is zero (a) or high (b).
Figure 2Task illustration showing the choice screen at the beginning of a block (top), and the choice screen (middle) and feedback screen (bottom) from a trial within a block.
Token probabilities and reward distributions, gambling rooms and choice scenarios in Experiment 1.
| Token Outcomes | Rooms | Choice scenarios | |||||
|---|---|---|---|---|---|---|---|
| High Div. | Zero Div. | a vs. e | d vs. e | ||||
| A1 & A2 | 0.0 | 0.7 | 0.3 | a. A1 & A3 | e. A1 & A2 | a vs. f | d vs. f |
| A3 & A4 | 0.7 | 0.0 | 0.3 | b. A2 & A4 | f. A3 & A4 | b vs. e | a vs. b |
| Balanced | $2 | $2 | $1 | c. A1 & A4 | b vs. f | e vs. f | |
| Unbalanced 1 | $1 | $2 | $3 | d. A2 & A3 | c vs. e | ||
| Unbalanced 2 | $2 | $1 | $3 | c vs. f | |||
The top two rows in the 2nd column indicate the probability of each colored token given either of the actions listed to the left; the bottom three rows indicate the monetary value of each token in balanced and unbalanced blocks. The third column shows the pair of actions available in each room, and the fourth column the combination of rooms into choice scenarios.
Mean ratings of token probabilities following pre-training, for programmed probabilities of 0.7, 0.0 and 0.3, obtained before and after gambling, in Experiments 1 and 2.
| 0.7 | 0.0 | 0.3 | ||
|---|---|---|---|---|
| Exp. 1 | Before | 0.70 ± 0.02 | 0.00 ± 0.02 | 0.30 ± 0.02 |
| After | 0.64 ± 0.16 | 0.04 ± 0.15 | 0.31 ± 0.09 | |
| Exp. 2 | Before | 0.69 ± 0.02 | 0.00 ± 0.02 | 0.30 ± 0.03 |
| After | 0.64 ± 0.16 | 0.05 ± 0.16 | 0.32 ± 0.09 |
Figure 3Mean choice proportions in Experiments 1 and 2.
Dashed lines indicate chance performance. Error bars = SEM. (a) Mean proportions of high- over zero-divergence choices, for blocks in which expected values were identical across high- and low-divergence options (Balanced), blocks in which expected values differed across options in the same direction as divergence (Unbalanced same) and blocks in which expected values differed in the opposite direction of divergence (Unbalanced opposite), in Experiment 1. (b) Mean proportions of high- over zero-divergence choices (left) for blocks in which the high-divergence option was Auto-play versus blocks in which the high-divergence option was Self-play, and mean proportions of self- over auto-play choices (right) for blocks in which both options had high-divergence (High-div.) versus blocks in which both options had zero-divergence (Zero-div.), in Experiment 2.