| Literature DB >> 35927257 |
Magda Dubois1,2, Tobias U Hauser3,4.
Abstract
Deciding whether to forgo a good choice in favour of exploring a potentially more rewarding alternative is one of the most challenging arbitrations both in human reasoning and in artificial intelligence. Humans show substantial variability in their exploration, and theoretical (but only limited empirical) work has suggested that excessive exploration is a critical mechanism underlying the psychiatric dimension of impulsivity. In this registered report, we put these theories to test using large online samples, dimensional analyses, and computational modelling. Capitalising on recent advances in disentangling distinct human exploration strategies, we not only demonstrate that impulsivity is associated with a specific form of exploration-value-free random exploration-but also explore links between exploration and other psychiatric dimensions.Entities:
Mesh:
Year: 2022 PMID: 35927257 PMCID: PMC9352791 DOI: 10.1038/s41467-022-31918-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Design Table
| Question | Hypothesis | Sampling plan (e.g., power analysis) | Analysis Plan | Interpretation given to different outcomes | Outcome | |
|---|---|---|---|---|---|---|
| Analysis Step 1: task effect replication | Are exploitation and exploration horizon-modulated? | 1.1.a: Less exploitation in the long horizon: High-value bandit frequency: SH > LH 1.1.b: More value-free random exploration in the long horizon: Low value bandit frequency: SH < LH 1.1.c: More novelty exploration in the long horizon: Novel bandit frequency: SH < LH | Pilot data lowest effect size: 0.410. To detect with 95% power, a similar effect size requires a sample of | Paired samples t-test (or Wilcoxon signed-rank test if the Shapiro normality assumption is violated). | 1.1.a: No effect: no evidence that our horizon manipulation modulated overall exploitation. Opposite effect: overall exploitation is increased in the long horizon. 1.1.b: No effect: no evidence that our horizon manipulation modulated value-free random exploration. Opposite effect: value-free random exploration is increased in the short horizon. 1.1.c: No effect: no evidence that our horizon manipulation modulated novelty exploration. Opposite effect: novelty exploration is increased in the short horizon. | 1.1.a: Hypothesis confirmed: Less exploitation in the long horizon. 1.1.b: Hypothesis confirmed: More value-free random exploration in the long horizon. 1.1.c: Hypothesis confirmed: More novelty exploration in the long horizon. Answer to research question: Yes, exploitation and exploration are horizon-modulated. |
| Is exploration beneficial for participants? | 1.2.a: Lower initial reward in the long horizon: Reward of first sample: SH > LH 1.2.b: Higher reward overall in the long horizon: Reward averaged over samples: SH < LH | Pilot data lowest effect size: 0.835. To be detected with 95% power, a similar effect size requires a sample of N = 21. | 1.2.a: No effect: no evidence that participants sacrificed a higher initial outcome in the long horizon. Opposite effect: participants optimised initial reward in the long horizon. 1.2.b: No effect: no evidence that participants took advantage of the information gained in the long horizon. Opposite effect: information gain negatively impacted reward. | 1.2.a: Hypothesis confirmed: Lower initial reward in the long horizon. 1.2.b: Hypothesis confirmed: Higher reward overall in the long horizon. Answer to research question: Yes, exploration is beneficial for participants. | ||
| Do participants use exploration heuristics? | 1.3: Average BIC score: complex model + | Pilot data effect size: 1.304. To be detected with 95% power, a similar effect size requires a sample of N = 10. | 1.3: No effect: no evidence that participants combine complex models with heuristics. Opposite effect: Participants are not using complex models with both heuristics. | 1.3: Hypothesis confirmed: The average BIC score was higher for the complex models with both heuristics compared to other models. Answer to research question: Yes, participants use exploration heuristics. | ||
| Are exploration heuristics used more in the long horizon? | 1.4.a: Value-free random exploration is used more in the long horizon: 1.4.b Novelty exploration is used more in the long horizon: Novelty bonus | Pilot data lowest effect size: 0.446. To be detected with 95% power, a similar effect size requires a sample of N = 71. | 1.4.a: No effect: no evidence that Opposite effect: value-free random exploration is increased in the short horizon. 1.4.b: No effect: no evidence that Opposite effect: novelty exploration is increased in the short horizon. | 1.4.a: Hypothesis confirmed: Value-free random exploration is used more in the long horizon. 1.4.b: Hypothesis confirmed: Novelty exploration is used more in the long horizon. Answer to research question: Yes, exploration heuristics are used more in the long horizon. | ||
| Analysis Step 2: impulsivity | Is impulsivity linked to value-free random exploration? | 2.1: Value-free random exploration is positively associated to BIS: | Previous study[ | Bivariate and partial (correcting for age and IQ) Pearson correlation and a repeated-measures ANOVA with within factor horizon and between participants variable [impulsivity/ADHD-symptoms] | 2.1: No effect: no evidence for an association between value-free random exploration and general impulsivity as measured by the BIS. Opposite effect: Value-free random exploration is negatively associated to BIS. | 2.1: Hypothesis confirmed: Value-free random exploration is positively associated to BIS. Answer to research question: Yes, impulsivity is linked to value-free random exploration. |
| Are ADHD symptoms linked to value-free random exploration? | 2.2: Value-free random exploration is positively associated to ASRS: | 2.2: No effect: no evidence for an association between value-free random exploration and ADHD as measured by the ASRS total score. Opposite effect: Value-free random exploration is negatively associated to ASRS total score. | 2.2: Hypothesis confirmed: Value-free random exploration is positively associated to ASRS. Answer to research question: Yes, ADHD symptoms are linked to value-free random exploration. |
Summary of preregistered hypotheses from our Stage 1 Registered report (the full protocol can be found at: 10.6084/m9.figshare.14346506.v1) with an additional ‘outcome’ column describing the observed effect. SH: Short horizon condition, LH: Long horizon condition.
Fig. 1Exploration task.
In the Maggie’s farm task, participants had to choose from three bandits (depicted as trees) to maximise their overall reward. The rewards (apple size) of each bandit followed a normal distribution with a fixed sampling variance. a At the beginning of each trial, participants are provided with some initial samples (number varied depending on the bandits present on that trial) on the wooden crate at the bottom of the screen and participants had to select which bandit they want to sample from next. b Depending the condition, they can either perform one draw (short horizon) or six draws (long horizon). The empty spaces on the wooden crate (and the position of the sun) indicate how many draws they have left. Image adapted from our previous study[23].
Fig. 2Increased exploration in the long horizon.
a Behavioural horizon effects: in the long (versus short) horizon participants sampled less from the high-value bandit (two-sided Wilcoxon signed-rank two-tailed test: V = 157079.5, p = 1.536e-81, Wilcoxon effect size r = 0.797) and more from the novel (V = 10355, p = 1.811e-72, r = 0.750) and low-value bandit (V = 34420, p = 2.817e-24, r = 0.425). b Model parameters: in the long (versus short) horizon participants had higher value of (i.e., value-free random exploration; V = 35367, p = 9.831e-34, r = 0.503), (i.e., novelty exploration; V = 10334, p = 7.411e-75, r = 0.076) and (their uncertainty about a bandit’s mean before seeing any samples; V = 54537, p = 1.868e-13, r = 0.306). The parameters were fitted to each participant’s first draw, and they were fitted to each horizon separately. ***p < 0.001. For detailed statistics cf. Supplementary Table 2. For details about model parameters cf. Supplementary Table 4. Data are shown as mean ± 95% CI and each dot/line represent one participant. Sample size for statistics: N = 580 human participants. Source data are provided as a Source Data file. Bar values: High-value bandit: Short Horizon (SH): 52.134, Long Horizon (LH): 42.088; Novel bandit: SH: 36.801, LH: 46.073; Low-value bandit: SH: 3.928, LH: 5.026; -greedy parameter: SH: 0.099, LH: 0.134; Novelty bonus : SH: 1.919, LH: 2.884; Prior variance : SH: 1.085, LH: 1.186.
Fig. 3Value-free random exploration linked to impulsivity.
General impulsivity (as measured by the BIS[49]) was significantly associated to value-free random exploration. This was observed both in a the model parameter, the -greedy parameter (Pearson’s correlation: r(578) = 0.171, p = 3.398e-05) and in b the behaviour, the frequency of picking the low-value bandit (r(578) = 0.174, p = 2.442e-05). The motor subscore of the BIS was most closely associated with value-free random exploration, both in c the model parameter (Bonferroni corrected (n = 3): r(578) = 0.198, pcor = 5.318e-06, punc = 1.772e-06) and d the behaviour (r(578) = 0.205, pcor = 2.249e-06, punc = 7.495e-07). Similarly, ADHD-related impulsivity (as measured by the ASRS[72]) was significantly associated to value-free random exploration, both in e the model parameter (r(578) = 0.157, p = 1.466e-04) and in f the behaviour (r(578) = 0.151, p = 3.069e-04). The hyperactivity-impulsivity subscore of the ASRS was most tightly associated to novelty exploration, both in g the model parameter (Bonferroni corrected (n = 2): r(578) = 0.205, pcor = 1.352e-06, punc = 6.764e-07) and h the behaviour (r(578) = 0.193, pcor = 6.319e-06, punc = 3.159e-06). The filled lines represent the 95%CI. Sample size for statistics: N = 580 human participants. Source data are provided as a Source Data file.
Fig. 4Transdiagnostic parcellation of symptoms.
a Three latent factors were identified when performing a factor analysis on individual questionnaire items. b The 10 most loading items for each factor illustrate the aspects that contribute to each dimension. Abbreviations: ASRS: Adult ADHD Self-Report Scale, BIS: Barratt Impulsiveness Scale, LSAS: Liebowitz Social Anxiety Scale, STAI: State-Trait Anxiety Inventory, IUS: Intolerance of Uncertainty, OCIR: Obsessive-Compulsive Inventory-Revised, SDS: Zung’s Self-rating Depression Scale, CFS: Cognitive Flexibility Scale, AQ10: Autism spectrum Quotient. (-) indicates reversed items. For correlations between each questionnaire total score cf. Supplementary Fig. 10. For loading weights of individual items on each factor cf. Supplementary Table 10–18. Source data are provided as a Source Data file.
Fig. 5Exploration associations with transdiagnostic psychiatric factors.
The factor analysis-derived impulsivity factor was significantly associated to value-free random exploration. This was observed both in a the model parameter, the -greedy parameter (Bonferroni corrected (n = 12); Pearson’s correlation: r(578) = 0.257, punc = 3.352e-10, pcor = 4.023e-09) and in b the behaviour, the frequency of picking the low-value bandit (Bonferroni corrected (n = 9): r(578) = 0.257, punc = 1.561e-09, pcor = 1.405e-08). Similarly, the anxious-depression factor was significantly associated with the novelty exploration, both in c the model parameter, the novelty bonus (Bonferroni corrected (n = 12): r(578) = 0.14, punc = 7.041e-04, pcor = 0.0084), and in d the behaviour (Bonferroni corrected (n = 9): r(578) = 0.19, punc = 4.084e-06, pcor = 3.675e-05), the frequency of picking the novel bandit. Pearson correlations for each factor and e model parameters as well as f behavioral task measures (i.e., bandit picking frequencies). Significant uncorrected correlations are displayed. Significant corrected correlations are indicated with asterisks (*p < 0.05, **p < 0.01, ***p < 0.001). Sample size for statistics: N = 580 human participants. Source data are provided as a Source Data file.