| Literature DB >> 35470564 |
Alex H Robinson1, Trevor T-J Chong1, Antonio Verdejo-Garcia1.
Abstract
People with Methamphetamine Use Disorder (PwMUD) spend substantial time and resources on substance use, which hinders their ability to explore alternate reinforcers. Gold-standard behavioural treatments attempt to remedy this by encouraging action towards non-drug reinforcers, but substance use often persists. We aimed to unravel the mechanistic drivers of this behaviour by applying a computational model of explore/exploit behaviour to decision-making data (Iowa Gambling Task) from 106 PwMUD and 48 controls. We then examined the longitudinal link between explore/exploit mechanisms and changes in methamphetamine use 6 weeks later. Exploitation parameters included reinforcement sensitivity and inverse decay (i.e., number of past outcomes used to guide choices). Exploration parameters included maximum directed exploration value (i.e., value of trying novel actions). The Timeline Follow Back measured changes in methamphetamine use. Compared to controls, PwMUD showed deficits in exploitative decision-making, characterised by reduced reinforcement sensitivity, U = 3065, p = 0.009, and less use of previous choice outcomes, U = 3062, p = 0.010. This was accompanied by a behavioural pattern of frequent shifting between choices, which appeared consistent with random exploration. Furthermore, PwMUD with greater reductions of methamphetamine use at 6 weeks had increased directed exploration (β = 0.22, p = 0.045); greater use of past choice outcomes (β = -0.39, p = 0.002) and greater choice consistency (β = -0.39, p = 0.002). Therefore, limited computational exploitation and increased behavioural exploration characterise PwMUD's presentation to treatment, while increased directed exploration, use of past choice outcomes and choice consistency predict greater reductions of methamphetamine use.Entities:
Keywords: computational modelling; decision-making; explore/exploit; methamphetamine; predictive; substance use
Mesh:
Substances:
Year: 2022 PMID: 35470564 PMCID: PMC9286537 DOI: 10.1111/adb.13172
Source DB: PubMed Journal: Addict Biol ISSN: 1355-6215 Impact factor: 4.093
Descriptive statistics of sociodemographic characteristics in PwMUD and controls
| PwMUD | Controls | Frequentist Mann. Whit. | Bayes factor Mann. Whit. | |
|---|---|---|---|---|
| Sex (F/M) | 27F/79M | 12F/36M |
| |
| Age | 31.20 (7.25) | 31.59 (8.67) |
|
|
| Education (years) | 12.93 (2.25) | 13.26 (2.07) |
|
|
| FSIQ | 96.19 (10.96) | 101.92 (11.36) |
|
|
| Employed (any) | 28 (26.92%) | 26 (54.17%) |
| |
| Depression (CES‐D) | 28.51 (12.30) | 7.25 (5.93) |
|
|
| Sociodem. Status | 6.97 (2.41) | 7.92 (1.60) |
|
|
Note: Values represent means with standard deviations in parentheses.
Abbreviation: CES‐D, Centre for Epidemiologic Studies Depression Scale.
PwMUDs baseline methamphetamine and other common substance use
|
|
| Range | |
|---|---|---|---|
| Severity of Dependence Meth | 11.10 | 3.15 | [1–15] |
| Severity of Dependence Cann. | 2.63 | 4.24 | [0–15] |
| Severity of Dependence Alch. | 1.71 | 3.30 | [0–14] |
|
| |||
| Daily dose (grammes) | 0.71 | 0.56 | [0.03–4.00] |
| Frequency (days/month) | 23.30 | 9.25 | [4–31] |
| Duration of use (years) | 6.95 | 4.90 | [0.6–30] |
|
| |||
| Cigarettes | 407.13 | 257.50 | [0–930] |
| Standard drinks | 75.34 | 160.76 | [0–775] |
| Cannabis | 24.89 | 42.21 | [0–217] |
Note: Severity of Dependence Scale scores range between 0 and 15, with those >4 indicating problematic methamphetamine use, ≥3 indicating likely Alcohol Use Disorder and ≥4 indicating likely Cannabis Use Disorder.
FIGURE 1Cumulative net score of PwMUD (red) and controls (blue) across IGT. Note. Solid line represents mean; shaded area represents 95% confidence intervals
FIGURE 2Comparison of behavioural indices of sequential exploration (SE3 and SE4) between PwMUD (red) and controls (blue). Note. SE3, sequential exploration 3. SE4, sequential exploration 4. These refer to how often participants chose three/four different decks across three/four subsequent trials (i.e., choosing deck a then C then B then D). Dotted line represents theoretical chance of each event (0.33 for SE3, and 0.09 for SE4). Solid line represents mean; shaded area represents 95% confidence intervals
FIGURE 3Behaviour indices of choice behaviour in PwMUD (red) and controls (blue). Note. Individual dots represent each participant in each group; * = p < .05. While sequential exploration 4 was significant, BF10 = 1.75. Higher values of mutual information highlight indicate that choices on subsequent trials are better predicted by the current choice. Higher values of choice entropy indicate that participants evenly explored all decks; lower values of choice entropy indicate selecting from only a few decks/one deck. Sequential exploration 3/4 refers to how often participants chose three/four different decks across three/four subsequent trials
FIGURE 4VSE parameter estimates of PwMUD (red) and controls (blue). Note. Individual dots represent each participant in each group; *p < .05. While inverse decay was significant, BF 10 = 2.48. Higher values of reinforcement sensitivity indicate greater sensitivity for reward/punishment. Higher values of inverse decay indicate use of more previous outcomes. Higher values of consistency reflect less choice stochasticity. Higher values of exploration bonus indicate a greater maximum value of directed exploration. Higher values of exploration learning rate indicate a greater frequency of reaching the maximum directed exploration value
Statistics of best fitting model using the VSE parameters to predict PwMUDs reduction of use between baseline and follow‐up
| Predictor | Unstandardised coefficient | Standard error | Standardised Coef. ( |
|
|
|
|
|---|---|---|---|---|---|---|---|
| Intercept | 33.31 | 4.67 | 7.14 | <0.001 | 1.00 | ||
| Inverse decay, ∆ | −16.12 | 5.05 | −0.39 | [−0.63, −0.14] | −3.19 | 0.002 | 16.29 |
| Exploration bonus, | 1.35 | 0.66 | 0.22 | [0.04, 0.44] | 2.04 | 0.045 | 4.36 |
| Consistency, | −11.94 | 3.71 | −0.39 | [−0.64, −0.15] | −3.22 | 0.002 | 14.94 |
Summary of VSE parameters, their interpretation, meanings and findings in between‐group and predictive analysis
| Parameter name | Range | Interpretation | Between‐group differences | Predicting changes in meth. use |
|---|---|---|---|---|
|
| ||||
| Reinforcement sensitivity, | θ ∈ [0, 1] | Influences the strength of rewards/losses on value calculations. Values near 0 reflect a weaker sensitivity to value of rewards/losses equally. Values near 1 reflect a greater sensitivity to value of rewards/losses equally. | PwMUD had less sensitivity to reinforcement than controls. | Was not included in the best fitting model. |
| Inverse decay, Δ | Δ ∈ [0, 1] | Identifies how many previous trial outcomes the participant is using to guide their current decision. At 0, only the previous trials outcome guides the next decision. At 1, all the previous trials outcomes guide the next decision. | PwMUD likely used fewer previous trial outcomes to guide their current decision, compared to controls. | Was in the best fitting model. Using fewer previous choices were associated with a smaller reduction in methamphetamine use. |
|
| ||||
| Directed exploration bonus, |
| Sets a maximal ‘threshold’ of how biassed the participant is to select decks that have not been explored in recent trials. Negative values indicate an overall preference to keep selecting familiar decks (exploitation). Positive values indicate an overall preference to explore recently unselected decks (directed exploration). | A likely null difference between groups. Neither group had a higher threshold or bias towards directed exploration. | Was in the best fitting model. Having a greater bias towards directed exploration was associated with a greater reduction in methamphetamine use. |
| Directed exploration learning rate, |
| How quickly participants return to their maximal value of exploration again, after having recently explored. Values near 0 reflect a slow return to the maximal explore value. Values near 1 reflect a quick return to the maximal explore value. | PwMUD reached their upper value of directed exploration slower than controls. Please note, this does not necessarily mean their exploration value was lower than exploitation. | Was not included in the best fitting model. |
|
| ||||
| Consistency, |
| Reflects stochastic behaviour (i.e., whether the behaviour of the participant is consistent or not with the equations of the VSE model). Greater values reflect behaviour with greater consistency to the VSE model. Smaller values reflect more unpredictable behaviour to the VSE model. | A likely null difference between groups. This means both groups appeared to act equally consistently within the parameters of the VSE model. | Was in the best fitting model. Having greater choice stochasticity was associated with a smaller reduction in methamphetamine use. |