| Literature DB >> 34197589 |
Tsen Vei Lim1, Rudolf N Cardinal1,2, Edward T Bullmore1,2, Trevor W Robbins3, Karen D Ersche1,4.
Abstract
BACKGROUND: Drug-induced alterations to the dopamine system in stimulant use disorder (SUD) are hypothesized to impair reinforcement learning (RL). Computational modeling enables the investigation of the latent processes of RL in SUD patients, which could elucidate the nature of their impairments.Entities:
Keywords: Addiction; cocaine; hierarchical Bayesian modeling; punishment; reinforcement learning; reward
Mesh:
Substances:
Year: 2021 PMID: 34197589 PMCID: PMC8598302 DOI: 10.1093/ijnp/pyab041
Source DB: PubMed Journal: Int J Neuropsychopharmacol ISSN: 1461-1457 Impact factor: 5.176
Sample Demographics and Task Performance of the Two Studies.
| Study 1 | Study 2 | |||
|---|---|---|---|---|
| Groups | Control | CUD | Control | SUD |
| Demographics | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) |
| Task performance | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) |
| Sample size (n) | 41 | 44 | 18 | 18 |
| Age (years) | 40.1 (12.6) | 40.9 (9.2) | 32.7 (6.9) | 34.3 (7.2) |
| Gender (% male) | 100 | 100 | 83 | 83 |
| Verbal IQ (NART score) | 115 (6.2) | 103 (7.1) | 108 (6.0) | 109 (8.1) |
| Disposable income (£/month) | 657 (501) | 387 (462) | 470 (389) | 621 (866) |
| Subjective value of 50 pence (% rating) | 81.5 (23.0) | 87.1 (19.6) | 72.9 (31.0) | 87.4 (18.8) |
| Trait impulsivity (BIS-11, total score) | 56.1 (6.7) | 79.5 (11.4) | 62 (7.2) | 82 (9.5) |
| Duration of stimulant drug use (years) | — | 13.7 (8.0) | — | 12.3 (6.7) |
| Compulsive drug use (OCDUS total score) | — | 34.1 (10.1) | — | 25.6 (7.9) |
| Total % correct (reward) | ||||
| Placebo | 73.0 (21.6) | 57.9 (22.5) | 87.1 (24.5) | 81.1 (18.6) |
| Amisulpride | — | — | 87.9 (23.7) | 75.8 (27.3) |
| Pramipexole | — | — | 75.7 (30.9) | 61.8 (35.2) |
| Total % correct (punishment) | ||||
| Placebo | 63.6 (12.0) | 54.3 (10.6) | 73.3 (19.3) | 61.7 (13.5) |
| Amisulpride | — | — | 78.5 (17.5) | 62.4 (19.8) |
| Pramipexole | — | — | 72.9 (15.2) | 64.7 (18.8) |
Abbreviations: BIS-11, Barratt Impulsiveness Scale; CUD, cocaine use disorder; NART, National Adult Reading Test; OCDUS, Obsessive-Compulsive Drug Use Scale; SUD, stimulant use disorder.
Figure 1.Schematics for the probabilistic reinforcement learning task of study 1 and study 2. In each trial, participants were first presented with a pair of stimuli and required to select 1 stimulus. After selection, the computer presented an outcome phrased in terms of monetary gains (positive) or losses (negative); this allowed the separate assessment of learning from reward and punishment. In both studies, each condition was differentiated by unique stimulus pairs and feedback and interspersed across 120 trials and presented in a randomized order. Optimal choices are reinforced 70% of the time, so participants needed to accrue experience over time to determine the choices that would maximize their financial gains and minimize their losses.
Figure 2.Accuracy scores, defined as the proportion of optimal choices made in 10-trial blocks, for the behavioral task. These scores are plotted separately based on condition (reward and punishment) and group (controls and stimulant use disorder [SUD]). (A) Reinforcement learning performance accuracy in study 1. (B) Reinforcement learning performance accuracy for the placebo condition in study 2. Error bars denote SEM, and the horizontal dotted line indicates accuracy at chance level (50%).
Figure 3.Group mean differences for the reinforcement learning parameters. (A) In study 1, the learning rate from punishment and reinforcement sensitivity were significantly reduced in the stimulant use disorder (SUD) participants, while the other parameters were no different across groups. (B) In the placebo condition of study 2, we found a markedly reduced learning rate from punishment in SUD patients. Error bars denote 95% highest density intervals (HDI); parameters colored in red signify a credible group difference (95% HDI excludes zero).
Figure 4.Mean differences of the reinforcement learning parameters for each drug condition. The dopaminergic agents are directly compared with placebo. (A) Amisulpride reduced the learning rates in healthy controls but increased the reinforcement sensitivity parameter. (B) Pramipexole selectively reduced the reward learning rate parameter in control participants, but had no effect on the other parameters. (C) Amisulpride improved the punishment learning rate in stimulant use disorder (SUD) participants. (D) Pramipexole significantly increased punishment learning rate and reduced reinforcement sensitivity parameters in SUD patients. Error bars denote 95% highest density intervals (HDI); parameters colored in red indicate a credible drug effect, as their 95% HDI excludes zero.