| Literature DB >> 31324936 |
Jonathan W Kanen1,2, Karen D Ersche3,4, Naomi A Fineberg4,5,6, Trevor W Robbins7,3, Rudolf N Cardinal3,4,8.
Abstract
RATIONALE: Disorders of compulsivity such as stimulant use disorder (SUD) and obsessive-compulsive disorder (OCD) are characterised by deficits in behavioural flexibility, some of which have been captured using probabilistic reversal learning (PRL) paradigms.Entities:
Keywords: Addiction; Amisulpride; Compulsivity; Computational modelling; Dopamine; Obsessive-compulsive disorder; Pramipexole; Reinforcement learning; Reversal learning; Stimulant use disorder
Mesh:
Substances:
Year: 2019 PMID: 31324936 PMCID: PMC6820481 DOI: 10.1007/s00213-019-05325-w
Source DB: PubMed Journal: Psychopharmacology (Berl) ISSN: 0033-3158 Impact factor: 4.530
Demographic, psychological and baseline personality measures for the groups of healthy controls (HC; n = 18), individuals with stimulant use disorder (SUD; n = 17), and individuals with obsessive-compulsive disorder (OCD; n = 18). Mean (standard deviation)
| Group | HC | SUD | OCD |
|
|
|
|---|---|---|---|---|---|---|
| Age (years) | 32.7 (± 6.9) | 34.3 (± 7.4) | 35.4 (± 9.8) | 0.49 | 2.50 | 0.618 |
| Gender ratio (male/female) | 15:3 | 14:3 | 7:11 | 0.318a | ||
| Ethnic ratio (Caucasian:Afro-Caribbean) | 17:1 | 15:2 | 18:00 | 0.308a | ||
| Verbal intelligence quotient (NART) | 108.4 (± 6.0) | 108.0 (± 8.3) | 107.9 (± 8.8) | 0.06 | 2.50 | 0.938 |
| Years of education | 12.4 (± 1.8) | 11.2 (± 1.0) | 12.3 (± 2.0) | 2.06 | 2.50 | 0.082 |
| Dysphoric mood, BDI-II (total score at baseline) | 1.1 (± 2.4) | 9.8 (± 11.2) | 18.5 (± 10.0) | 18.07 | 2.50 | < 0.001 |
| Impulsivity, BIS-11 (total score) | 62.0 (± 7.2) | 81.7 (± 9.7) | 66.9 (± 9.7) | 22.83 | 2.49 | < 0.001 |
| Compulsivity, Y-BOCS (total score) | 0.1 (± 0.5) | – | 24.11 (± 13.0) | – | – | – |
| Compulsivity, OCDUS (total score) | – | 26.0 (± 7.8) | – | – | – | – |
| Age of onset (years) of stimulant abuse or of OCD | – | 20.5 (± 5.4) | 17.1 (± 11.0) | – | – | – |
| Duration (years) of stimulant abuse or of OCD | – | 11.7 (± 7.4) | 18.3 (± 10.6) | – | – | – |
Reproduced with permission from Ersche et al. (2011)
NART National Adult Reading Test, BDI-II Beck Depression Inventory, version II (Beck et al. 1996), BIS-11 Barratt Impulsiveness Scale, version 11 (Patton et al. 1995), Y-BOCS Yale-Brown Obsessive-Compulsive Scale (Goodman et al. 1989), OCDUS Obsessive-Compulsive Drug Use Scale (Franken et al. 2002)
aFisher’s exact test
Fig. 1Schematic of the serial probabilistic reversal learning task, used with permission from Ersche et al. (2011). Two abstract stimuli were presented on either side of the screen, the participant selected one using a button press, and feedback was immediately given in the centre of the screen in the form of a green smiley face or a red frowning face. A probabilistic error occurred when a participant received spurious negative feedback after making the correct choice, which was rare and should therefore be ignored. A reversal error, on the other hand, was one where feedback was now truly negative, indicating the reversal had occurred, contingencies have thus changed and behaviour should be updated
Fig. 2Schematic of the Bayesian hierarchy used in our analysis, illustrated here for a single parameter (reward rate). HC healthy controls
Model parameter prior distributions
| Models using each parameter | Prior | Reference, if applicable | |
|---|---|---|---|
| Model parameters | |||
| Reward learning rate, | 3, 4a, 4b, 4c | Beta(1.2, 1.2) | den Ouden et al. ( |
| Punishment learning rate, | 3, 4a, 4b, 4c | Beta(1.2, 1.2) | den Ouden et al. ( |
| Combined reward/punishment learning rate, | 1, 2 | Beta(1.2, 1.2) | den Ouden et al. ( |
| Reinforcement sensitivity, | 1, 2, 3, 4a, 4b, 4c, 5 | Gamma( | Gershman ( |
| Location (side) stickness, | 4a, 4c | Normal(0, 1) | Christakou et al. ( |
| Stimulus stickness, | 2, 4b, 4c | Normal(0, 1) | Christakou et al. ( |
| Experience decay factor, | 5 | Beta(1.2, 1.2) | den Ouden et al. ( |
| Decay factor for previous payoffs, | 5 | Beta(1.2, 1.2) | den Ouden et al. ( |
| Softmax inverse temperature, | 5 [note that | Gamma( | Gershman ( |
| Intersubject variability in parameters | |||
| Intersubject standard deviations for | As above | Half-normal: Normal(0, 0.05) constrained to ≥0 | |
| Intersubject standard deviations for | As above | Half-normal: Normal(0, 1) constrained to ≥0 | Gershman ( |
rew reward, pun punishment, reinf. reinforcement, loc location, stim stimulus
Comparison of model performance
| Rank | Name | Parameters | Log marginal likelihood | Log posterior P (model) |
|---|---|---|---|---|
| 7 | Model 1 | −16984.66 | −503.8250 | |
| 3 | Model 2 | −16687.72 | −206.8821 | |
| 6 | Model 3 | −16835.28 | −354.4418 | |
| 4 | Model 4a | −16732.50 | −251.6656 | |
| 2 | Model 4b | −16585.12 | −104.2815 | |
| 1 | Model 4c | −16480.83 | 0.0000 | |
| 5 | Model 5 | −16821.35 | −340.5171 |
Models are listed in order of increasing complexity and nestedness. Model ranked 1st was the winning model. Model names and parameters correspond to Table 2. The log marginal likelihood and log posterior P (model) are comparison metrics used to determine the best model. A numerically larger, i.e. less negative, log marginal likelihood is better. The prior probabilities of all models were equal
rew reward, pun punishment, reinf. reinforcement, loc location, stim stimulus
Between-group effects on parameters from the winning model
| SUD vs HC | OCD vs HC | SUD vs OCD | |||||
|---|---|---|---|---|---|---|---|
| Parameter | Placebo | Effects of amisulpride | Effects of pramipexole | Placebo | Effects of amisulpride | Effects of pramipexole | Placebo |
| Reward learning rate, | ↓ | ↑ | ↑ | ↓ | |||
| Punishment learning rate, | ↑ | ↓ | ↑ | ||||
| Reinforcement sensitivity, | ↓ | ↓ | |||||
| Location (side) stickness, | ↓ | ↓ | ↑ | ||||
| Stimulus stickness, | ↑ | ↓ | ↓ | ↑ | |||
HC healthy controls. Contrasts shown are (left to right) SUD_placebo – HC_placebo; [(SUD_drug – SUD_placebo) – (HC_drug – HC_placebo)] for amisulpride, and separately for pramipexole; OCD_placebo – HC_placebo; [(OCD_drug – OCD_placebo) – (HC_drug – HC_placebo)] for amisulpride, and separately for pramipexole; SUD_placebo – OCD_placebo. These results correspond to Figs. 3 and 5a, b. Arrows denote an increase or decrease of a parameter in a given contrast. Lack of an arrow indicates no difference
Within-group drug effects on parameters from the winning model
| HC | SUD | OCD | ||||
|---|---|---|---|---|---|---|
| Parameter | Amisulpride | Pramipexole | Amisulpride | Pramipexole | Amisulpride | Pramipexole |
| Reward learning rate, | ↑ | ↑ | ||||
| Punishment learning rate, | ↑ | ↑ | ↓ | ↑ | ↑ | |
| Reinforcement sensitivity, | ↓ | ↓ | ↓ | |||
| Location (side) stickness, | ↓ | ↓ | ||||
| Stimulus stickness, | ↓ | |||||
All effects are comparisons between drug and placebo within a group. HC healthy controls. Within group comparisons: HC_amisulpride – HC_ placebo; HC_pramipexole – HC_placebo; likewise for SUD and OCD. These results correspond to Fig. 4a–c. Arrows denote an increase or decrease of a parameter in a given contrast. Lack of an arrow indicates no difference
Fig. 3Differences between groups on placebo. Reinf. reinforcement, HC healthy controls. The optimal computational model contained parameters measuring (from top to bottom) learning from positive feedback, learning from negative feedback, sensitivity to reinforcement, a tendency to repeat choices on a recently chosen side (side stickiness), and a tendency to repeat choices to a recently chosen stimulus (stimulus stickiness). Differences in parameter per-group means under placebo; posterior 0 ∉ 95% HDI signified in red
Fig. 4Effects of amisulpride and pramipexole (relative to placebo) in each group. Reinf. reinforcement, HC healthy controls. a Difference in parameter per-condition means between HC on amisulpride or pramipexole compared to HC on placebo; posterior 0 ∉ 95% HDI signified in red, 0 ∉ 90% HDI in orange; HC_drug – HC_placebo. b Difference in parameter per-condition means between SUD under amisulpride or pramipexole compared to SUD on placebo, posterior 0 ∉ 95% HDI; SUD_drug – SUD_placebo. c Difference in parameter per-condition means between OCD on amisulpride or pramipexole compared to OCD on placebo, posterior 0 ∉ 95% HDI; OCD_drug – OCD_placebo
Fig. 5Differences in the effects of amisulpride or pramipexole between patient groups and healthy controls. All contrasts represent the difference between drug X’s effect in the patient group and its effect in the control group. Reinf. reinforcement, HC healthy controls. a [(SUD_drug – SUD_placebo) – (HC_drug – HC_placebo)] for amisulpride, and separately for pramipexole; posterior 0 ∉ 95% HDI signified in red, 0 ∉ 90% HDI in orange. b) Posterior 0 ∈ 95% HDI denoted in black, indicating no differences. This was a subtraction of [(OCD_drug – OCD_placebo) – (HC_drug – HC_placebo)] for amisulpride, and separately for pramipexole