| Literature DB >> 29547978 |
Arjun Sethi1, Valerie Voon2,3, Hugo D Critchley1,4,5, Mara Cercignani1, Neil A Harrison1,4,5.
Abstract
Computational models of reinforcement learning have helped dissect discrete components of reward-related function and characterize neurocognitive deficits in psychiatric illnesses. Stimulus novelty biases decision-making, even when unrelated to choice outcome, acting as if possessing intrinsic reward value to guide decisions toward uncertain options. Heightened novelty seeking is characteristic of attention deficit hyperactivity disorder, yet how this influences reward-related decision-making is computationally encoded, or is altered by stimulant medication, is currently uncertain. Here we used an established reinforcement-learning task to model effects of novelty on reward-related behaviour during functional MRI in 30 adults with attention deficit hyperactivity disorder and 30 age-, sex- and IQ-matched control subjects. Each participant was tested on two separate occasions, once ON and once OFF stimulant medication. OFF medication, patients with attention deficit hyperactivity disorder showed significantly impaired task performance (P = 0.027), and greater selection of novel options (P = 0.004). Moreover, persistence in selecting novel options predicted impaired task performance (P = 0.025). These behavioural deficits were accompanied by a significantly lower learning rate (P = 0.011) and heightened novelty signalling within the substantia nigra/ventral tegmental area (family-wise error corrected P < 0.05). Compared to effects in controls, stimulant medication improved attention deficit hyperactivity disorder participants' overall task performance (P = 0.011), increased reward-learning rates (P = 0.046) and enhanced their ability to differentiate optimal from non-optimal novel choices (P = 0.032). It also reduced substantia nigra/ventral tegmental area responses to novelty. Preliminary cross-sectional evidence additionally suggested an association between long-term stimulant treatment and a reduction in the rewarding value of novelty. These data suggest that aberrant substantia nigra/ventral tegmental area novelty processing plays an important role in the suboptimal reward-related decision-making characteristic of attention deficit hyperactivity disorder. Compared to effects in controls, abnormalities in novelty processing and reward-related learning were improved by stimulant medication, suggesting that they may be disorder-specific targets for the pharmacological management of attention deficit hyperactivity disorder symptoms.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29547978 PMCID: PMC5917772 DOI: 10.1093/brain/awy048
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 13.501
Participant demographics and ADHD scores
| Measure | Mean (SD) | F | ||
|---|---|---|---|---|
| ADHD | Controls | |||
| 30 | 30 | – | – | |
| Male | 19 | 19 | – | – |
| Female | 11 | 11 | – | – |
| Age | 33.7 (9.51) | 32.6 (9.54) | 0.2 | 0.66 |
| Handedness | – | – | ||
| Right-dominant | 28 | 29 | – | – |
| Left-dominant | 1 | 1 | – | – |
| Ambidextrous | 1 | 0 | – | – |
| FSIQ | 109.0 (6.57) | 110.1 (7.06) | 0.4 | 0.53 |
| CAARS ADHD Index | 24.0 (5.30) | 8.6 (5.01) | 133.21 | <0.001 |
| Attention/memory problems | 26.7 (5.46) | 9.9 (5.67) | 123.48 | <0.001 |
| Hyperactivity/motor restlessness | 24.4 (6.46) | 11.3 (5.68) | 68.81 | <0.001 |
| Impulsivity/emotional lability | 23.7 (7.36) | 7.6 (4.12) | 109.13 | <0.001 |
| Problems with self-concept | 11.2 (4.72) | 5.6 (4.45) | 22.5 | <0.001 |
| DSM total ADHD score | 37.6 (9.03) | 12.8 (6.92) | 159.66 | <0.001 |
| DSM Inattention | 19.3 (4.46) | 7.0 (4.55) | 125.28 | <0.001 |
| DSM Hyperactivity and Impulsivity | 18.3 (5.66) | 5.7 (3.83) | 110.44 | <0.001 |
CAARS = Conners’ Adult ADHD Rating Scale; DSM = Diagnostic and Statistical Manual of Mental Disorders; FSIQ = Full scale intelligence quotient.
aAs estimated by National Adult Reading Test (NART) scores.
Figure 1Novelty processing task. (A) Image sets: A set of 64 greyscale pictures (SET A or SET B) was randomly allocated for each session. (B) Pre-familiarization: participants were familiarized to half of the image set by passive, then active viewing. (C) Three-armed bandit task: during functional MRI, participants chose between three options (images) on each trial. Each option had a fixed probability (mean: 33%) of winning £1. Participants were instructed to choose options that maximized their total reward. Each trial consisted of stimulus presentation (3.5 s), choice feedback (3 s), and reward feedback (superimposed £1 or £0) (1.5 s). If participants failed to respond, ‘No response’ was displayed (4.5 s). There was a jittered intertrial interval (1.5–3 s). Option locations were randomly shuffled between trials. (D) On 25% of trials one option was randomly replaced by a new one from the image set. Images differed in reward value, but novel and familiar (pre-familiarized) images had the same reward probability distributions (mean 33%). The task was split into three 13-min runs, each containing 80 consecutive trials.
Model parameter estimates in ADHD and controls
| Measure | Mean scores (SE) | |
|---|---|---|
| ADHD | Controls | |
| Qn | 0.57 (0.07) | 0.46 (0.05) |
| Qf | 0.52 (0.06) | 0.45 (0.06) |
| 0.39 (0.04) | 0.54 (0.05) | |
| 7.58 (2.06) | 8.69 (2.29) | |
| Qn | 0.62 (0.06) | 0.53 (0.05) |
| Qf | 0.56 (0.06) | 0.49 (0.05) |
| 0.48 (0.06) | 0.46 (0.06) | |
| 7.18 (1.54) | 7.58 (1.27) | |
Figure 2Effects of stimulant medication on optimal versus non-optimal novel choices. The mean number of times a novel option was continuously selected after introduction, separated according to whether it was the optimal choice (the highest value option of the three on screen) or non-optimal choice (not the highest value option of the three on screen).
Figure 3Reward prediction error ( Brain regions significantly correlating with reward prediction error (δ) across all participants and conditions. Peak activations in right and left ventral striatum are highlighted in red.
Figure 4Group by drug interaction for reward prediction error and novelty signals. (A) Brain regions demonstrating a significant Group × Drug interaction for reward prediction error (δ). Peak activation in the right ventral striatum highlighted in red. (B) Contrast estimate for right ventral striatum cluster. (C) Brain regions demonstrating a significant Group × Drug interaction for novelty signal (Q). Peak activation in the left substantia nigra highlighted in red. (D) Contrast estimate for left substantia nigra cluster.