| Literature DB >> 20740077 |
Moonsang Seo1, Mazda Beigi, Marjan Jahanshahi, Bruno B Averbeck.
Abstract
A growing body of evidence suggests that the midbrain dopamine system plays a key role in reinforcement learning and disruption of the midbrain dopamine system in Parkinson's disease (PD) may lead to deficits on tasks that require learning from feedback. We examined how changes in dopamine levels ("ON" and "OFF" their dopamine medication) affect sequence learning from stochastic positive and negative feedback using Bayesian reinforcement learning models. We found deficits in sequence learning in patients with PD when they were "ON" and "OFF" medication relative to healthy controls, but smaller differences between patients "OFF" and "ON". The deficits were mainly due to decreased learning from positive feedback, although across all participant groups learning was more strongly associated with positive than negative feedback in our task. The learning in our task is likely mediated by the relatively depleted dorsal striatum and not the relatively intact ventral striatum. Therefore, the changes we see in our task may be due to a strong loss of phasic dopamine signals in the dorsal striatum in PD.Entities:
Keywords: Parkinson's disease; dopamine; reinforcement learning; sequence learning
Year: 2010 PMID: 20740077 PMCID: PMC2927276 DOI: 10.3389/fnsys.2010.00036
Source DB: PubMed Journal: Front Syst Neurosci ISSN: 1662-5137
Demographic information for all participants and clinical characteristics of patients (mean ± SEM are shown). UPDRS scores were not obtained for the PD “OFF” group.
| Control (10) | PD “ON” (11) | PD “OFF” (10) | |
|---|---|---|---|
| Age | 66 ± 1.9 | 67 ± 1.8 | 68 ± 2.5 |
| Disease duration | – | 13.5 ± 2.2 | 11.5 ± 2.3 |
| Education (years) | 11.9 ± 0.3 | 11.3 ± 0.4 | 11.5 ± 0.4 |
| MMSE (0–30) | 29.4 ± 0.2 | 29.1 ± 0.3 | 29.6 ± 0.2 |
| Beck Depression | 7.1 ± 1.8 | 9.5 ± 1.5 | 7.8 ± 1.2 |
| Inventory (0–63) | |||
| Digit Span (0–30) | 19.4 ± 1.2 | 18.8 ± 1.0 | 19.9 ± 0.7 |
| Hoehn and Yahr | – | 2.0 ± 0.3 | 2.4 ± 0.2 |
| UPDRS (Motor section) | – | 13.1 ± 1.9 | – |
| No. blocks completed | 7.2 ± 0.5 | 7.3 ± 0.5 | 7.0 ± 0.5 |
Figure 1Sequence learning task. (A) Participants executed a sequence of four button presses using combinations of left and right handed button presses. After each button press they were given feedback about whether or not they were correct. (B) Participants completed multiple sets of trials. In each set participants executed all six sequences (S1, S2,…, S6) and sequences were executed in blocks in which participants had to learn and then execute the sequence correctly eight times before advancing to the next sequence in the block. Sequence blocks were presented in pseudo-random order.
Figure 2Average learning curves in PD groups and control across the whole block. Error bars are ±1 sem.
Figure 3Learning from positive and negative feedback. (A) Parameters from a model fit to the subset of trials before the first correct trial from each block. Positive is parameter α from Eq. 3 and negative is parameter β from Eq. 3. (B). The model fit to the trials from the entire block. (C) Distribution of positive feedback parameter across subjects within each group. (D) Within participant differences in learning from positive and negative feedback for trials before first correct (compare with A). (E) Same as (D) for all data from block (compare with B).
Figure 4Evolution of positive (A) and negative (B) feedback parameter values as the block evolved. Dotted black lines show the average across the groups. Solid lines are model estimates, large dots are data averaged across participants within each group.
Correlations between learning parameters and demographic variables.
| Demographic variable | |||
|---|---|---|---|
| Group | Learning parameter | Disease duration | Digit span |
| PD “ON” | Positive feedback | −0.59 (0.058) | 0.34 (0.299) |
| Negative feedback | −0.15 (0.668) | 0.32 (0.336) | |
| PD “OFF” | Positive feedback | −0.25 (0.488) | −0.07 (0.840) |
| Negative feedback | −0.42 (0.228) | −0.29 (0.422) | |
| Combined | Positive feedback | 0.28 (0.218) | |
| groups | Negative feedback | −0.27 (0.233) | 0.19 (0.418) |