| Literature DB >> 28536513 |
Ashwin G Ramayya1, Isaac Pedisich2, Deborah Levy2, Anastasia Lyalenko2, Paul Wanda2, Daniel Rizzuto2, Gordon H Baltuch1, Michael J Kahana2.
Abstract
Neuronal firing in the substantia nigra (SN) immediately following reward is thought to play a crucial role in human reinforcement learning. As in Ramayya et al. (2014a) we applied microstimulation in the SN of patients undergoing deep brain stimulation (DBS) for the treatment of Parkinson's disease as they engaged in a two-alternative reinforcement learning task. We obtained microelectrode recordings to assess the proximity of the electrode tip to putative dopaminergic and GABAergic SN neurons and applied stimulation to assess the functional importance of these neuronal populations for learning. We found that the proximity of SN microstimulation to putative GABAergic neurons predicted the degree of stimulation-related changes in learning. These results extend previous work by supporting a specific role for SN GABA firing in reinforcement learning. Stimulation near these neurons appears to dampen the reinforcing effect of rewarding stimuli.Entities:
Keywords: GABA; Parkinson's disease; dopamine; human; microstimulation; neuron; reinforcement learning; substantia nigra
Year: 2017 PMID: 28536513 PMCID: PMC5422436 DOI: 10.3389/fnhum.2017.00200
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Figure 1Reinforcement learning task. (A) Subjects performed a reinforcement learning task with consistent stimulus-response mapping. The visual stimuli presented during the choice and feedback interval are shown. Feedback was provided probabilistically in accordance with one of four reward probability regimes that were re-assigned every 20 trials. (B) Each subject's intra-operative session was divided into two stages. During stage 1 (40 trials), we obtained microelectrode recordings and the assigned reward probabilities were either 0.8:0.2 or 0.2:0:0.8 red:blue, whereas during stage 2 (160 trials) we applied SN microstimulation, and the assigned reward probabilities were one of the following: 0.8:0.2, 0.7:0.3, 0.3:0.7, or 0.2:0:0.8). See Section 2 for additional details. (C) Subjects demonstrated a greater win-stay than expected by chance during both stage 1 and 2. (D) Subjects made the high reward probability choice with greater frequency during the last 10 trials of a reward probability regime as compared to the first 10 trials. Error bars indicate standard error of mean (s.e.m) across subjects. *indicates p < 0.001; see main text for statistics.
Summary of participant data.
| 1 | 46 | M | 0.61 | 0.76 | +0.03 | 20.0 | 0.74 |
| 2 | 62 | F | 0.60 | 0.84 | +0.02 | 5.19 | 1.04 |
| 3 | 50 | M | 0.84 | 1 | −0.02 | 12.8 | 0.44 |
| 4 | 68 | F | 0.87 | 1 | 0 | 7.15 | 1.04 |
| 5 | 68 | M | 0.74 | 1 | 0 | 14.7 | 0.92 |
| 6 | 75 | M | 0.63 | 0.85 | −0.07 | 48.2 | 0.48 |
| 7 | 60 | M | 0.57 | 0.73 | −0.08 | 20.4 | 0.44 |
| 8 | 66 | F | 0.60 | 0.81 | −0.01 | − | − |
| 9 | 69 | F | 0.72 | 0.93 | −0.04 | 17.4 | 0.40 |
| 10 | 66 | M | 0.68 | 0.89 | −0.09 | 33.0 | 0.40 |
| 11 | 72 | F | 0.70 | 0.94 | −0.02 | 10.9 | 0.16 |
Columns 4–6 describe behavioral changes during stage 2. Columns 7–8 describe properties of multi-unit activity recorded during stage 1. “–” indicates missing data. We did not identify spiking activity from subject #8.
Figure 2Stimulation-related change in learning. (A) Each subjects' probability of win stay during stage 2 is indicated by an “x,” following control trials on the left and following stimulation trials on the right. (B) Across subjects, we observed a trend toward an stimulation-related decrease in learning (p = 0.068). Error bars indicate standard error of mean (s.e.m) across subjects; see main text for statistics.
Figure 3Stimulation-related changes in learning are related to recorded neural activity. (A) Stimulation-related changes in learning during stage 2 were negatively correlated with mean spike rate of units recorded on that electrode during stage 1 (Pearson's r = −0.64, p = 0.045). (B) Same as (A) but demonstrating a positive correlation between stimulation-related changes in learning and mean waveform duration (Pearson's r = 0.64, p = 0.044). Each dot represents a subject, the solid black line is the regression slope, and the dashed lines represent 95% confidence intervals. (C) Neural recordings of multi-unit activity observed from two subjects (shown in red in A,B). For each unit, we show the average waveform (top left, gray shading marks the standard deviation), the inter-spike interval (bottom left, dashed line marks 3 ms), the average post-reward firing response (top right, smoothed with a Gaussian kernel of half-width = 75 ms; gray shading indicates s.e.m), and the spike raster following reward trials. Dashed black line indicates reward onset.