| Literature DB >> 26019312 |
Max-Philipp Stenner1, Robb B Rutledge2, Tino Zaehle3, Friedhelm C Schmitt4, Klaus Kopitzki4, Alexander B Kowski5, Jürgen Voges6, Hans-Jochen Heinze3, Raymond J Dolan2.
Abstract
Functional magnetic resonance imaging (fMRI), cyclic voltammetry, and single-unit electrophysiology studies suggest that signals measured in the nucleus accumbens (Nacc) during value-based decision making represent reward prediction errors (RPEs), the difference between actual and predicted rewards. Here, we studied the precise temporal and spectral pattern of reward-related signals in the human Nacc. We recorded local field potentials (LFPs) from the Nacc of six epilepsy patients during an economic decision-making task. On each trial, patients decided whether to accept or reject a gamble with equal probabilities of a monetary gain or loss. The behavior of four patients was consistent with choices being guided by value expectations. Expected value signals before outcome onset were observed in three of those patients, at varying latencies and with nonoverlapping spectral patterns. Signals after outcome onset were correlated with RPE regressors in all subjects. However, further analysis revealed that these signals were better explained as outcome valence rather than RPE signals, with gamble gains and losses differing in the power of beta oscillations and in evoked response amplitudes. Taken together, our results do not support the idea that postsynaptic potentials in the Nacc represent a RPE that unifies outcome magnitude and prior value expectation. We discuss the generalizability of our findings to healthy individuals and the relation of our results to measurements of RPE signals obtained from the Nacc with other methods.Entities:
Keywords: deep brain stimulation; local field potentials; nucleus accumbens; reward prediction error
Mesh:
Year: 2015 PMID: 26019312 PMCID: PMC4533060 DOI: 10.1152/jn.00260.2015
Source DB: PubMed Journal: J Neurophysiol ISSN: 0022-3077 Impact factor: 2.714
Clinical data
| ID | Gender/Age, yr/Disease Duration, yr | Epilepsy Syndrome | Etiology | Seizure Lateralization | Seizure Onset | AED |
|---|---|---|---|---|---|---|
| P1 | M/39/9 | Multifocal | Cryptogenic | Bilateral | Bifrontal | LCM: 400 mg ZNS: 400 mg |
| P2 | M/32/31 | Multifocal | Genetic (SCNA1) | Left, possibly bilateral | Bifrontal and left medio-temporal | OXC: 900 mg Clobazam: 5 mg STP: 4,500 mg |
| P3 | F/52/17 | Focal | Cryptogenic | Left | Temporal | LTG: 250 mg LCM: 400 mg |
| P4 | F/44/14 | Multifocal | Right hippocampal sclerosis | Bilateral | Temporal | CBZ: 1,200 mg |
| P5 | M/40/31 | Focal | Left hippocampal sclerosis | Left | Temporal | LTG: 400 mg LCM: 400 mg |
| P6 | M/43/38 | Left temporal | Posttraumatic lesion | Left | Temporal | LCM: 400 mg LEV: 3,000 mg |
AED, antiepileptic drug; P, patient; M, male; F, female; LCM, lacosamide; ZNS, zonisamide; OXC, oxcarbazepine; STP, stiripentol; CBZ, carbamazepine; LTG, lamotrigine.
Patient underwent right temporal lobe resection 3 yr before deep brain stimulation (DBS) surgery.
Patient underwent left temporal lobe resection 9 yr before DBS surgery.
Fig. 1.Location of deep brain stimulation (DBS) electrodes and task. A: exemplary presurgical MRI showing the projection of the planned placement of the 2 nucleus accumbens (Nacc) electrodes onto this MRI slice. B: schematic of 1 trial in which the gamble option is chosen.
Fig. 2.Model-based analysis of patient decisions. A: the 6 patients varied in their tendency to gamble, choosing a risky gamble option over the safe option in 46–82% of trials. B: 4 patients made decisions consistent with the value function in parametric decision models. A loss aversion coefficient >1 indicates behavioral loss aversion, a greater impact of potential losses on choice than equivalent potential gains. C: compared with model 1 (which accounted for a gambling tendency but no value dependency in choice), models that included the gamble expected value (model 2) or a gamble subjective value that depended on a loss aversion coefficient (model 3) were preferred by Bayesian model comparison in 4 subjects (P1, P2, P3, and P4), which penalizes for model complexity. BIC, Bayesian Information Criterion.
Fig. 3.Outcome valence effects on time-frequency responses and evoked potentials. A: outcome valence effects on time-frequency responses up to 40 Hz (see methods for details). Color codes for t-values (gamble gains vs. gamble losses). Opaque areas are P ≥ 0.05. There is a consistent, significant increase in the power of beta oscillations within the first second after outcome onset, which is greater for gains than losses in three subjects. B: gain-evoked responses (green) have a greater (more positive) amplitude than loss-evoked responses (red) within the first few hundred milliseconds after outcome onset. In 4 of the 5 patients, there is also a later reversal of this effect. Shaded areas indicate significant (P < 0.05) time windows.
Fig. 4.Expected value effects on time-frequency responses and evoked potentials. In the time-frequency plots, color codes for a regression t-statistic. Opaque areas are P ≥ 0.05. In the time-domain plots, the red and green time courses represent averages across trials with a relatively low or high expected value of the gamble option, respectively (as determined by a median split). Shading in the time-domain plots indicates significant (P < 0.05) time windows. In both time-frequency and time-domain plots, the perioption time window (at left, for each patient and hemisphere) includes all trials (irrespective of whether the gamble option or the safe option was chosen). The periresponse and perioutcome plots (at right, for each patient and hemisphere) includes only those trials in which a gamble option was chosen. Three patients show significant preoutcome expected value signals, but these signals vary in frequency and latency. Only a single patient (P2) shows an expected value signal after outcome onset, and that signal is not consistent with an expected value component of a unified RPE signal (see results). Note that raw LFP data were band-stop filtered to remove line noise (48.5 to 51.5 Hz and harmonics up to 250 Hz).
Effects of expected value before gamble outcome on the amplitude of phase-locked responses and time-frequency power
| Effects of Expected Value Before Outcome on... | |||||||
|---|---|---|---|---|---|---|---|
| Phase-locked amplitude (“evoked response”) | Time-frequency resolved power | ||||||
| Patient | Side | Latency | Sign | Side | Latency | Frequency | Sign |
| P1 | Left | 450–610 ms postchoice | + | Right | 500–875 ms postoptions | 40–90 Hz | − |
| P2 | Left | 850–1,000 ms prechoice | − | Left | 625–1,050 ms prechoice | ≥185 Hz | + |
| Left | 700–1,000 ms prechoice | 10–15 Hz | − | ||||
| P3 | Ø | Ø | |||||
| P4 | Left | 765–930 ms postchoice | + | Left | 250 to 1,025 postchoice | 2.5–12.5 Hz | – |
| P5 | Ø | Ø | |||||
All P < 0.05 unless marked (§) after correction for multiple comparisons across both hemispheres and all time bins within each time window of interest (see Expected value effects in results). Time-frequency results are also corrected across all frequency bins of interest (2.5 to 40 or 30 to 250 Hz). Note that raw local field potential data were band-stop filtered to remove line noise (48.5 to 51.5 Hz and harmonics ≤250 Hz). Ø, no significant or near-significant clusters, independent of whether all 3 channels in each hemisphere were pooled or analyzed separately (all P > 0.1).
, +, and − refer to an increase and decrease in amplitude or power with expected value, respectively.
Only when all 3 channels of each hemisphere are analyzed separately (spatial clustering across channels; not shown in Fig. 4).
Only when all trials are taken into account, irrespective of choice (not shown in Fig. 4).
P < 0.1 (not shown in Fig. 4).