| Literature DB >> 36160041 |
Julia Pai1, Takaya Ogasawara1, Ethan S Bromberg-Martin1, Kei Ogasawara1, Robert W Gereau1,2,3,4, Ilya E Monosov1,2,4,5,6.
Abstract
Neuroeconomics studies how decision-making is guided by the value of rewards and punishments. But to date, little is known about how noxious experiences impact decisions. A challenge is the lack of an aversive stimulus that is dynamically adjustable in intensity and location, readily usable over many trials in a single experimental session, and compatible with multiple ways to measure neuronal activity. We show that skin laser stimulation used in human studies of aversion can be used for this purpose in several key animal models. We then use laser stimulation to study how neurons in the orbitofrontal cortex (OFC), an area whose many roles include guiding decisions among different rewards, encode the value of rewards and punishments. We show that some OFC neurons integrated the positive value of rewards with the negative value of aversive laser stimulation, suggesting that the OFC can play a role in more complex choices than previously appreciated.Entities:
Keywords: aversion; decision; motivation; neuroeconomics; orbitofrontal; value
Year: 2022 PMID: 36160041 PMCID: PMC9499993 DOI: 10.1016/j.crmeth.2022.100296
Source DB: PubMed Journal: Cell Rep Methods ISSN: 2667-2375
Some opportunities and limitations of different methods of aversive stimulus delivery
| Acute stimulus type | Millisecond-by-millisecond temporal precision | Compatible with electrophysiology | Compatible with functional magnetic imaging | Compatible with calcium imaging | No excessive or extraneous sensory component | Dynamic (trial-by-trial) adjustment of intensity | Dynamic (trial-by-trial) adjustment of precise stimulation location | Suitability for psychophysics (many trials within single session) |
|---|---|---|---|---|---|---|---|---|
| Pressure (von Frey) | no | yes | yes | yes | no | yes | difficult | no |
| Shock | yes | no | difficult | yes | yes | yes | not applicable | yes |
| Thermal—hot plate | difficult | yes | difficult | yes | yes | yes | not applicable | no |
| Thermal—probe | difficult | yes | difficult | yes | yes | yes | difficult | no |
| Air puff | yes | yes | difficult | yes | no | difficult | difficult | difficult |
| Laser | yes | yes | yes | yes | yes | yes | yes | yes |
Figure 1Valuation and aversive decision-making in non-human primates using laser stimulation
(A) Cartoon schematic of the experiment and laser parameters (top).
(B and C) Monkeys chose between two offers, which contained information about juice reward quantity and punishment magnitude (laser power). Each offer contained two bars, the height of which conveyed the appetitive (reward quantity) and aversive (laser power) attributes of each offer.
(D) Two monkeys’ (M1 and M2) average choice behavior indicates that the negative value of the laser grows as a function of laser power. y axis: percentage of choices of offers that contained laser stimulation versus those that contained no laser stimulation. x axis: choices organized by the difference in reward quantity between offers that predicted laser stimulation versus no laser stimulation. The choices are shown for the three laser powers separately (0.5–1.5 J). Shaded areas are confidence intervals. Indifference (50% of choosing either offer) is shown by a dashed line.
(E) Weights from a logistic regression fit to trials pooled from both subjects. Monkeys weighted increasing amounts of reward more positively and increasing laser powers more negatively. Error bars are ±1 SE. Asterisks indicate significance of each weight, and additional asterisks in between the bars indicate differences between weights of adjacent reward amounts or laser powers. ∗∗∗p < 0.001.
Figure 2Measuring motivational value of water rewards and laser punishments in head-fixed mice
(A) Behavioral task diagram.
(B) Time courses of licking activity aligned to tone CS onset, averaged over trials from all animals (S2: 540 trials, S5: 600 trials, S7: 840 trials, S8: 901 trials) over sessions after the animal learned the tones (18 sessions total; S2 = 3 sessions, S5 = 5 sessions, S7 = 5 sessions, S8 = 5 sessions). Shaded error bars are ±1 SE. Mice show anticipatory licking for water shortly after the onset of the reward CS tone, no licking to the neutral CS, and a small but significant increase in licking at the beginning of the punishment CS. (B and C, bottom) Colored bars show significant differences in time among the conditioned responses to reward and neutral (blue), punishment and neutral (red), and reward and punishment (purple) CSs (rank-sum test, p < 0.001). Gray bar shows time window used for analyses in (D)–(G) (last 500 ms of CS epoch before the US). (B, right inset) Licking activity from an example session from S7. Licks on individual trials are shown in gray on each row (60 trials for each condition). Average activity is shown in the overlaid line (blue = reward, gray = neutral, red = punishment).
(C) Time courses of normalized running activity aligned to CS time; conventions are same as in (B). Running speed was normalized within each session by first Z scoring running speeds across trials, then normalizing Z scores between 0 and 1. Mice show the greatest increase in running in anticipation of laser punishments. On some trials, mice flinch, pausing and/or running backwards before laser delivery, leading to a dip in average running activity before the US. Mice, on average, increase their running near the beginning of the reward CS but slow down in anticipation of receiving the water reward. (C, right inset) Raw running speeds from the same example session as (B). In the heatmap, red indicates forward speed, and blue indicates backward speed. Average activity is shown in overlaid line.
(D) Average of all licking (left) and running (right) activity, averaged over individual learned session averages. Different colored lines indicate average activity of different mice. Error bars are ±1 SE over session averages. Average licking activity was significantly different between reward and neutral conditions (p = 1.964 × 10−4, signed rank test) and between reward and punish conditions (p = 1.96 × 10−4). Average running activity was significantly different between punish and neutral conditions (p = 1.96 × 10−4) and between punish and reward conditions (p = 3.86 × 10−4).
(E) Licking (top) and running (bottom) for all animals across sessions of training. Mice quickly acquired conditioned responses to the reward and punish CSs, with differences in conditioned behavior to the different tones emerging after 1–3 sessions. Error bars are ±1 SE across trials.
(F and G) Average of all licking (F) and running (G) activity for the 5-condition Pavlovian conditioning paradigm. Naive mice were trained to associate 5 different tones with big water reward, small water reward, neutral (no outcome), small laser punishment, and big laser punishment outcomes. Colored lines are individual mice averages across sessions; error bars are ±1 SE across sessions (total sessions = 25; S15 = 5 sessions, S16 = 6 sessions, S17 = 7 sessions, S18 = 7 sessions).
(F) Mice show graded anticipatory licking to different CSs, licking most to the big reward CS, less to the small reward CS, and least to the neutral and punish CSs. Differences were significant between large reward and neutral (p = 1.25 × 10−5), small reward and neutral (p = 1.25 × 10−5, small punish and neutral (p = 0.025), large reward and small reward (p = 0.00942), large reward and small punish (p = 1.39 × 10−5), large reward and large punish (1.23 × 10−5), small reward and small punish (p = 1.57 × 10−5), and small reward and large punish (p = 2.54 × 10−5).
(G) Mice show graded anticipatory running to increasing negative value, running most to the big punish CS and least to the big reward CS. Differences were significant between large reward and neutral (p = 7.33 × 10−4), small reward and neutral (p = 0.00143), small punish and neutral (p = 0.00351), large punish and neutral (p = 3.22 × 10−5), large punish and small punish (p = 2.4 × 10−4), large reward and small punish (p = 7.22 × 10−5), large reward and large punish (p = 8.09 × 10−5), small reward and small punish (p = 1.26 × 10−4), and small reward and large punish (p = 1.01 × 10−4; Wilcoxon signed rank test).
Figure 3Neural correlates of aversive decision-making in orbitofrontal cortex
(A) Example single neuron’s activity is related to the subjective value of reward (left column), punishment (middle column), and their total integrated subjective value (right column). The effects were significant during offer 2. Error bars are ±1 SE. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001 in all panels of this figure.
(B) Summary of rank correlation coefficients (rho) from (A). Error bars are bootstrapped confidence intervals (200 bootstraps).
(C) (Left) Percentage of neurons signaling subjective value of reward, punishment, and their total subjective value (STAR Methods). (Right) Among punishment subjective value neurons, a significant percentage of neurons’ activities signaled the subjective value of reward; similarly, among reward subjective value neurons, a significant percentage of neurons’ activities signaled the subjective value of punishment. Error bars are bootstrapped confidence intervals (200 bootstraps). Asterisks indicate a significantly higher proportion of neurons than expected by chance (one-tailed binomial test). The number of neurons are indicated for each analysis.
(D) Reward value neurons integrated punishment value. Reward subjective value neurons were selected as those having significant reward subjective value coding at a p < 0.05 threshold during offer 1 (left) and during offer 2 (right). Next, a separate receiver operating characteristic (ROC) analysis was used to measure the discrimination of reward magnitude and punishment magnitude for each neuron during non-overlapping trials. During offer 2, the reward and punishment discrimination indices were negatively correlated, meaning that neurons tended to encode reward and punishment with opposite signs, consistent with a total subjective value representation. Each dot is a neuron. Least square linear fits are shown for significant correlations (red). Correlations were Spearman’s rank correlations. We verified that increasing the threshold for inclusion of neurons to p <0.001 (e.g., effectively tightening the definition of reward value coding; STAR Methods) did not change the results. These data are shown as insets for each offer. In fact, the stricter inclusion strengthened, not weakened, the correlation.
(E) The proportion of the total time spent looking at the offer that monkeys spent looking at the punishment bar. Monkeys spent significantly more of this time looking at the punishment bar during offer 2 than during offer 1 (rank sum test; p < 0.001).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Phosphate buffered saline | Gibco | CAT#70011069 |
| Paraformadehyde | Sigma-Aldrich | CAT#158127-100G |
| Ketamine-xylazine cocktail | Nexgen | SKU#NC-0254 |
| Rhesus macaque | PrimGen | |
| Mouse | Jackson Labs | |
| MATLAB | Mathworks | |
| pyElectrode | ||
| MATLAB toolbox for behavioral control (PLDAPS) | ||
| 32 channel linear array | Plexon | V-probes |
| Oil-driven micromanipulator | Narishige | MO-97A |
| 40kHz neural recording data acquisition system | Plexon | Omniplex |
| Eye tracker | SR research | EyeLink 1000 Plus |
| Behavioral data acquisition system | VPixx | DataPixx |
| Nd:YAP laser | Electronic Engineering (Florence, Italy) | Stimul 1340 |