Literature DB >> 19904261

Dissociable cost and benefit encoding of future rewards by mesolimbic dopamine.

Jerylin O Gan1, Mark E Walton, Paul E M Phillips.   

Abstract

Reward-predicting cues evoke activity in midbrain dopamine neurons that encodes fundamental attributes of economic value, including reward magnitude, delay and uncertainty. We found that dopamine release in rat nucleus accumbens encodes anticipated benefits, but not effort-based response costs unless they are atypically low. This neural separation of costs and benefits indicates that mesolimbic dopamine scales with the value of pending rewards, but does not encode the net utility of the action to obtain them.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19904261      PMCID: PMC2800310          DOI: 10.1038/nn.2460

Source DB:  PubMed          Journal:  Nat Neurosci        ISSN: 1097-6256            Impact factor:   24.884


For individuals to prosper in diverse environments, they need to use predictive sensory information to optimize outcomes in a flexible manner. Decision-making processes weigh the benefits of a reward with the cost of obtaining it to determine the overall subjective value (utility) of the transaction1,2. One neural substrate highly implicated in this valuation process is dopamine. Midbrain dopamine neurons encode fundamental economic parameters pertaining to predicted rewards (magnitude, probability, delay and uncertainty) in their firing rate3–6 and innervate areas implicated in economic decision making (prefrontal cortex, amygdala, dorsal striatum and nucleus accumbens)7–9. Moreover, dopamine in the nucleus accumbens core (NAcc) enables animals to respond to cues and overcome effortful response costs10,11. However, to fully understand decision-making computations encoded by the mesoaccumbens dopamine pathway, the nature of the valuation signal needs to be deconstructed: specifically, how it accounts for changes in anticipated costs as well as benefits. We employed fast-scan cyclic voltammetry to record phasic dopamine transmission in NAcc while rats performed decision-making tasks that independently manipulated either benefits or cost. All procedures were approved by the University of Washington Institutional Animal Care and Use Committee. Animals were trained to select between a reference option (sixteen lever presses for one food pellet) and an alternative that differed in either the reward magnitude (four or zero food pellets: benefit conditions) or response requirement (two or thirty two lever presses: cost conditions) as described in Supplementary Methods. Cues signaling the availability of the reference and/or alternative options were presented either separately in “forced” trials or simultaneously in choice trials (Fig. 1a). Forced trials allowed the evaluation of cue-evoked dopamine for one option without the confound of another option present, and choice trials provided a measure of behavioral preference. Data were evaluated after animals reached a behavioral criterion – choosing one option on ≥75% of choice trials. To prevent side-bias, assignment of high-/low-utility options to the two levers were always reversed from the previous session, and counterbalanced sessions for each contingency pair were included in the analysis.
Figure 1

(a) Example trials in the benefit condition. Center schematic represents cue lights (yellow star: active, gray circle: inactive), levers (trapezoid: present, line: retracted) flanking the food magazine. Each frame represents response options on one trial (white background: forced; gray background: choice). Left and right panels represent example dopamine release evoked by presentation of cue (dashed line) predicting the availability of a response option resulting in four or one food pellets respectively. Color plot are two-dimensional plot of cyclic voltammograms over time with electrochemical current represented in color. (b) Post-criterion choice behavior (top) and cue-evoked dopamine release (bottom) across sessions in benefit and cost conditions. Data are mean ± s.e.m.; * p<0.05, ** p<0.01, *** p<0.0001.

Across all contingency pairs, animals consistently chose the option with the highest benefit or lowest cost (Fig. 1b; see Supp. fig. 4a for rate to criterion). Subjective preference was also evident on post-criterion forced trials where response latencies were significantly faster to higher-benefit or lower-cost options (all p<0.001; Supp. fig. 4c). Furthermore, when the high-benefit (4 pellets for 16 lever presses) and the low-cost (1 pellet for 2 lever presses) options were presented as concurrent choices in a decision-making session, animals were indifferent, demonstrating equivalent utility (Supp. fig. 5). Thus, not only was the utility of reward options successfully modulated as expected by both benefit and cost conditions (i.e. increased utility conferred to the option with greater benefit or lower cost), the additional utility conferred by increased benefits was equivalent to that conferred by decreased costs. Despite predictable behavior, cue-evoked NAcc dopamine release did not track utility under all conditions. Manipulating reward magnitude led to a corresponding increase (main effect of reward size: F1,5=15.61, p=0.01) or decrease (F1,4=19.88, p=0.01) in cue-evoked dopamine compared to the reference option (Fig. 1b, Supp. fig. 6). Manipulations of response cost, on the other hand, did not always alter dopamine release. When the response cost of the alternative was increased, there was no difference in dopamine release between the reference and alternative option (main effect of response cost: F1,4=0.05, p=0.84, Fig. 1b) despite the strong behavioral preference for the reference option. When the response cost was reduced, there was greater dopamine release to the low-cost cue than to the reference (F1,4=25.38, p=0.007), but this was only significant in the first of two counterbalanced sessions in each animal (session×option interaction: p=0.03, F1,4=10.92; Supp. fig. 6). Post-hoc tests indicated that this effect was driven by a reduction in dopamine release to the low-cost (p=0.0006) but not the reference cue (p=0.20) across sessions. To further investigate across-session effects, we performed regression analysis between utility encoding and experience with any "alternative" contingency prior to recording. Experience-related changes in cue-evoked dopamine release were only observed in the reduced-cost condition where the preferential dopamine release for the low-cost cue diminished over time (Pearson’s r=−0.830, p=0.005, n=9; Spearman’s rho=−0.817, p=0.007; Fig. 2a). Additional experimentation with a cohort of animals given more experience (>9 sessions) with the high-benefit option prior to recording verified that both behavioral preference and preferential encoding of the higher benefits was maintained with extended training (p=0.007, t=4.08, df=6, n=7 session; Fig. 2b). Conversely, in a parallel experiment with the low-cost option, cue-evoked dopamine release did not preferentially encode the low-cost option after additional experience prior to recording (p=0.16, t=1.55, df=8, n=9 sessions), even though behavioral preference is preserved (Fig. 2b). These data are consistent with the notion that, while preferential encoding of high benefit by dopamine release is stable over training, low costs are only preferentially encoded early in training. Further analyses of the neurochemical data with respect to contextual framing, choice trials and within-session learning are included in the Supplementary Results.
Figure 2

Effect of behavioral history on dopamine release. (a) Differences in cue-evoked dopamine release between the high- and low-utility options ([DA]HU – [DA]LU) against behavioral history. (b) Post-criterion choice behavior (left panel), cue-evoked dopamine release (right panel) for the high-benefit (4 food pellets for 16 lever presses; left) or low-cost (1 food pellet for 2 lever presses; right) option in animals given extended training (>9 sessions) with either contingency prior to testing. Data are mean ± s.e.m.; * p<0.05, ** p<0.01.

In making sound economic choices, one must consider a reasonable cost to obtain an outcome based on its perceived benefit. The data presented here demonstrate that phasic NAcc dopamine transmission reliably reflects the magnitude of the benefit, but only correlates with effort-discounted utility in situations where the response cost is both novel and better than the reference. Incorporating these findings with previous studies showing that dopamine enables effortful responses, we reason that representation of reward magnitude by phasic dopamine provides a threshold to determine worthwhile cost expenditures in familiar situations10–12. Moreover, in novel situations dopamine provides an additional opportunistic mechanism for exploitation of low-cost rewards that become available unexpectedly12–13. Thus, we show a dissociation between dopaminergic encoding of anticipated costs and benefits, demonstrating that, while dopamine release in the nucleus accumbens scales with the value of a pending reward, it is not sufficient to describe the net utility of the action to obtain it.
  11 in total

1.  Discrete coding of reward probability and uncertainty by dopamine neurons.

Authors:  Christopher D Fiorillo; Philippe N Tobler; Wolfram Schultz
Journal:  Science       Date:  2003-03-21       Impact factor: 47.728

Review 2.  Calculating utility: preclinical evidence for cost-benefit analysis by mesolimbic dopamine.

Authors:  Paul E M Phillips; Mark E Walton; Thomas C Jhou
Journal:  Psychopharmacology (Berl)       Date:  2006-11-22       Impact factor: 4.530

3.  Physiological utility theory and the neuroeconomics of choice.

Authors:  Paul W Glimcher; Michael C Dorris; Hannah M Bayer
Journal:  Games Econ Behav       Date:  2005-08

4.  Midbrain dopamine neurons encode decisions for future action.

Authors:  Genela Morris; Alon Nevet; David Arkadir; Eilon Vaadia; Hagai Bergman
Journal:  Nat Neurosci       Date:  2006-07-23       Impact factor: 24.884

Review 5.  The short-latency dopamine signal: a role in discovering novel actions?

Authors:  Peter Redgrave; Kevin Gurney
Journal:  Nat Rev Neurosci       Date:  2006-11-08       Impact factor: 34.870

Review 6.  Ventral tegmental area neurons in learned appetitive behavior and positive reinforcement.

Authors:  Howard L Fields; Gregory O Hjelmstad; Elyssa B Margolis; Saleem M Nicola
Journal:  Annu Rev Neurosci       Date:  2007       Impact factor: 12.449

Review 7.  Effort-related functions of nucleus accumbens dopamine and associated forebrain circuits.

Authors:  J D Salamone; M Correa; A Farrar; S M Mingote
Journal:  Psychopharmacology (Berl)       Date:  2007-01-16       Impact factor: 4.530

Review 8.  Cortico-limbic-striatal circuits subserving different forms of cost-benefit decision making.

Authors:  Stan B Floresco; Jennifer R St Onge; Sarvin Ghods-Sharifi; Catharine A Winstanley
Journal:  Cogn Affect Behav Neurosci       Date:  2008-12       Impact factor: 3.282

9.  Influence of reward delays on responses of dopamine neurons.

Authors:  Shunsuke Kobayashi; Wolfram Schultz
Journal:  J Neurosci       Date:  2008-07-30       Impact factor: 6.167

10.  Weighing up the benefits of work: behavioral and neural analyses of effort-related decision making.

Authors:  M E Walton; S W Kennerley; D M Bannerman; P E M Phillips; M F S Rushworth
Journal:  Neural Netw       Date:  2006-09-01
View more
  128 in total

1.  Phasic nucleus accumbens dopamine release encodes effort- and delay-related costs.

Authors:  Jeremy J Day; Joshua L Jones; R Mark Wightman; Regina M Carelli
Journal:  Biol Psychiatry       Date:  2010-05-10       Impact factor: 13.382

Review 2.  Reconsidering anhedonia in depression: lessons from translational neuroscience.

Authors:  Michael T Treadway; David H Zald
Journal:  Neurosci Biobehav Rev       Date:  2010-07-11       Impact factor: 8.989

3.  Single-neuron mechanisms underlying cost-benefit analysis in frontal cortex.

Authors:  Takayuki Hosokawa; Steven W Kennerley; Jennifer Sloan; Jonathan D Wallis
Journal:  J Neurosci       Date:  2013-10-30       Impact factor: 6.167

4.  Direct-pathway striatal neurons regulate the retention of decision-making strategies.

Authors:  Susan M Ferguson; Paul E M Phillips; Bryan L Roth; Jürgen Wess; John F Neumaier
Journal:  J Neurosci       Date:  2013-07-10       Impact factor: 6.167

5.  Individual differences in dopamine D2 receptor availability correlate with reward valuation.

Authors:  Linh C Dang; Gregory R Samanez-Larkin; Jaime J Castrellon; Scott F Perkins; Ronald L Cowan; David H Zald
Journal:  Cogn Affect Behav Neurosci       Date:  2018-08       Impact factor: 3.282

6.  Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term.

Authors:  Andrew S Hart; Robb B Rutledge; Paul W Glimcher; Paul E M Phillips
Journal:  J Neurosci       Date:  2014-01-15       Impact factor: 6.167

7.  The impact of motivation on cognitive performance in an animal model of the negative and cognitive symptoms of schizophrenia.

Authors:  Ryan D Ward; Vanessa Winiger; Kerin K Higa; Julia B Kahn; Eric R Kandel; Peter D Balsam; Eleanor H Simpson
Journal:  Behav Neurosci       Date:  2015-04-27       Impact factor: 1.912

8.  Learning the opportunity cost of time in a patch-foraging task.

Authors:  Sara M Constantino; Nathaniel D Daw
Journal:  Cogn Affect Behav Neurosci       Date:  2015-12       Impact factor: 3.282

9.  Dopamine-associated cached values are not sufficient as the basis for action selection.

Authors:  Nick G Hollon; Monica M Arnold; Jerylin O Gan; Mark E Walton; Paul E M Phillips
Journal:  Proc Natl Acad Sci U S A       Date:  2014-12-08       Impact factor: 11.205

Review 10.  Illicit dopamine transients: reconciling actions of abused drugs.

Authors:  Dan P Covey; Mitchell F Roitman; Paul A Garris
Journal:  Trends Neurosci       Date:  2014-03-20       Impact factor: 13.837

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.