| Literature DB >> 22262065 |
Katsuhiko Miyazaki1, Kayoko W Miyazaki, Kenji Doya.
Abstract
Classic theories suggest that central serotonergic neurons are involved in the behavioral inhibition that is associated with the prediction of negative rewards or punishment. Failed behavioral inhibition can cause impulsive behaviors. However, the behavioral inhibition that results from predicting punishment is not sufficient to explain some forms of impulsive behavior. In this article, we propose that the forebrain serotonergic system is involved in "waiting to avoid punishment" for future punishments and "waiting to obtain reward" for future rewards. Recently, we have found that serotonergic neurons increase their tonic firing rate when rats await food and water rewards and conditioned reinforcer tones. The rate of tonic firing during the delay period was significantly higher when rats were waiting for rewards than for tones, and rats were unable to wait as long for tones as for rewards. These results suggest that increased serotonergic neuronal firing facilitates waiting behavior when there is the prospect of a forthcoming reward and that serotonergic activation contributes to the patience that allows rats to wait longer. We propose a working hypothesis to explain how the serotonergic system regulates patience while waiting for future rewards.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22262065 PMCID: PMC3311865 DOI: 10.1007/s12035-012-8232-6
Source DB: PubMed Journal: Mol Neurobiol ISSN: 0893-7648 Impact factor: 5.590
Definitions and relationships of the terms
| Impulsive action | Failure to suppress inappropriate action. Impulsive action occurs due to lack of action inhibition. Action inhibition is classified into “action restraint” and “action cancelation” |
| Impulsive choice | Tendency to choice of small immediate rewards over larger delayed rewards |
| Action restraint | Action restraint describes the inhibition of the motor response before that response has been initiated. Action restraint encompasses “waiting to obtain reward” and “waiting to avoid punishment” as defined in this review |
| Action cancelation | Action cancelation describes the inhibition of a motor response during its execution |
| Waiting to obtain reward | Suppression of behavior to obtain future reward |
| Waiting to avoid punishment | Suppression of behavior to avoid future punishment |
Fig. 1Apparatus and behavioral procedure of the five-choice serial reaction time task. a Schematic diagram of the five-choice serial reaction time task chamber, showing the spatial arrangement of the five response apertures in relation to the food magazine. b Possible trial sequences of the five-choice serial reaction time task. A trial is initiated when the rat enters the food magazine. A brief light stimulus is then presented in one of five possible apertures after a 5-s inter-trial interval (ITI). The rat is required to scan the five apertures for the appearance of the light stimulus and then respond in the “correct” aperture with a nose-poke response to earn a single food pellet. If the rat responds before the stimulus (“premature response”) or in an adjacent incorrect aperture (“incorrect response”), a 5-s time-out (TO) period is introduced in which the house light is extinguished and no food reward is provided. A failure to respond within the limited hold (LH) period results in an “omission” and subsequent 5-s TO period. After collecting the reward or at the end of the TO period, a head entry in the food magazine initiates a new trial. Modified from [34]
Fig. 2Behavioral procedure of the delay discounting task. The format of a single trial is shown; the trials occurred at 100-s intervals. A session lasted 100 min and consisted of five blocks, each comprising two trials in which only one lever was presented (one trial for each lever in a random order) followed by 10 choice trials. The delay to the large reinforcer was varied systematically across the session. The delays for each block were 0, 10, 20, 40, or 60 s. Modified from [75]
Summary of the role of 5-HT in impulsive action and impulsive choice
| Task | Manipulation | Effect | Reference |
|---|---|---|---|
| 5-CSRTT | Systemic administration of M100907, 5-HT2A receptor antagonist | Decrease premature responses | [ |
| Systemic administration of DOI, 5-HT2A receptor agonist | Increase premature responses | ||
| Systemic administration of SB242084, 5-HT2C receptor antagonist | Increase premature responses | ||
| Systemic administration of Ro60-0175, 5-HT2C receptor agonist | Decrease premature responses | ||
| 5-CSRTT | Intra-DRN 5-HT depletion (5,7 dihydroxytryptamine—5,7-DHT) | Increase premature responses | [ |
| 5-CSRTT | Global 5-HT depletion (5,7-DHT) | Increase premature responses | [ |
| 5-CSRTT | Intra-mPFC infusion of M100907, 5-HT2A receptor antagonist | Decrease premature responses induced by NMDA receptor antagonist | [ |
| 5-CSRTT | Systemic and intra-mPFC administration of ketanserin, 5-HT2A/C receptor antagonist | Both systemic and intra-mPFC administration of ketanserin decrease premature responses | [ |
| 5-CSRTT | Systemic administration of ketanserin, 5-HT2A/C receptor antagonist | Decrease premature responses | [ |
| Delay discounting | Systemic administration of ketanserin | No effect | |
| 5-CSRTT | Systemic and intra-mPFC administration of M100907, 5-HT2A antagonist | Decrease premature responses in systemic administration | [ |
| Decrease premature responses in intra-mPFC administration when the stimulus duration was reduced | |||
| 5-CSRTT | Systemic administration of M100907, 5-HT2A receptor antagonist | Decreased premature responses | [ |
| Systemic administration of SB242084, 5-HT2C receptor antagonist | Increase premature responses | ||
| T-maze | DRN 5-HT depletion (5,7-DHT) | Prefer no-delay small reward over 15 s delayed large reward (transient) | [ |
| Acute systemic administration of fluoxetin and fluvoxamin (SSRI) | Prefer 25 s delayed large reward over no-delay small reward | ||
| T-maze | Acute systemic administration of citalopram (SSRI) | Prefer 25 s delayed large reward over no-delay small reward | [ |
| T-maze | Global 5-HT synthesis inhibition (p-chlorophenylalanine—pCPA) | Prefer no-delay small reward over 15 s delayed large reward | [ |
| Delay discounting | Acute systemic administration of citalopram (SSRI) | No effect | [ |
| Acute systemic administration of metergolin, nonselective 5-HT antagonist | Increase self-controlled choice | ||
| Delay discounting | Acute and chronic systemic administration of buspirone, partial 5-HT1A agonist | Increase impulsive choice in acute administration | [ |
| Increase self-controlled choice in chronic administration | |||
| Delay discounting | DRN and MRN 5-HT depletion (5,7-DHT) | Increase impulsive choice | [ |
| Probabilistic discounting | No effect | ||
| Delay discounting | Global 5-HT depletion (5,7-DHT) | No effect | [ |
| Delay discounting | Global 5-HT depletion (5,7-DHT) | No effect | [ |
| Delay discounting | Measurement of 5-HT efflux by microdialysis | Increase mPFC 5-HT efflux during the delay discounting task | [ |
| Delay discounting | Systemic administration of 8-OH-DPAT, 5-HT1A receptor agonist | Increase impulsive choice | [ |
| Global 5-HT depletion (5,7-DHT) and systemic administration of 8-OH-DPAT | No effect on 8-OH-DPAT's ability to increase impulsive choice | ||
| Delay discounting | DRN and MRN 5-HT depletion (5,7-DHT) | Increase impulsive choice | [ |
Fig. 3Design of the behavioral task and the rats’ performance. a Open-field reward cylinders (food site and water site) and a tone cylinder (tone site) for the task. The windows for nose-pokes (reward locations) are indicated. The tone cylinder also has a small window at the same position as in the reward cylinders. b Schematic of the movements required by the rats to receive rewards at the food and water sites. To start the task, the rats have to visit, insert, and keep their noses in the tone site until the tone (8 kHz, 0.4 s) is presented (tone delay). Green, red, and blue areas indicate the tone, food reward, and water reward delays, respectively. See the text for details regarding the tone and reward delay periods. Modified from [29]
Fig. 4Activity of serotonergic neurons during the tone delay and reward delay periods. a Activity of an example neuron recorded in the dorsal raphe nucleus is shown separately for food (left) and water (right) during the sequential food–water navigation task in which the waiting periods for tone (tone delay) and for rewards (food and water delay) are 2 s (the constant delay condition). For each reward, raster plots of neural activity (top) and peri-event time histograms smoothed with a Gaussian filter (SD = 50 ms) (bottom) are aligned at the time of the tone site entry (left) and at the time of the reward site entry (right). The raster plots represent neural activity in the order of the occurrence of trials for each reward site from bottom to top. Each dot represents a spike. The tones for the food and water sites are the food tone and water tone, respectively. Green, red, and blue areas indicate the tone delay, food delay, and water delay periods, respectively. Light blue areas indicate the water spout presenting period. b Average activity of the 63 neurons recorded during the constant delay condition. c Average firing rate during the tone and reward delay periods. Average firing rates during the baseline (B), food tone delay (FTD), water tone delay (WTD), food delay (FD), and water delay (WD) periods are shown. Asterisks (***) indicate significant differences relative to baseline activity (Wilcoxon signed-rank test, p < 0.0001). Hash marks (#) indicate significant differences relative to tone delay activity (Wilcoxon signed-rank test, p < 0.0001). In (a) and (b), the gray shading indicates SEM. Modified from [29]
Fig. 5Population activity of DRN serotonin neurons under an extended reward delay condition in which the tone delay was fixed at 1.5 s and the reward delay at both sites was increased gradually every 300 s (2, 4, 6, 8, 12 s). Averaged activity of 5-HT neurons aligned to the time of entry to the food site (left) and water site (right). Red and blue lines indicate activity during the food and water delay periods, respectively. Gray shadings represent SEM. Food site: 2-s delay (n = 46), 4-s delay (n = 45), 6-s delay (n = 47), 8-s delay (n = 43), and 12-s delay (n = 37). Water site: 2-s delay (n = 46), 4-s delay (n = 45), 6-s delay (n = 47), 8-s delay (n = 46), and 12-s delay (n = 39). Modified from [29]
Fig. 6Activity of 5-HT neurons during reward wait error (i.e., failure to wait for delayed rewards) in the extended reward delay test and during water reward omission. a Population activity aligned to the onset of the reward presentation (red, food; blue, water) and to the reward wait error (pink, food wait error; cyan, water wait error) (left, food site, n = 26; right, water site, n = 24). Gray shadings represent SEM. Light yellow areas indicate the periods that were used to analyze the average firing rate. b Average firing rate during the first and last 2 s of the waiting period after entry into the reward site in the case of a successful entry (red, food; blue, water) or in the case of a wait error entry (pink, food wait error; cyan, water wait error) (left, food site, n = 26; right, water site, n = 24; ±SEM). c Population activity aligned to water site rewarded entry (blue) and to water omission entry (cyan) (left) (n = 24). Population activity aligned to water site exit after water omission entry (right) (n = 24). Gray shadings represent SEM. Light yellow areas indicate the periods that were used to analyze the average firing rate. d Average firing rates during a 2-s period following water site rewarded entry, after water omission entry, and before water site exit (n = 24; ±SEM). *p < 0.01, **p < 0.001, ***p < 0.0001; Wilcoxon signed-rank test. n.s. not significant. Modified from [29]
Fig. 7Putative neural circuit that contributes to the “waiting to obtain reward”. DRN 5-HT neurons project to the NAcc, mPFC, and OFC. Previous studies revealed that 5-HT in the NAcc and mPFC contributes to regulation of impulsive action. The OFC may transmit confidence and/or reward expectation signals to influence 5-HT neural activity during the “waiting to obtain reward”