Literature DB >> 27317196

Nucleus accumbens dopamine D2-receptor expressing neurons control behavioral flexibility in a place discrimination task in the IntelliCage.

Tom Macpherson¹, Makiko Morita¹, Yanyan Wang², Toshikuni Sasaoka³, Akira Sawa⁴, Takatoshi Hikida⁵.

Abstract

Considerable evidence has demonstrated a critical role for the nucleus accumbens (NAc) in the acquisition and flexibility of behavioral strategies. These processes are guided by the activity of two discrete neuron types, dopamine D1- or D2-receptor expressing medium spiny neurons (D1-/D2-MSNs). Here we used the IntelliCage, an automated group-housing experimental cage apparatus, in combination with a reversible neurotransmission blocking technique to examine the role of NAc D1- and D2-MSNs in the acquisition and reversal learning of a place discrimination task. We demonstrated that NAc D1- and D2-MSNs do not mediate the acquisition of the task, but that suppression of activity in D2-MSNs impairs reversal learning and increased perseverative errors. Additionally, global knockout of the dopamine D2L receptor isoform produced a similar behavioral phenotype to D2-MSN-blocked mice. These results suggest that D2L receptors and NAc D2-MSNs act to suppress the influence of previously correct behavioral strategies allowing transfer of behavioral control to new strategies.

Entities: Gene Species

Mesh：

Substances：

Year: 2016 PMID： 27317196 PMCID： PMC4918782 DOI： 10.1101/lm.042507.116

Source DB: PubMed Journal: Learn Mem ISSN： 1072-0502 Impact factor: 2.460

The ability to learn behavioral strategies resulting in rewarding outcomes, as well as flexibly adapt behavior to match the changing contingencies of our environment, is critical for survival. Indeed, altered reward learning and behavioral flexibility are common features of various major psychiatric disorders, including schizophrenia and drug addiction (Elliott et al. 1995; Cardinal and Everitt 2004; Schoenbaum et al. 2004; Waltz and Gold 2007; Gold et al. 2008). The recent development of the IntelliCage, an automated group-housing experimental apparatus, has allowed researchers to explore learning processes within a more natural environmental setting. Indeed, early experiments using the IntelliCage demonstrated its value for measuring spontaneous, as well as conditioned, behaviors (Knapska et al. 2006; Onishchenko et al. 2007; Mechan et al. 2009; Võikar et al. 2010). More recently, IntelliCage place discrimination tasks have been used to investigate higher-order cognitive functions in both normal and disease model mice. Acquisition and reversal of place discrimination was found to differ between various commonly used inbred mouse strains (Krackow et al. 2010; Endo et al. 2011), and valproate-treated C57BL/6 mice, used as a model of autism, displayed impairments in place learning associated with perseverative responding (Puscian et al. 2014). However, little is known about the neural circuitry that mediates performance on this task. The nucleus accumbens (NAc) plays an important role in the acquisition and flexibility of reward and aversive behavioral strategies (Floresco et al. 2009; Macpherson et al. 2014; Hikida et al. 2016). Critical to the ability of the NAc to control reward and aversive learning are the activity of two types of intermingled output neurons, distinguished by their expression of dopamine receptors and releasable peptides. Dopamine D1-receptor, dynorphin-, and substance P-expressing medium spiny neurons (D1-MSNs), and D2-receptor and enkephalin-expressing medium spiny neurons (D2-MSNs) (Gerfen and Young 1988; Gerfen et al. 1990). Blockade of neurotransmission in NAc D1-MSNs using the selective expression of tetanus toxin impairs acquisition of food or cocaine conditioned place preference (CPP) (Hikida et al. 2010, 2013), as well as visual or response discrimination tasks (Yawata et al. 2012). Whereas, the same method of neurotransmission blocking in NAc D2-MSNs, or intra-NAc infusion of a D2 agonist, does not alter acquisition of visual or response discrimination tasks, but impairs the ability to switch between the two tasks as well as reversal learning within both tasks (Haluk and Floresco 2009; Yawata et al. 2012). Here, we use a reversible neurotransmission blocking (RNB) technique to investigate the consequences of selectively disrupting neurotransmission in NAc D1- or D2-MSNs on acquisition and serial reversal learning of a place discrimination task in the IntelliCage. Additionally, given previous evidence of the role of dopamine D2 receptors in behavioral flexibility (Kruzich et al. 2006; De Steno and Schmauss 2009; Haluk and Floresco 2009), we decided to explore the functional impact of the deletion of these receptors in the IntelliCage place discrimination task. Specifically, we investigated the consequences of targeted deletion of dopamine D2L receptors, an isoform that predominantly acts postsynaptically (Usiello et al. 2000; Wang et al. 2000).

Results

Baseline measures of activity and water drinking in RNB and D2L KO mice

Given that the NAc has been implicated in the control of motor activity (McCullough and Salamone 1992; Swanson and Kalivas 2000) and hypolocomotion in a novel environment has been reported in D2L knockout (KO) mice (Wang et al. 2000; Fetsko et al. 2005), we first examined baseline activity in group-housed NAc D1-MSN blocked (D1-RNB) and D2-MSN blocked (D2-RNB) RNB mice, and D2L KO, heterozygous (HET), and wild-type (WT) mice. We measured visits to the corners and water drinking during a 7-d period of 24 h free access to the conditioning chamber corners (Supplemental Fig. S1). No problems were found in the ability to visit conditioning corners or acquire nosepoke responses in any mice. Total visits to the corners were unaffected in D1-RNB, D2-RNB, D2L HET, and D2L KO mice when compared with equivalent WT mice (Supplemental Fig. S1, A; D1-RNB; F(1,15) = 0.18, P = 0.68, C; D2-RNB; F(1,13) = 0.35, P = 0.58, E; D2L KO; F(2,28) = 0.51, P = 0.95). Similarly, there was no significant difference in bottle licks in D1-RNB, D2-RNB, D2L HET, and D2L KO mice compared with WT mice (Supplemental Fig. S1, B; D1-RNB; F(1,15) = 0.24, P = 0.63, D; D2-RNB; F(1,13) = 0.02, P = 0.89, F; D2L KO; F(2,28) = 2.37, P = 0.11). Thus, blockade of either NAc D1- or D2-MSNs, or global KO of D2L receptors, does not alter baseline activity or water drinking in group-housed mice.

Acquisition of place discrimination in RNB mice

We next examined the role of NAc D1- and D2-MSNs in the ability to learn an IntelliCage place discrimination task designed by Endo et al. (2011). Acquisition of place discrimination requires mice to discriminate and shuttle between diagonally facing rewarded but not nonrewarded corners. Both D1-RNB and D2-RNB were able to acquire place discrimination to a similar degree as WT mice, as evidenced by an improvement in the percentage of correct responses over the course of seven daily sessions (Fig. 1A, acquisition stage; D1-RNB, nonsignificant main effect of genotype F(1,15) = 0.27, P = 0.61, significant main effect of session F(6,90) = 12.35, P < 0.0001, Fig. 1B, acquisition stage; D2-RNB, nonsignificant main effect of genotype F(1,13) = 0.14, P = 0.71, significant main effect of session F(6,78) = 5.66, P < 0.0001). Thus, NAc D1- or D2-MSNs do not appear to be involved in the acquisition of the current place discrimination task.

Figure 1.

Acquisition and serial reversal of an IntelliCage place-learning task in NAc RNB mice. (A) WT (n = 9) and NAc D1-RNB (n = 8) mice did not significantly differ in acquisition and reversal learning of shuttling responses in the place-learning task. (B) NAc D2-RNB (n = 8) mice showed normal acquisition, but impaired reversal learning of the place-learning task when compared with WT (n = 7) mice. (*) P < 0.05, significant main effect of genotype in the first reversal stage (Rev 1). Decreased percentage of correct responses in D2-RNB mice was restored to the level of WT mice by the third reversal stage (Rev 3). Small boxes at the bottom represent an example of the IntelliCage setup for each stage, with filled circles indicating the reward-paired corners and empty circles indicating the non-reward-paired corners. Graph points represent the mean responses for each genotype, with error bars indicating ±SEM.

Serial reversal learning in RNB mice

Using the same place discrimination task we next explored behavioral flexibility in RNB mice using serial reversals of the response contingency after every seven sessions, so that the previously nonrewarded corners now become rewarded, and vice versa. During the first reversal stage, as with the acquisition stage, all mice demonstrated the ability to learn to discriminate between the rewarded and nonrewarded corners, indicated by an improvement in the percentage of correct responses over the course of seven sessions (Fig. 1A, Rev 1; D1-RNB, significant main effect of session F(6,90) = 25.50, P < 0.0001, Fig. 1B, Rev 1; D2-RNB, significant main effect of session F(6,78) = 11.95, P < 0.0001). However, the percentage of correct responses across the reversal stage as a whole was significantly decreased in D2-RNB (Fig. 1B, Rev 1; significant main effect of genotype F(1,13) = 4.75, P < 0.05), but not D1-RNB mice (Fig. 1A, Rev 1; main effect of genotype F(1,15) = 0.09, P = 0.77) when compared with WT mice. When analyzed across the three reversal stages, overall performance improved with each consecutive stage in D1-RNB and D2-RNB mice and equivalent WT mice (Fig. 1A, Rev 1-3; D1-RNB, significant main effect of stage F(2,30) = 17.77, P < 0.001, Fig. 1B, Rev 1-3; D2-RNB, significant main effect of stage F(2,26) = 12.81, P < 0.001). Indeed, impaired performance of reversal learning in D2-RNB mice was restored to the level of WT mice by the final stage (Fig. 1B, Rev 3; nonsignificant main effect of genotype F(1,13) = 0.58, P = 0.46).

Perseveration in RNB mice

During the first reversal, both D1-RNB and D2-RNB and WT mice demonstrated a significant decrease in perseverative errors over the course of seven sessions (Fig. 2A, Rev 1; D1-RNB, significant main effect of session F(6,90) = 51.18, P < 0.0001, Fig. 2B, Rev 1; D2-RNB, significant main effect of session F(6,78) = 29.78, P < 0.0001). However, over the whole stage, the percentage of perseverative errors was significantly elevated in D2-RNB (Fig. 2B, Rev 1; significant main effect of genotype F(1,13) = 9.11, P < 0.01) but not D1-RNB (Fig. 2A, Rev 1; effect of genotype F(1,15) = 0.14, P = 0.71) mice when compared with WT mice.

Figure 2.

Perseverative errors in NAc D1- and D2-RNB mice. (A) The percentage of perseverative errors during any of the reversal stages did not significantly differ between WT (n = 9) and NAc D1-RNB (n = 8) mice. (B) The percentage of perseverative errors was significantly higher in NAc D2-RNB (n = 8) mice than that in WT (n = 7) mice. (**) P < 0.01, significant main effect of genotype in the first reversal stage (Rev 1). Increased perseverative errors in D2-RNB mice were restored to the level of WT mice by the third reversal stage (Rev 3). Small boxes at the bottom represent an example of the IntelliCage setup for each stage, with filled circles indicating the reward-paired corners and empty circles indicating the non-reward-paired corners. Graph points represent the mean responses for each genotype, with error bars indicating ±SEM. Across the three reversal stages, the percentage of perseverative errors decreased with each consecutive stage in all mice (Fig. 2A, Rev 1-3; D1-RNB, significant main effect of stage F(2,30) = 17.77, P < 0.001, Fig. 2B, Rev 1-3; D2-RNB, significant main effect of stage F(2,26) = 12.81, P < 0.001). Similar to impairment in correct responding, increased perseverative errors in D2-RNB mice were restored to the level of WT mice by the final stage (Fig. 2B, Rev 3; nonsignificant main effect of genotype F(1,13) = 0.11, P = 0.74). Finally, we investigated whether perseverative errors and correct responses were predictive of each other. During the reversal stages, the percentage of perseverative errors was significantly negatively correlated with the percentage of correct responses in WT and D1-RNB (Fig. 3A; WT, r = −0.96 P < 0.001, D1-RNB, r = −0.89 P < 0.001) and WT and D2-RNB (Fig. 3B; WT, r = −0.93 P < 0.001, D2-RNB, r = −0.92 P < 0.001) mice. Thus, the impairment in reversal learning in D2-RNB mice is directly correlated to the elevated perseverative responding at previously correct corners.

Figure 3.

Correlation between the percentage of correct responses and the percentage of perseverative errors during reversal stages in RNB mice. Both D1-RNB (A), D2-RNB (B) and equivalent WT mice showed a significant negative correlation between the percentage of correct responses and the percentage of perseverative errors during the reversal stages.

Place and reversal learning in D2L KO mice

Given that NAc D2-MSNs are heavily influenced by dopamine transmission at D2 receptors (Gerfen et al. 1990), we next decided to explore the functional impact of deletion of these receptors. Differential splicing of the D2 receptor gene creates two isoforms, short (D2S) receptors and long (D2L) receptors, which may act predominantly at the presynapse and postsynapse, respectively (Usiello et al. 2000; Wang et al. 2000). As we were interested in the postsynaptic effect of dopamine transmission on NAc D2-MSNs, we chose to investigate the consequences of deletion of D2L receptors. As with D1-RNB and D2-RNB mice, D2L HET and KO mice did not significantly differ from WT mice in their ability to acquire the place discrimination task (Fig. 4A, Acquisition stage; nonsignificant main effect of genotype F(2,28) = 2.75, P = 0.08, significant main effect of session F(6,168) = 40.33, P < 0.0001). Similarly, during the first reversal stage, all mice demonstrated an increase in the percentage of correct responses across the course of seven sessions (Fig. 4A, Rev 1; significant main effect of session F(6,168) = 32.35, P < 0.0001). However, overall performance across the stage was revealed to differ between genotypes (Fig. 4A, Rev 1; significant main effect of genotype F(2,28) = 7.62, P < 0.01). Post hoc analysis revealed D2L HET (P < 0.05) and KO (P < 0.01) mice had a significantly decreased percentage of correct responses in the stage when compared with WT mice. This was accompanied by a significant difference in the percentage of perseverative errors between genotypes at this stage (Fig. 4B, Rev 1; significant main effect of genotype F(2,28) = 5.30, P < 0.05). Post hoc analysis revealed D2L KO (P < 0.01) perseverative errors to be significantly elevated when compared with WT mice, while perseverative errors in HET mice did not reach significance (P = 0.05).

Figure 4.

Place-learning in D2L KO mice. (A) All genotypes demonstrated normal acquisition of the place-learning task. However, D2L HET (n = 11) and KO (n = 11) mice were significantly impaired in reversal learning compared with WT mice (n = 9). (**) P < 0.01; (*) P < 0.05, significant main effect of genotype in the first (Rev 1) and second (Rev 2) reversal stages. Decreased percentage of correct responses in HET and KO mice returned to the level of WT mice by the final reversal stage (Rev 3). (B) The percentage of perseverative errors was significantly higher in HET and KO mice than that in WT mice. (**) P < 0.01; (*) P < 0.05, significant main effect of genotype in the first (Rev 1) and second (Rev 2) reversal stages. As with the percentage of correct responses in A, the percentage of perseverative errors normalized to WT mice levels by the third reversal stage (Rev 3). (C) The percentage of correct responses and the percentage of perseverative errors were significantly negatively correlated in all genotypes. Small boxes at the bottom represent an example of the IntelliCage setup for each stage, with filled circles indicating the reward-paired corners and empty circles indicating the non-reward-paired corners. Graph points represent the mean responses for each genotype, with error bars indicating ±SEM. Across the three reversal stages, the percentage of correct responses improved with each consecutive stage in all animals (Fig. 4A, Rev 1-3; significant main effect of stage F(2,56) = 27.85, P < 0.001). This corresponded with a reduction in the percentage of perseverative errors at each successive stage in all genotypes (Fig. 4B, Rev 1-3; significant main effect of stage F(2,56) = 34.32, P < 0.001). Additionally, as with D2-RNB mice, impairment in correct responding and elevated perseverative errors in D2L HET and KO mice dissipated by the final stage (Fig. 4A, Rev 3; % correct, nonsignificant main effect of genotype F(2,28) = 1.01, P = 0.38, Fig. 4B, Rev 3; % perseverative error, nonsignificant main effect of genotype F(2,28) = 0.72, P = 0.49). Post hoc analysis revealed the percentage of correct responding and perseverative errors in D2L HET and KO to not significantly differ from WT mice (Fig. 4A, Rev 3; % correct, HET P = 0.35, KO P = 0.79, Fig. 4B, Rev 3; % perseverative error, HET P = 0.48, KO P = 0.67). Finally, a Pearson's correlation coefficient revealed a significant negative correlation between the percentage of correct responses and the percentage of perseverative errors in all genotypes (Fig. 4C; WT, r = −0.93, P < 0.001, HET, r = −0.95, P < 0.001, KO, r = −0.95. P < 0.001). Thus, as with D1-RNB and D2-RNB mice, the degree of perseveration at the previously correct corners was directly predictive of the performance on the place-learning task irrespective of genotypes.

Discussion

Previous evidence indicates that activity in NAc D1-MSNs controls reward learning and addictive behaviors (Macpherson et al. 2014; Nakanishi et al. 2014; Hikida et al. 2016). Indeed, blocking neurotransmission in D1-MSNs of the NAc inhibits acquisition of spatial and response discriminationtasks, and impairs food CPP (Hikida et al. 2010, 2013; Yawata et al. 2012). Similarly, RNB suppression or optogenetic activation of NAc D1-MSNs attenuates or enhances, respectively, cocaine CPP (Hikida et al. 2010, 2013; Lobo et al. 2010). However, in the current study, acquisition of place discrimination in an IntelliCage system was unaffected by neurotransmission blocking in NAc D1- or D2-MSNs. These data suggest a dissociation in the neural circuitry mediating responding in this place discrimination task with other spatial cue-based reward learning tasks using chocolate as a strong reward, known to affected by RNB of D1-MSNs in the NAc (Yawata et al. 2012). The dorsal striatum has been identified to be critical for the acquisition of goal-directed instrumental tasks (Yin et al. 2005; Balleine et al. 2007; Featherstone and McDonald 2004a,b). Thus, it is possible that in the current study, acquisition of place discrimination may be mediated by D1- or D2-MSNs within the dorsal striatum. Alternatively, it is possible that little reward learning is required to perform the current task, leaving it unaffected by neurotransmission blocking of NAc D1- or D2-MSNs. However, we argue that this is likely not the case, as our previous studies have revealed that NAc D1-MSNs are necessary for acquisition of more simple visual discrimination and place conditioning tasks (Hikida et al. 2010; Yawata et al. 2012). A more difficult version of the reward-based place discrimination task, for instance, one that requires a longer sequence of behavioral responses in order to achieve the water reward, may address these possible alternative explanations of our data. Although acquisition was unaffected, our data revealed transmission blocking in NAc D2- but not D1-MSNs attenuated performance in the place discrimination task following reversal of the reward-paired corners, indicating a deficit in behavioral flexibility. This is congruent with previous evidence from our group demonstrating activity in NAc D2-MSNs to control reversal learning in both visual and response discrimination tasks (Yawata et al. 2012). Similarly, global KO of D2L receptors impaired performance in the reversal stages of the IntelliCage task, consonant with studies demonstrating deletion of D2 receptors to impair reversal learning in odor and texture discrimination tasks (Kruzich et al. 2006; De Steno and Schmauss 2009). Interestingly, the findings from HET mice indicate that even a 50% reduction in D2L receptors is sufficient to severely disrupt behavioral flexibility. Increased errors in the reversal task may be explained by perseveration to the previously correct corners, or alternatively by diminished ability to efficiently learn the new behavioral strategy (i.e., repeated responding at the inactive rewarded corner). In order to elucidate the origin of this error, we examined perseverative errors (the percentage of responses at the nonrewarded corners). Our findings revealed that impaired behavioral flexibility in D2-RNB and D2L KO mice was associated with increased perseverance at the previously rewarded corners. This is interesting given that NAc lesions have previously been reported to impair the ability to acquire and maintain a new strategy, but not to alter perseveration of the previously correct strategy (Annett et al. 1989; Floresco et al. 2006). The difference in the findings of our data compared with that of previous studies may be explained by the time point at which perseverative errors were measured. While we report a significant increase in perseverative errors over the course of seven daily sessions of a reversal stage, other groups have used shorter experimental designs in which acquisition or reversal learning are analyzed within a single session (Annett et al. 1989; Floresco et al. 2006). Indeed, in our experiment, differences in correct responses and perseverative errors were largely not seen until after the first session of the reversal stage. In both D2-RNB and D2L KO, mice impaired reversal learning performance and increased perseverative errors began to normalize across each reversal stage until all mice were performing at similar levels by the final stage. These findings indicate that compensatory mechanisms may occur to counteract the absence of D2L receptors or NAc D2-MSN activity. Interestingly, recent evidence has revealed that a population of D1-MSNs in the NAc project to the same neurons in the ventral pallidum as D2-MSNs (Smith et al. 2013; Kupchik et al. 2015). It is possible that this newly discovered D1-MSN pathway may be recruited in the absence of activity in the classic D2-MSN “indirect” pathway. Given that deletion of D2L receptors produced a similar phenotype to D2-RNB mice, it could be speculated that the loss of D2L receptors may alter the activity of NAc D2-MSNs. Indeed, we suggest that activity in NAc D2-MSNs likely plays an important role in inhibiting the competing influence of previously correct strategies over newly learned correct strategies for control of behavior. Interestingly, D2 receptors and striatopallidal transmission have been implicated in the etiology of a number of psychopathologies, including addiction and obesity (Kenny et al. 2013). Indeed, compulsive eating in rats is associated with decreased D2 receptor expression in the striatum, and knockdown of striatal D2 receptors accelerates the emergence of compulsive-like consumption of high calorie chow (Johnson and Kenny 2010). Similarly, pharmacogenetic inhibition of striatopallidal neurons enhances cocaine sensitization, while optogenetic stimulation of striatopallidal neurons or NAc D2-MSNs attenuated cocaine CPP and compulsive cocaine self-administration, respectively (Lobo et al. 2010; Ferguson et al. 2011; Bock et al. 2013). These studies, alongside the findings from the current study, suggest that D2-MSNs of the dorsal striatum and NAc may provide an efficacious therapeutic target for the treatment of disorders associated with compulsive behaviors. Future research using targeted deletion of NAc D2L receptors could elucidate whether D2L receptors alter activity in D2-MSNs and whether this is sufficient to modulate behavioral flexibility.

Materials and Methods

Animals

Male NAc D1-MSN blocked (D1-RNB), and D2-MSN blocked (D2-RNB) RNB mice and wild-type (WT) littermates were generated using TeNT transgenic mice on a C57BL/6 background, in which the expression of transmission-blocking tetanus toxin light chain is driven by the interaction of tetracycline-repressive transcription factor (tTA) and the tetracycline-responsive element, in a doxycycline-off fashion (Yamamoto et al. 2003). A recombinant AAV virus bilaterally delivered into the NAc (anterior +1.5 mm, lateral + or −0.8 mm, and ventral +3.5 mm) restricted expression of tTA specifically to Substance P- (D1-MSN) or Enkephalin- (D2-MSN) containing MSNs, creating D1- and D2-RNB mice, respectively (Hikida et al. 2010). Stereotaxic coordinates were histologically verified following completion of the experiment. RNB mice remained off doxycycline throughout the duration of the experiment, resulting in continuous blockade of neurotransmission in NAc D1- or D2-MSNs. D2L KO, HET, and WT mice were generated by mating D2L HET mice, as described previously (Wang et al. 2000). Following weaning and genotyping at 4-wk old, mice were housed in groups of 10–16 in the IntelliCage apparatus. At 8 wk of age all animals were implanted with a subcutaneous microchip and RNB mice underwent viral infusion followed by a 2-wk recovery/expression period. Mice were maintained on a 12-h light–dark cycle with lights on at 8:00 a.m. and temperature controlled (24 ± 2°C), in a humidity of 50 ± 5%. All animal handling procedures were approved by the animal research committee of Kyoto University Graduate School of Medicine.

IntelliCage

The IntelliCage (New Behavior AG, Zurich, Switzerland; http://www.newbehavior.com) is an automated group-housing apparatus allowing experimental testing within the home cage (see for details Galsworthy et al. (2005) and Endo et al. (2011)). The polycarbonate cage (55 × 37.5 × 20.5 cm) is equipped with four operant conditioning chambers (15 × 15 × 21 cm) located in each corner. Each conditioning chamber contains two water drinking bottles and is accessible by a small opening containing a transponder reader antenna that registers the microchip of the entering mouse. Access to each water bottle is controlled by gated nosepoke holes containing infrared beam-break sensors, which can be programmed to open or remain closed upon a nosepoke response. As each mouse was implanted with a unique microchip, corner entry and nosepoke data could be integrated with microchip readings collected by each conditioning corner's antenna, allowing data to be separated by each individual mouse.

Behavioral testing

Analysis of baseline activity and water drinking

At 10 wk of age baseline activity and drinking of freely available water were measured in the IntelliCage for 7 d. Analysis was performed on the total amount of corner visits and water bottle licks in order to assess for any preexisting biases in locomotion or water drinking between experimental groups before more behavioral testing occurred.

Place discrimination learning and serial reversal learning

Following evaluation of baseline activity and water drinking, mice began nosepoke training, during which they had to perform a nosepoke beam-break at the hole in front of the water bottles in any corner in order to gain access for a 5-sec period. During the first 3 d, mice obtained water by nosepoke responses at any time during the 24 h. Subsequently, access to water was restricted to a 3-h period (9–12 p.m.) also requiring a nosepoke response for a 5-sec access reward. Following 3 d of water restriction, mice began behavioral testing for place discrimination acquisition and serial reversal learning using the behavioral assay established by Endo et al. (2011). Mice were required to discriminate between pairs of diagonally facing reward-paired and non-reward-paired corners and perform a shuttling sequence between the reward-paired corners in order to gain access to water for 5 sec. Following seven consecutive daily 3-h sessions the reward-paired corners were switched to the previously nonreward paired corners, and vice versa, for another seven consecutive daily 3-h sessions. This reversal process was conducted three times in total.

Data analysis

The percentage of correct responses was calculated by dividing the total amount of correct responses by total amount of incorrect responses then multiplying by 100. As the task demands that mice shuttle between the rewarded corners, a response at the inactive rewarded corner would also be deemed incorrect. Thus, the chance of a random response at one of the corners being correct is 25%. The percentage of perseverative errors was calculated by dividing the non-reward-paired corner responses by the total amount of nosepokes for each session, then multiplying by 100. Data were analyzed using SPSS software (IBM Corporation, NY, USA) and GraphPad Prism 6.0 software (GraphPad Software, La Jolla, CA, USA) by two-way repeated measure ANOVAs and Bonferroni post hoc comparisons. Correlations were measured using Pearson's correlation coefficients. For graphs, data were averaged across experimental groups and presented as the mean response for each session, including error bars representing ±SEM.

Competing Interest Statement

The authors declare that there are no conflicts of interest regarding this article.

44 in total

1. Distinct roles of synaptic transmission in direct and indirect striatal pathways to reward and aversive behavior.

Authors: Takatoshi Hikida; Kensuke Kimura; Norio Wada; Kazuo Funabiki; Shigetada Nakanishi
Journal: Neuron Date: 2010-06-24 Impact factor: 17.173

2. A comparison of learning and memory characteristics of young and middle-aged wild-type mice in the IntelliCage.

Authors: Annis O Mechan; Adrian Wyss; Henry Rieger; M Hasan Mohajeri
Journal: J Neurosci Methods Date: 2009-03-10 Impact factor: 2.390

Review 3. Distinct dopaminergic control of the direct and indirect pathways in reward-based and avoidance learning behaviors.

Authors: S Nakanishi; T Hikida; S Yawata
Journal: Neuroscience Date: 2014-04-24 Impact factor: 3.590

4. Dopamine D2 receptors in addiction-like reward dysfunction and compulsive eating in obese rats.

Authors: Paul M Johnson; Paul J Kenny
Journal: Nat Neurosci Date: 2010-03-28 Impact factor: 24.884

5. Consistent behavioral phenotype differences between inbred mouse strains in the IntelliCage.

Authors: S Krackow; E Vannoni; A Codita; A H Mohammed; F Cirulli; I Branchi; E Alleva; A Reichelt; A Willuweit; V Voikar; G Colacicco; D P Wolfer; J-U F Buschmann; K Safi; H-P Lipp
Journal: Genes Brain Behav Date: 2010-07-07 Impact factor: 3.449

6. Neuropsychological evidence for frontostriatal dysfunction in schizophrenia.

Authors: R Elliott; P J McKenna; T W Robbins; B J Sahakian
Journal: Psychol Med Date: 1995-05 Impact factor: 7.723

7. Dopamine D2 receptors mediate reversal learning in male C57BL/6J mice.

Authors: Paul J Kruzich; Suzanne H Mitchell; Audrey Younkin; David K Grandy
Journal: Cogn Affect Behav Neurosci Date: 2006-03 Impact factor: 3.282

8. Cocaine-induced adaptations in D1 and D2 accumbens projection neurons (a dichotomy not necessarily synonymous with direct and indirect pathways).

Authors: Rachel J Smith; Mary Kay Lobo; Sade Spencer; Peter W Kalivas
Journal: Curr Opin Neurobiol Date: 2013-02-18 Impact factor: 6.627

9. Cocaine-experienced rats exhibit learning deficits in a task sensitive to orbitofrontal cortex lesions.

Authors: Geoffrey Schoenbaum; Michael P Saddoris; Seth J Ramus; Yavin Shaham; Barry Setlow
Journal: Eur J Neurosci Date: 2004-04 Impact factor: 3.386

Review 10. Neural circuits subserving behavioral flexibility and their relevance to schizophrenia.

Authors: Stan B Floresco; Ying Zhang; Takeshi Enomoto
Journal: Behav Brain Res Date: 2008-12-06 Impact factor: 3.332

5 in total

1. Dopamine receptors mediate strategy abandoning via modulation of a specific prelimbic cortex-nucleus accumbens pathway in mice.

Authors: Qiaoling Cui; Qian Li; Hongyan Geng; Lei Chen; Nancy Y Ip; Ya Ke; Wing-Ho Yung
Journal: Proc Natl Acad Sci U S A Date: 2018-05-07 Impact factor: 11.205

2. High-Frequency Activation of Nucleus Accumbens D1-MSNs Drives Excitatory Potentiation on D2-MSNs.

Authors: T Chase Francis; Hideaki Yano; Tyler G Demarest; Hui Shen; Antonello Bonci
Journal: Neuron Date: 2019-06-17 Impact factor: 17.173

3. Nucleus Accumbens Core Dopamine D2 Receptor-Expressing Neurons Control Reversal Learning but Not Set-Shifting in Behavioral Flexibility in Male Mice.

Authors: Tom Macpherson; Ji Yoon Kim; Takatoshi Hikida
Journal: Front Neurosci Date: 2022-06-28 Impact factor: 5.152

4. Withdrawal from escalated cocaine self-administration impairs reversal learning by disrupting the effects of negative feedback on reward exploitation: a behavioral and computational analysis.

Authors: Peter Zhukovsky; Mickael Puaud; Bianca Jupp; Júlia Sala-Bayo; Johan Alsiö; Jing Xia; Lydia Searle; Zoe Morris; Aryan Sabir; Chiara Giuliano; Barry J Everitt; David Belin; Trevor W Robbins; Jeffrey W Dalley
Journal: Neuropsychopharmacology Date: 2019-04-06 Impact factor: 7.853

5. Nucleus accumbens pathways control cell-specific gene expression in the medial prefrontal cortex.

Authors: Takatoshi Hikida; Shuhei Yao; Tom Macpherson; Ayumi Fukakusa; Makiko Morita; Haruhide Kimura; Keisuke Hirai; Tatsuya Ando; Hiroyoshi Toyoshiba; Akira Sawa
Journal: Sci Rep Date: 2020-02-04 Impact factor: 4.379

5 in total