Direct and indirect dorsolateral striatum pathways reinforce different action strategies.

Ana M Vicente¹, Pedro Galvão-Ferreira¹, Fatuel Tecuapetla¹, Rui M Costa².

Abstract

The basal ganglia, and the striatum in particular, are critical for action reinforcement [1,2]. The dorsal striatum, which can be further subdivided into dorsomedial (DMS) and dorsolateral (DLS) striatum, is mainly composed of two subpopulations of striatal medium spiny projection neurons (MSNs): dopamine D1 receptor-expressing MSNs that constitute the striatonigral or direct pathway (dMSNs); and dopamine D2 receptor-expressing MSNs that constitute the striatopallidal or indirect pathway (iMSNs) [3]. It has been suggested that each pathway has opposing roles in reinforcement, with dMSNs being important to learn positive reinforcement and iMSNs to learn to avoid undesired actions (Go/No-Go) [1]. Furthermore, optogenetic self-stimulation of dMSNs in DMS leads to reinforcement of actions, while self-stimulation of iMSNs leads to avoidance of actions [2]. However, in DLS, which has been implicated in the consolidation of well-trained actions and habits in mice [4,5], both pathways are active during lever-pressing for reward [6]. Furthermore, extensive skill training leads to long-lasting potentiation of glutamatergic inputs into both dMSNs and iMSNs [4]. We report here that, in DLS, both dMSNs and iMSNs are involved in positive reinforcement, but support different action strategies.

Year: 2016 PMID: 27046807 PMCID: PMC4826435 DOI: 10.1016/j.cub.2016.02.036

Source DB: PubMed Journal: Curr Biol ISSN: 0960-9822 Impact factor: 10.834

Main Text

To investigate the role of DLS striatonigral and striatopallidal neurons in action reinforcement, we used a self-stimulation paradigm in which we specifically activated each pathway upon lever-pressing. We used a viral Cre-dependent approach to express Channelrhodopsin-2 (ChR) in either dMSNs (D1-Cre) or iMSNs (D2-Cre) of DLS (Supplemental Figure S1A). Mice were then trained in an operant box with two levers (Supplemental Figure S1B): an active lever where pressing resulted in the delivery of blue light (473 nm), and an inactive lever (no light delivered). Reinforced lever presses resulted in the delivery of light into DLS (2 seconds, 5 Hz, 10 ms wide pulses, a frequency similar to the endogenous activity of MSNs [7]) (Supplemental Figure S2A,B). Each session lasted 30 minutes with no maximum number of reinforcers. Both groups of ChR-expressing mice increased the number of presses with training, and pressed significantly more than YFP controls (Supplemental Figure S2C; D1-Cre, F(1,10) = 20.67, P = 0.0011; D2-Cre, F(1,17) = 5.845, P = 0.0271). D1-Cre animals acquired lever-pressing rapidly, and pressed the active lever significantly more than the inactive lever (Figure 1A, F(3,20) = 21.21, P < 0.0001; Figure 1C, first versus last day of active lever with ChR: P < 0.0001). On the other hand, D2-Cre animals expressing ChR were slower in acquisition, and showed a significant increase in lever-pressing for both levers (Figure 1B, F(3,34) = 3.111, P = 0.0390; Figure 1D, first versus last day for both active and inactive lever with ChR: P < 0.05). This difference does not stem from different numbers of pairings between action and reinforcer in D1-Cre and D2-Cre animals, because the same result was observed when matching the number of reinforcers between groups (Supplemental Figure S1D,E).
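As a rough illustration of the stimulation parameters stated above (2 seconds, 5 Hz, 10 ms pulses), the following Python sketch lays out the timing of one reinforcer's pulse train. It is not the authors' control code: the function name `pulse_train`, the assumption that the first pulse starts at the moment of the lever press, and the assumption of evenly spaced square pulses are all illustrative choices that the text does not specify.

```python
def pulse_train(duration_s=2.0, freq_hz=5.0, pulse_width_s=0.010):
    """Return (onset, offset) times in seconds for an evenly spaced pulse train.

    Assumption (not stated in the text): the first pulse starts at t = 0,
    i.e. at the reinforced lever press.
    """
    n_pulses = round(duration_s * freq_hz)   # 2 s * 5 Hz = 10 pulses
    period_s = 1.0 / freq_hz                 # 0.2 s between pulse onsets
    return [(i * period_s, i * period_s + pulse_width_s) for i in range(n_pulses)]


train = pulse_train()
n_pulses = len(train)
light_on_s = sum(offset - onset for onset, offset in train)  # total light-on time
duty_cycle = light_on_s / 2.0                                # fraction of the 2 s window

print(f"{n_pulses} pulses, {light_on_s * 1000:.0f} ms of light, duty cycle {duty_cycle:.1%}")
```

Under these assumptions each reinforced press delivers 10 pulses, so the light is on for about 100 ms of the 2 second window (a 5% duty cycle).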

Figure 1

Optogenetic self-stimulation of striatonigral and striatopallidal DLS neurons supports the reinforcement of different action strategies.

(A) Acquisition of lever-pressing for ChR D1-Cre animals (n = 6) and YFP controls (n = 6). (B) Acquisition of lever-pressing for ChR D2-Cre animals (n = 10) and YFP controls (n = 9). (C) Difference in pressing from the first to the last day of training for ChR and YFP D1-Cre, for active and inactive levers. (D) Difference in pressing from the first to the last day of training for ChR and YFP D2-Cre, for active and inactive levers. (E,F) Probability of transition from an active lever press to a subsequent active lever press (versus an inactive press) for (E) ChR D1-Cre and (F) ChR D2-Cre. (G,H) Contingency degradation and reinstatement for (G) D1-Cre and (H) D2-Cre. Mean ± s.e.m. plotted in all graphs; LD: last day of training; CD: contingency degradation day; R: reinstatement day. ∗ denotes p < 0.05.

These data suggest that stimulation of both dMSNs and iMSNs in DLS is reinforcing and not aversive, but leads to the development of different action strategies. To better characterize this dichotomy, we calculated the probability of pressing the active versus the inactive lever. D1-Cre animals expressing ChR showed a steady increase in the probability of pressing the active lever (F(1,10) = 688.3, P < 0.0001, Supplemental Figure S1F), while D2-Cre animals converged to a similar probability of pressing either lever with training. To further investigate whether this pressing pattern resulted from action generalization, or from avoidance of the active lever by shifting to the inactive lever after an active press, we calculated the probability of an active-to-active transition (or conversely, an active-to-inactive transition, Figure 1E,F). D1-Cre animals reached a high probability of making an active press following an active one (F(1,10) = 310.9, P < 0.0001, Figure 1E). D2-Cre animals presented a slightly but significantly higher probability of pressing the active lever after an active press throughout training (F(1,18) = 13.38, P = 0.0018, although close to chance, Figure 1F), indicating that D2-Cre mice were not just shifting to the inactive lever after an active lever press. These data suggest that iMSN self-stimulation leads to more generalization between levers, which is consistent with the role of DLS in generalization and habit learning [5].

To evaluate whether the actions of both groups were equally sensitive to action-stimulation contingency, we performed a contingency degradation (CD) session, where light stimulation was non-contingent upon lever-pressing. D1-Cre animals decreased the number of presses during CD (Figure 1G, last day versus CD for ChR-A animals: P < 0.001), and resumed their lever-pressing behavior during contingency reinstatement (CD versus reinstatement for ChR-A animals: P < 0.01). D2-Cre animals, on the other hand, presented no changes in pressing during CD (Figure 1H), suggesting that pressing in these animals is less sensitive to action-stimulation contingency.

Here we show that self-stimulation of both striatonigral and striatopallidal DLS neurons is sufficient to positively reinforce actions, but that stimulation of each pathway supports the learning of different action strategies. While dMSN stimulation resulted in rapid task acquisition, selective pressing of the active lever, and sensitivity to changes in contingency, iMSN self-stimulation resulted in slower lever-press acquisition, generalized pressing between active and inactive levers, and insensitivity to changes in contingency. Since activity in both pathways precedes lever-pressing [6], plasticity associated with instrumental learning could be occurring at recently active corticostriatal synapses (and be different for dMSN and iMSN synapses). Alternatively, stimulation of MSNs could specifically select inputs onto cortical neurons that were previously active through the cortico-basal ganglia-thalamocortical loop. These results suggest that pairing activation of DLS dMSNs with an action supports goal-directed learning, while pairing activation of DLS iMSNs with an action supports the formation of stimulus-response habits [5]. These conclusions are consistent with the role of long-lasting plasticity of glutamatergic inputs into DLS striatopallidal neurons in habit formation and skill consolidation [4,8]. They also raise the possibility that DLS might not be homogeneously involved in habit formation; the direct and indirect pathways in DLS could support different action strategies and compete for action control. These results are also consistent with the involvement of both striatal projection pathways in action selection, with dMSNs supporting the execution of the desired actions and iMSNs inhibiting the execution of competing actions [6,9].

These roles may be different in DMS, where striatonigral and striatopallidal neurons seem to have opposite roles in reinforcement [2]. But it could be that other factors, such as the stimulation protocol (Supplemental Figure S2A,B) or the actions/task used, explain the differences. Still, it is clear from these results that self-stimulation of iMSNs in DLS is not aversive. In this context, it is interesting to note that optogenetic stimulation of iMSNs leads to the activation of a subset of cortical M1 neurons [10], and that inactivation of iMSNs does not necessarily increase basal ganglia output activity [7], underscoring that the functional organization of basal ganglia is more complex than classically proposed.
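The two press-sequence measures used in the analysis above (the probability of pressing the active lever, and the probability that an active press is followed by another active press) can be sketched in Python as follows. This is a minimal illustration, not the authors' analysis code: the encoding of presses as `"A"` (active) and `"I"` (inactive), the function names, and the example sequence are all hypothetical, and the text does not state how presses were pooled across sessions or animals.

```python
from collections import Counter


def p_active(presses):
    """Fraction of all lever presses that were on the active lever."""
    return presses.count("A") / len(presses)


def p_active_to_active(presses):
    """Probability that a press following an active press is also active.

    Returns None when no active press has a successor. The active-to-inactive
    transition probability is simply 1 minus this value.
    """
    transitions = Counter(zip(presses, presses[1:]))  # consecutive press pairs
    after_active = transitions[("A", "A")] + transitions[("A", "I")]
    if after_active == 0:
        return None
    return transitions[("A", "A")] / after_active


# Hypothetical press sequence for one session (made-up data for illustration).
session = list("AAIAAAIA")
prob_active = p_active(session)
prob_aa = p_active_to_active(session)
print(f"P(active) = {prob_active:.2f}, P(A->A) = {prob_aa:.2f}")
```

In this made-up sequence, 6 of 8 presses are active and 3 of the 5 presses that follow an active press are also active. A value near 0.5 for the transition probability would correspond to the near-chance behavior described for D2-Cre animals, while a value well below 0.5 would indicate systematic switching away from the active lever.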

References

1. Antagonistic but Not Symmetric Regulation of Primary Motor Cortex by Basal Ganglia Direct and Indirect Pathways.

Authors: Ian A Oldenburg; Bernardo L Sabatini
Journal: Neuron Date: 2015-06-03 Impact factor: 17.173

2. By carrot or by stick: cognitive reinforcement learning in parkinsonism.

Authors: Michael J Frank; Lauren C Seeberger; Randall C O'Reilly
Journal: Science Date: 2004-11-04 Impact factor: 47.728

3. The basal ganglia: focused selection and inhibition of competing motor programs. (Review)

Authors: J W Mink
Journal: Prog Neurobiol Date: 1996-11 Impact factor: 11.685

4. D1 and D2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons.

Authors: C R Gerfen; T M Engber; L C Mahan; Z Susel; T N Chase; F J Monsma; D R Sibley
Journal: Science Date: 1990-12-07 Impact factor: 47.728

5. Genetic deletion of A2A adenosine receptors in the striatum selectively impairs habit formation.

Authors: Chunxiu Yu; Jay Gupta; Jiang-Fan Chen; Henry H Yin
Journal: J Neurosci Date: 2009-12-02 Impact factor: 6.167

6. Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill.

Authors: Henry H Yin; Shweta Prasad Mulcare; Monica R F Hilário; Emily Clouse; Terrell Holloway; Margaret I Davis; Anita C Hansson; David M Lovinger; Rui M Costa
Journal: Nat Neurosci Date: 2009-02-08 Impact factor: 24.884

7. Distinct roles for direct and indirect pathway striatal neurons in reinforcement.

Authors: Alexxai V Kravitz; Lynne D Tye; Anatol C Kreitzer
Journal: Nat Neurosci Date: 2012-06 Impact factor: 24.884

8. Different dorsal striatum circuits mediate action discrimination and action generalization.

Authors: Mónica Hilario; Terrell Holloway; Xin Jin; Rui M Costa
Journal: Eur J Neurosci Date: 2012-04 Impact factor: 3.386

9. Concurrent activation of striatal direct and indirect pathways during action initiation.

Authors: Guohong Cui; Sang Beom Jun; Xin Jin; Michael D Pham; Steven S Vogel; David M Lovinger; Rui M Costa
Journal: Nature Date: 2013-01-23 Impact factor: 49.962

10. Balanced activity in basal ganglia projection pathways is critical for contraversive movements.

Authors: Fatuel Tecuapetla; Sara Matias; Guillaume P Dugue; Zachary F Mainen; Rui M Costa
Journal: Nat Commun Date: 2014-07-08 Impact factor: 14.919
