| Literature DB >> 30559117 |
Patrick T Piantadosi1, Abby G Lieberman1, Charles L Pickens1, Hadley C Bergstrom1, Andrew Holmes1.
Abstract
Cognitive flexibility refers to various processes which enable behaviors to be modified on the basis of a change in the contingencies between stimuli or responses and their associated outcomes. Reversal learning is a form of cognitive flexibility which measures the ability to adjust responding based on a switch in the stimulus-outcome contingencies of, typically two, perceptually distinct stimuli. Reversal tasks have provided valuable insight into the neural basis of cognitive flexibility, implicating brain regions including the lateral orbitofrontal cortex (lOFC) and dorsomedial prefrontal cortex (dmPFC). However, with two-stimulus reversal, it is difficult to determine whether response errors are due excessive perseveration, deficient learning, or other problems with updating. To address this limitation, we developed a mouse three-choice touchscreen-based visual reversal task, in which the contingencies of two stimuli were switched on reversal but a third, simultaneously presented, stimulus was never reinforced. We found that, in male C57BL/6J mice, responding at the previously rewarded stimulus predominated over the newly and never-reinforced stimuli during early reversal. Next, we showed that acute pharmacological inhibition of lOFC, but not dmPFC, impaired early reversal performance, relative to noninactivated controls. Interestingly, however, lOFC inactivation deficits were characterized by increased choice of the never-reinforced stimulus and a decrease in (perseverative-like) responding at the previously rewarded stimulus. These effects are inconsistent with the historical notion of lOFC mediating response inhibition and closer to recent views of the lOFC's role in response/outcome tracking. Overall, these findings provide initial support the utility of this novel paradigm for studying cognitive flexibility and its underlying neural substrates. Published by Cold Spring Harbor Laboratory Press.Entities:
Mesh:
Year: 2018 PMID: 30559117 PMCID: PMC6298539 DOI: 10.1101/lm.048264.118
Source DB: PubMed Journal: Learn Mem ISSN: 1072-0502 Impact factor: 2.460
Figure 1.Performance in a novel three-choice visual discrimination and reversal touchscreen paradigm. (A) A mouse-eye (upper) and overhead (lower) view of the apparatus. (B) Visual stimuli used for discrimination and reversal (note, the designation of bars and dots as the S+ on discrimination, and the corresponding Sprior designation on reversal, was counterbalanced across mice and did not affect performance). (C) Behavioral raster from single training sessions of a representative mouse at each relevant training phase. Blue lines represent choice of the S+ or Sprior during discrimination (left) or reversal (right) sessions. Red lines represent choice of the S1− (discrimination sessions, left) or Snew (reversal sessions, right), while green lines represent the S2− (discrimination sessions, left) or Snever (reversal sessions, right). The maximum session length was 3600 sec (1 h). Note the shift from random stimulus selection, to efficient discrimination displayed across discrimination, and then the marked degree of perseveration in early reversal, followed by characteristic sampling and then reversal during the middle and late reversal phases, respectively. (D) The percentage of S+/Sprior responses increased across discrimination and remained high at early reversal before decreasing across subsequent reversal training (task × phase interaction: F(2,26) = 217.9, P < 0.0001; effect of task: F(1,13) = 0.68, P = 0.4245; effect of phase: F(2,26) = 9.51, P = 0.0008, followed by individual comparisons via Tukey's post-hoc tests). (E) The percentage of S1−/Snew responses decreased from early to late discrimination and progressively increased over reversal (task × phase interaction: F(2,26) = 239.1, P < 0.0001; effect of task: F(1,13) = 34.32, P < 0.0001; effect of phase: F(2,26) = 24.25, P < 0.0001, followed by Tukey's post-hoc tests). (F) The percentage of S2−/Snever choice was low by late discrimination and did not change during reversal (task × phase interaction: F(2,26) = 4.01, P = 0.0302; effect of task: F(1,13) = 43.44, P < 0.0001; effect of phase: F(2,26) = 22.52, P < 0.0001, followed by Tukey's post-hoc tests). (G) Strings of consecutive S+/Sprior responses increased over discrimination and decreased over reversal (task × phase interaction: F(2,26) = 5.50, P < 0.0102; effect of task: F(1,13) = 26.26, P = 0.0002; effect of phase: F(2,26) = 19.86, P < 0.0001, followed by Tukey's post-hoc tests). (H) Strings of S1−/Snew responses were decreased by late discrimination before increasing across reversal (task × phase interaction: F(2,26) = 91.38, P < 0.0001; effect of task: F(1,13) = 34.57, P < 0.0001; effect of phase: F(2,26) = 25.87, P < 0.0001, followed by Tukey's post-hoc tests). (I) S2−/Snever strings were reduced at late discrimination and stayed flat across reversal (task × phase interaction: F(2,26) = 2.39, P = 0.1116; effect of task: F(1,13) = 14.92, P = 0.0020; effect of phase: F(2,26) = 10.03, P < 0.0006, followed by Tukey's post-hoc tests). n = 14. Data are means ± SEM. (*) P < 0.05.
Figure 2.Inactivation of the lOFC, but not dmPFC, disrupts early three-choice reversal performance. (A) Experimental timeline for pretraining, discrimination testing, and early reversal inactivation. (B) Ventral extent of the infusion located in lOFC. (C) Inactivation of lOFC resulted in a decrease in the percentage of trials ending in Sprior choices (left panel; corrected t(16) = 3.09, P < 0.0210), without affecting the percentage of Snew trials (middle panel; corrected t(16) = 0.80, P > 0.05). In contrast, the percentage of trials ending in Snever choices were increased following lOFC inactivation (right panel; corrected t(16) = 2.56, P < 0.0412). (D) lOFC inactivation decreased the average length of strings of Sprior (left panel; t(16) = 2.72, P = 0.0151) without affecting strings of Snew responses (middle panel; t(16) = 0.24, P = 0.8157). The length of strings of Snever responses tended to be increased by lOFC inactivation (right panel; t(16) = 2.12, P = 0.0502). (E) Ventral extent of the infusion located in dmPFC. (F,G) Inactivation of the dmPFC did not significantly affect any measure (all t-values < 1.5, all P-values > 0.15). n = 9–10 per group. Data are means ± SEM. (*) P < 0.05.
Figure 3.Effects of lOFC and dmPFC inactivation on late three-choice reversal performance. (A) Experimental timeline for post-reversal criterion inactivation. (B) lOFC inactivation did not significantly affect the percentage of Sprior (left panel; corrected t(16) = 2.03, P = 0.1160), Snew (middle panel; corrected t(16) = 2.39, P = 0.0856), or Snever (right panel; corrected t(16) = 1.99, P = 0.1160) choices. (C) The average length of Sprior strings was increased by lOFC inactivation (left panel; t(16) = 2.61, P = 0.0190), while the average Snew string length was diminished by lOFC inactivation (middle panel; t(16) = 2.45, P = 0.0261). In contrast, Snever string length was not altered by lOFC inactivation (right panel; t(14) = 0.98, P = 0.3450). (D,E) Inactivation of the dmPFC at late reversal did not significantly affect any measure (all t-values <1.5, all P-values > 0.15). n = 9–10 per group. Data are means ± SEM. (*) P < 0.05.