| Literature DB >> 28240299 |
Yoshio Iguchi1,2, Ziqiao Lin1, Hiromi Nishikawa1, Yoshio Minabe1,3, Shigenobu Toda1,3.
Abstract
The distinction between goal-directed action and habitual response, particularly with respect to moderate or extended appetitive instrumental training, is well documented; however, the propensity toward instrumental behavior in the early training stage has not been elucidated. In this study, we trained Sprague Dawley rats to press a lever to obtain food as an outcome for various time periods and monitored the changes in their sensitivity to outcome devaluation and choice between the levers they had been trained with and unfamiliar levers. After the extensive training with a random interval schedule, the rats were insensitive to outcome devaluation, and exhibited a typical habit-like phenotype, as previously reported, and the untrained leverpresses were relatively rare and sporadic. During the initial stage of training (≤1 week), the rats exhibited a similar insensitivity to the devaluation; however, in contrast to the overtrained condition, they performed distinctive unbiased leverpresses on both the trained and untrained levers. Thus, we propose a possibility that, contrary to the authentic concept that instrumental learning is initiated with an outcome devaluation-sensitive goal-directed stage, under some conditions, this learning can unconventionally begin with the initial stage that is distinct from both goal-directed action and habitual response.Entities:
Mesh:
Year: 2017 PMID: 28240299 PMCID: PMC5327391 DOI: 10.1038/srep43307
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sensitivity to outcome devaluation was exhibited in the middle but not the initial or late stages of instrumental training.
(a) Timeline for the entire experimental procedure. Each group underwent assigned RI 60-s training sessions (Group A, 2 days, n = 11; Group B, 8 days, n = 11; and Group C, 16 days, n = 10), before undergoing outcome devaluation (box d) and two-lever tests (Fig. 2). For the rats in Group C, leverpress was retrained on an RI 60-s schedule for more 15 days (Group C’). Second devaluation and two-lever tests were conducted in the same manner as that of the first set. (b) Mean ( ± SEM) leverpresses per minute over RI 60-s training sessions. (c) Mean (+SEM) consumptions of lab chow and the training outcome during the 1-h prefeeding period in the outcome devaluation test. (d) Mean ( ± SEM) leverpresses in the 3-min extinction test following 1-h prefeeding of lab chow (non-devalued condition) or the training outcome (devalued condition). Group C exhibited selectively decreased leverpresses in the devalued condition compared with the non-devalued condition (*p < 0.05).
Figure 2Response balance between the trained and untrained levers changed when instrumental training was repeated.
(a) Mean (±SEM) leverpresses during the 5-min two-lever test where animals were presented simultaneously with both trained and untrained levers (Group A, n = 11; Group B, n = 11; Group C, n = 10; and group C’, n = 10). Groups B, C, and C’ exhibited significant performance bias toward the trained lever in terms of the leverpress rate (*p < 0.05), whereas Group A did not. (b) The mean discrimination–generalization index was calculated for each group based on the scores shown in box a. Discrimination–generalization index = (X−Y)/(X + Y), where X and Y are the trained and untrained leverpresses, respectively. The 95% confidential intervals of the means are also shown. (c) Temporal plots of leverpresses by each animal for both the trained and untrained levers in the two-lever test (Group A, n = 10; Group B, n = 11; Group C, n = 7; and Group C’, n = 7). (d) According to the temporal plots in box c, we determined the time intervals for the nearest neighbor of each leverpress. For each rat, the median of the intervals was calculated and the group means (+SEM) of the individual median scores are shown separately. In Groups C and C’, the interval to the nearest neighbor of the untrained lever was significantly larger than that of the trained lever (*p < 0.05).