| Literature DB >> 30532341 |
Omar D Pérez1,2, Michael R F Aitken3, Amy L Milton1, Anthony Dickinson1.
Abstract
The higher response rates observed on ratio than on matched interval reward schedules has been attributed to the differential reinforcement of longer inter-response times (IRTs) on the interval contingency. Some data, however, seem to contradict this hypothesis, showing that the difference is still observed when the role of IRT reinforcement is neutralized by using a regulated-probability interval schedule (RPI). Given the mixed evidence for these predictions, we re-examined this hypothesis by training three groups of rats to lever press under ratio, interval and RPI schedules across two phases while matching reward rates within triads. At the end of the first phase, the master ratio and RPI groups responded at similar rates. In the second phase, an interval group yoked to the same master ratio group of the first phase responded at a lower rate than the RPI group. Post-hoc analysis showed comparable reward rates for master and yoked schedules. The experienced response-outcome rate correlations were likewise similar and approached zero as training progressed. We discuss these results in terms of a contemporary dual-system model of instrumental conditioning.Entities:
Keywords: Dual-system theories; Goal-directed behavior; Interval schedule; Ratio schedule; Reinforcement schedules
Year: 2018 PMID: 30532341 PMCID: PMC6264404 DOI: 10.1016/j.lmot.2018.07.003
Source DB: PubMed Journal: Learn Motiv ISSN: 0023-9690
Design of the Experiment. The master group is written in bold. The subscript "y" signifies that the group was yoked to the master group with respect to reward rate.
| Group | |||
|---|---|---|---|
| Phase | RR | RPI | RR/RI |
| 1 (10 sessions) | RPI-y | RR-10 | |
| 2 (10 sessions) | RPI-y | RI-y |
Fig. 1Response rates for phases 1 and 2 of the experiment.A. Average response rates maintained by rats in the last 3 sessions of training in each phase of the study. B. Average response rates across the 10 sessions of training in each phase of the study. * p < .05, ****p < .001, ns: not significant. Error bars represent 95% bootstrapped confidence intervals.
Results of the yoking procedure. Mean and 95% bootstrapped confidence intervals for the last three sessions of phases 1 and 2 of the study. The reward rates are in rewards per min; the IRT difference is calculated as the difference between the mean reinforced IRT and the mean IRT emitted by each rat in seconds. The letter “y” signifies that the group was yoked to the reward rate obtained in the master ratio group.
| Phase | Variable | Schedule | ||
|---|---|---|---|---|
| RR-20 (master) | RPI-y | RR-10 | ||
| 1 | Rewards per minute | 1.32 [1.16, 1.48] | 1.46 [1.27, 1.64] | 3.64 [3.34, 3.94] |
| IRT difference (sec) | −0.52 [−9.67, 8.63] | −1.77 [−2.26, -1.28] | −6.52 [−7.67, −5.37] | |
| RR-20 (master) | RPI-y | RI-y | ||
| 2 | Rewards per minute | 1.79 [1.53, 2.05] | 2.08 [1.76, 2.38] | 1.62 [1.39, 1.85] |
| IRT difference (sec) | −14.89 [−18.43, −11.35] | −16.53 [−19.77, −13.29] | 3.31 [2.45, 4.17] | |
Fig. 2Response-reward rate correlations experienced by each group in Phase 1 and Phase 2 of the experiment. The values shown are mean and 95% bootstrapped confidence intervals for the first (early) and last (late) three sessions of training for each rat and group. (see Appendix for details).