| Literature DB >> 30966915 |
Trevor Humby1,2,3,4, Yateen Patel1,2,3,4, Jenny Carter1,2,3,4, Laura-Jean G Stokes5, Robert D Rogers5, Lawrence S Wilkinson1,2,3,4.
Abstract
People, like animals, tend to choose the variable option when given the choice between a fixed and variable delay to reward where, in the variable delay condition, some rewards are available immediately (Laura-Jean et al. 2019 Phil. Trans. R. Soc. B 374, 20180141. ( doi:10.1098/rstb.2018.0141 )). This bias has been suggested to reflect evolutionary pressures resulting from food scarcity in the past placing a premium on obtaining food quickly that can win out against the risks of sometimes sustaining longer delays to food. The psychologies mediating this effect may become maladaptive in the developed world where food is readily available contributing, potentially, to overeating and obesity. Here, we report our development of a novel touchscreen task in mice allowing comparisons of the impact of food delay and food magnitude across species. We show that mice exhibit the typical preference, as shown by humans, for variable over fixed delays to rewards but no preference when it comes to fixed versus variable reward amounts and further show that this bias is sensitive to manipulations of the 5-HT2C receptor, a key mediator of feeding and impulse control. We discuss the data in terms of the utility of the task to model the psychologies and underlying brain mechanisms impacting on feeding behaviours. This article is part of the theme issue 'Risk taking and impulsive behaviour: fundamental discoveries, theoretical perspectives and clinical implications'.Entities:
Keywords: 5-HT2CR; SB242084; WAY161503; feeding; response control; risk sensitivity
Mesh:
Substances:
Year: 2019 PMID: 30966915 PMCID: PMC6335459 DOI: 10.1098/rstb.2018.0144
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Apparatus and task schedule for the fixed versus variable delay to reward (FST-D) or fixed versus variable reward amount (FST-R) food scheduling tasks. (a) The apparatus used was a touchscreen chamber (Campden Cognition, UK), consisting of a trapezoid-shaped animal enclosure with a touch-sensitive screen making up the larger wall. (b) A trial started with presentation of two stimuli to the screen, touching either one would lead to reward presentation following a delay. Delay durations in the FST-D task or reward amount in the FST-R task, and for the probe manipulation sessions, were determined by the response made (see figure 2a). Different pairs of stimuli were used for each phase of the task and for each new test within each phase (see electronic supplementary material, methods figure S1 for stimuli pairs used).
Figure 2.Schematic of the experimental design and procedures. For both the fixed versus variable delay to reward (FST-D) and fixed versus variable reward amount (FST-R) food-scheduling tasks, the subjects were presented with a choice between a fixed option and a variable option, which had two possible outcomes (1 : 1 probability of presentation). The combination of delay durations and reward quantities for each task (a) were determined such that the coefficients of reinforcement were equivalent [36,37]. Mice were initially shaped to respond to the touchscreens with presentation of a single stimulus to the screen (SST), before proceeding through the FST-R and FST-D tasks, with task order counter-balanced between subjects (b). Between the tasks, the mice were given three transition sessions of SST to stabilize behaviour. Performance criteria for reversal: consistent/stable choice performance (>75% choice preference) for more than two sessions, maximum of 20 sessions per phase.
Figure 3.Preference for responding in the fixed versus variable delay to reward (FST-D) and fixed versus variable reward amount (FST-R) food-scheduling tasks. For each task, mice were given a maximum of 20 sessions to reach an arbitrary 75% preference criterion. (a) In the FST-D, all mice initially showed no choice preference but a majority (34/44) did reach the preference criterion within the 20 testing sessions available; the mice that did not reach the preference criterion (10/44) still showed a significant above-chance preference. (b) The 34 mice that reached the 75% performance criterion in less than 20 sessions in the FST-D task were further tested in reversal. After an initial disruption to behaviour, leading to the temporary abolition of the preference for the variable delay condition, this preference was rapidly re-established post-reversal with all of the mice reaching the 75% preference criterion within 20 sessions. (c) In the FST-R task, however, the initial lack of preference for fixed or variable amounts of reward persisted throughout the 20 test sessions available. This was the case for the large majority (42/44) of the animals but two animals did show a preference for the variable amount of reward choice and indeed reached the arbitrary 75% preference criterion, the data for these two mice have been included for completeness. Data show mean ± s.e.m. N values for each condition are shown within the individual bar charts. *** denotes p < 0.001 for choice preference in comparison to chance (no preference) performance.
Figure 4.Effects of 5-HT2CR antagonism and agonism on the preference for variable or fixed delays to reward. Data showing the effects of increasing doses of (a) the selective 5-HT2CR antagonist SB212084 and (b) selective 5-HT2CR agonist WAY161503 on choice preference for fixed or variable delays to reward compared to vehicle (0.9% saline). Choice preferences were assessed following 5-HT2CR manipulations with an acute single, extended session/dose-modified FST-D protocol, with new combinations of stimuli for each drug dose. SB212084 was administered s.c. and WAY161503 i.p. Data are mean ± s.e., N = 34. * denotes p < 0.05 for comparison between vehicle and drug dose, and ### and # denote p < 0.001 and p < 0.05 for comparison to chance (no preference) performance.