| Literature DB >> 21833291 |
John M Pearson1, Benjamin Y Hayden, Michael L Platt.
Abstract
Animals are notoriously impulsive in common laboratory experiments, preferring smaller, sooner rewards to larger, delayed rewards even when this reduces average reward rates. By contrast, the same animals often engage in natural behaviors that require extreme patience, such as food caching, stalking prey, and traveling long distances to high-quality food sites. One possible explanation for this discrepancy is that standard laboratory delay discounting tasks artificially inflate impulsivity by subverting animals' common learning strategies. To test this idea, we examined choices made by rhesus macaques in two variants of a standard delay discounting task. In the conventional variant, post-reward delays were uncued and adjusted to render total trial length constant; in the second, all delays were cued explicitly. We found that measured discounting was significantly reduced in the cued task, with discount parameters well below those reported in studies using the standard uncued design. When monkeys had complete information, their decisions were more consistent with a strategy of reward rate maximization. These results indicate that monkeys, and perhaps other animals, are more patient than is normally assumed, and that laboratory measures of delay discounting may overstate impulsivity.Entities:
Keywords: discounting; foraging; impulsivity; macaque; neuroeconomics
Year: 2010 PMID: 21833291 PMCID: PMC3153841 DOI: 10.3389/fpsyg.2010.00237
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Task design for the uncued (A) and cued (B) variants of the inter-temporal choice task. In the cued version of the task, pairs of targets appeared after a brief fixation period. In a third variant, monkeys had to maintain fixation to shrink the bar. Lengths of vertical bars represented delays to reward, colored stripes differing juice volumes. Monkeys were required to briefly hold fixation on the target of their choice (0.5 s), after which the unselected cue vanished and the bar corresponding to the chosen option shrank at a fixed rate. Reward was delivered when the top of the bar reached the colored stripe. In the uncued paradigm, the screen remained blank through the post-reward delay. In the cued paradigm, the remaining bar corresponded to the post-reward delay and continued to shrink through the end of the trial.
Model specifications for the 13 models used.
| Number | Discounting | Value ( | Decision variable | Choice function | Parameters | Half-life (γ) |
|---|---|---|---|---|---|---|
| 1 | – | max{ | Bernoulli | 1 | – | |
| 2 | – | Logistic | 3 | – | ||
| 3 | Hyperbolic | Logistic | 4 | 1/ | ||
| 4 | Hyperbolic | Logistic | 5 | (1 + | ||
| 5 | Hyperbolic | Logistic | 5 | (1 + | ||
| 6 | Exponential | Logistic | 4 | |||
| 7 | Exponential | Logistic | 5 | |||
| 8 | Marginal gain | – | Logistic | 3 | – | |
| 9 | – | Logistic | 3 | – | ||
| 10 | Reward rate | Logistic | 3 | 2 | ||
| 11 | Hyperbolic | Logistic | 4 | 1/ | ||
| 12 | Hyperbolic | Logistic | 5 | (1 + | ||
| 13 | Hyperbolic | Logistic | 4 | κ |
*Strictly speaking, this formula is only valid on average, since we have D + τ = T every trial, but D + τ −1 = T only on average.
†Because of differences in the way in which the separate decision variables (v − v′ vs v/v′) enter the logistic choice function, Models 3 and 4 also include within themselves equivalents of these two models, in which there is no constant in the denominator. In addition, Model 12 is clearly equivalent to Model 13 in the limit of large k.
Variables used: r, reward; D, delay to reward; τ, post-reward delay; τ−1, post-reward delay (previous trial); T, total trial length (T = D + τ). k and κ are fit to the data.
Percentage of LL choices, average trial lengths, and earned and maximum reward rates as a function of task condition.
| Task | Subject | % LL choices | ERR ( | MRR ( | Avg trial length (s) | % Rwd earned |
|---|---|---|---|---|---|---|
| Uncued | E | 0.520 ± 0.009 | 36.6 ± 0.1 | 41.3 ± 0.9 | 6.05 ± 0.01 | 0.887 ± 0.006 |
| O | 0.500 ± 0.007 | 36.9 ± 0.8 | 41.3 ± 0.8 | 6.17 ± 0.01 | 0.894 ± 0.004 | |
| Cued | E | 0.661 ± 0.009 | 43.0 ± 1.0 | 46.5 ± 0.9 | 5.22 ± 0.02 | 0.925 ± 0.005 |
| O | 0.714 ± 0.011 | 40.5 ± 1.4 | 43.3 ± 1.2 | 5.85 ± 0.04 | 0.935 ± 0.006 | |
| Cued, | E | 0.810 ± 0.006 | 37.9 ± 1.0 | 39.5 ± 0.9 | 6.50 ± 0.00 | 0.960 ± 0.003 |
| Fixated | O | 0.704 ± 0.011 | 39.6 ± 1.6 | 42.4 ± 1.4 | 6.20 ± 0.01 | 0.934 ± 0.006 |
Error measures are ±SEM. ERR, earned reward rate; MRR, maximum reward rate.
Figure 3Model-averaged half-lives for both monkeys, calculated using only trials with total length 6. 5 s. Cued, combined refers to data pooled across the two cued conditions for each subject. Error bars represent 95% confidence intervals for the means, calculated by bootstrap resampling.
Figure 2Discounting rates decrease with explicit information about post-reward delay. Discount parameters were estimated by a maximum likelihood analysis of a hyperbolic model (see Materials and Methods). Error bars represent 95% confidence intervals obtained by bootstrap analysis. (A) Estimated discount parameter values were significantly lower for both of the two paradigms in which post-reward delays were cued, though not significantly different from one another. (B) Model-averaged effective half-lives were significantly increased in both explicit cuing paradigms.
Half-lives and confidence intervals across task conditions.
| Task | Monkey E | Monkey O | ||
|---|---|---|---|---|
| Half-life | 95% | Half-life | 95% | |
| Uncued | 1.71 | [1.53, 2.07] | 5.68 | [3.71, 7.46] |
| Cued | 6.93 | [4.92, 9.61] | 10.44 | [7.21, 14.41] |
| Cued, Fixated | 8.45 | [2.24, 10.56] | 21.61 | [14.75, 38.57] |
Half-lives are averaged across models, weighted by Akaike weights. Confidence intervals are determined by bootstrap resampling of the data for each task condition.
Akaike weights for each of the 13 fitted models and task conditions.
| Task | Subject | Model weight ( | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | ||
| Uncued | E | 0 | 0 | 0.0156 | 0.0056 | 0.0049 | 0 | 0 | 0 | 0 | 0 | 0.0677 | 0.9058 | 0.0003 |
| O | 0 | 0 | 0.5725 | 0.2106 | 0.2113 | 0 | 0 | 0 | 0 | 0 | 0.0023 | 0.0009 | 0.0023 | |
| Cued | E | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.2510 | 0.7490 |
| O | 0 | 0 | 0.0010 | 0.0004 | 0 | 0 | 0 | 0 | 0 | 0 | 0.6117 | 0.2897 | 0.0968 | |
| Cued, | E | 0 | 0 | 0.0306 | 0.0112 | 0.0111 | 0.0002 | 0 | 0 | 0 | 0 | 0.7019 | 0.2449 | 0 |
| Fixated | O | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0323 | 0.2535 | 0.7142 |
Values less than 10.
Figure 4Model-averaged half-lives for both monkeys over the course of training. In early sessions, cued and uncued conditions were interleaved in blocks of several hundred trials. Following this, both monkeys received several sessions of the cued condition only, followed by a return to the uncued condition. Cued, combined refers to data pooled across the two cued conditions for each subject. Final uncued data for Monkey O were collected after an extended hiatus from testing. Error bars represent 95% confidence intervals for the means, calculated by bootstrap resampling.