Literature DB >> 33060283

The influence of rewards on incidental memory: more does not mean better.

Si Cheng1, Ting Jiang1, Jingming Xue1, Songxue Wang1, Chuansheng Chen2, Mingxia Zhang3.   

Abstract

Studies have revealed that rewards promote long-term memory, even in an incidental way. However, most previous studies using the incidental paradigm have included two reward levels, and it is still not clear how the reward magnitude influences memory. Adopting the incidental paradigm and three reward levels, the current study revealed that the reward magnitude impacted 1-d delayed episodic memory in a nonlinear, inverted U-shaped pattern. An additional experiment showed that there was no reward effect in immediate episodic memory. Our results support the dopaminergic memory consolidation theory and further imply that the reward magnitude needs to be considered in the theory.
© 2020 Cheng et al.; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Year:  2020        PMID: 33060283      PMCID: PMC7571270          DOI: 10.1101/lm.051722.120

Source DB:  PubMed          Journal:  Learn Mem        ISSN: 1072-0502            Impact factor:   2.460


Recently, studies have revealed that a reward could facilitate memory (Shohamy and Adcock 2010; Miendlarzewska et al. 2016), not just in the intentional paradigm but also in the incidental paradigm. In the intentional paradigm, participants would strategically pay attention to memorize the reward-related items as they are explicitly informed that reward is contingent upon memory performance in a subsequent test when they encode the items (Adcock et al. 2006; Ariel and Castel 2014; Cohen et al. 2014, 2016; Spaniol et al. 2014; Hidi 2016; Miendlarzewska et al. 2016). In the incidental paradigm, the reward accompanies some items during the encoding phase but is independent of memory performance for these items in the test phase (Mather and Sutherland 2011; Murayama and Kitagami 2014). For example, some studies used the category of the items (e.g., living or nonliving objects) to signal rewards in the encoding phase (e.g., participants would anticipate a reward when they met a particular category) and observed a better memory for the reward-related category in the subsequent test (Wittmann et al. 2005; 2011; Patil et al. 2017). In this paradigm, participants always do not know the subsequent memory test before they process it; thus, the enhancement effect on the memory process is explained in terms of pure reward effect rather than top-down attention and effort. The reward effect on memory is associated with an increased dopamine release that improves the plasticity of the hippocampal memory system (Bromberg-Martin et al. 2010; Chiew et al. 2016). The dopaminergic modulation of the hippocampus could last a long time after reward events occur (e.g., in the memory consolidation phase) (Bayer and Glimcher 2005; Lisman and Grace 2005; Shohamy and Adcock 2010; Flagel et al. 2011; Gruber et al. 2016; Murty et al. 2017), and, correspondingly, behavioral evidence showed reward-related memory improvements in the delayed memory test but not in the immediate memory test (McClure et al. 2004; Murayama and Kitagami 2014; Patil et al. 2017). Previous studies using the intentional paradigm have revealed that reward values affected memory in a linear way (Castel et al. 2007, 2013); the greater the reward, the better the memory. However, most previous studies using the incidental paradigm revealed the reward's positive effect on memory by comparing items for two reward levels. It is still unclear how multiple levels of reward influence incidental memory. As many animal studies have revealed that the excitatory responses of midbrain dopaminergic neurons increase as the reward magnitude (or probability) increases (Fiorillo et al. 2003; Satoh et al. 2003; Matsumoto and Hikosaka 2009), one might expect that the impact of the dopaminergic modulation on the hippocampus as well as the reward magnitude on episodic memory occur in a simple linear (or on-off) manner. That is, we are more sensitive to a higher reward, and a higher magnitude would never lead to a worse performance than a lower reward. However, many studies have revealed that reward magnitude modulates performance in a more complex and nonlinear way. For example, previous studies revealed that reward magnitude influenced the performance of some tasks (e.g., motor learning, intelligence tests) (Kuhbandner et al. 2016; Lee et al. 2019) in an inverted U shape (Mobbs et al. 2009; Aarts et al. 2014; Lee and Grafton 2015). That is, a high reward leads to a less-than-optimal performance compared with a medium reward. Some researchers described this phenomenon as “overmotivation” or “overarousal” and suggested that performance was superior at the level of arousal and dopamine release (Dodson 1915) and that a high reward induced excessive arousal or “overdosed” the dopaminergic system beyond this level and led to a decrement in performance (Mobbs et al. 2009; Aarts et al. 2014). Other researchers have suggested that the negative effect of a high reward was due to the shift of attention and executive control to the internal state (e.g., anxiety) (Callan and Schweighofer 2008; Ariely et al. 2009; Capa and Bouquet 2018; Watanabe et al. 2019) or details of the task (Beilock and Carr 2005; Beilock and DeCaro 2007; Vealey et al. 2014; Lee and Grafton 2015; Lee et al. 2019) that led to a reduction in goal-directed attentional control for the target task. Thus, the negative effect would not occur in tasks that do not require a substantial amount of top-down strategy (e.g., an incidental motor learning task) (Lee et al. 2019). The inconsistent idea of the relationship between reward magnitude and performance motivated us to explore how reward magnitude influenced incidental episodic memory. Two experiments were conducted in the current study. Experiment 1 comprised an incidental encoding task and a subsequent surprise recognition test 1 d later to investigate the effect on delayed memory. Sixty healthy participants were randomly assigned to the control and reward groups. One participant was excluded for having memory performance more than three standard deviations below average, resulting in a total sample size of 30 (13 males; mean ± SD = 22.4 ± 3.02 yr) in the reward group and 29 (13 males; mean ± SD = 21.3 ± 2.36 yr) in the control group. In the encoding phase (Fig. 1A), a central fixation cross “+” was presented for 500 msec. Then, two images were displayed simultaneously on the left and right sides of the central fixation cross for 1500 msec, participants were instructed to judge the number of target images (living images or nonliving images, counterbalanced across participants in each group) in the two images within the following 1500 msec when a “?” was displayed. Informative feedback was provided for 1500 msec after the judgment. To prevent carryover effects (Anderson et al. 2006), participants performed three flanker tasks in which they judged the direction of the middle arrow of a set of three arrows.
Figure 1.

The incidental encoding task and recognition memory test. (A) In the incidental encoding task, participants were instructed to judge the number of target images (living images or nonliving images). In this example, the target images were living images. A correct response was followed by neutral feedback in the control group and monetary reward feedback based on the number of target images in the reward group. (B) A surprise memory test was performed 1 d later (Experiment 1) or immediately afterward (Experiment 2). Participants were asked to determine whether the images were old or new and to rate their confidence.

The incidental encoding task and recognition memory test. (A) In the incidental encoding task, participants were instructed to judge the number of target images (living images or nonliving images). In this example, the target images were living images. A correct response was followed by neutral feedback in the control group and monetary reward feedback based on the number of target images in the reward group. (B) A surprise memory test was performed 1 d later (Experiment 1) or immediately afterward (Experiment 2). Participants were asked to determine whether the images were old or new and to rate their confidence. Participants in the control and reward groups went through the same encoding procedure, except that they received different feedback for their judgment of the number of target images. A correct response was followed by neutral feedback in the control group and monetary reward feedback based on the number of target images in the reward group. That is, participants in the reward group received no reward (¥0), medium reward (¥1 ≈ 0.144 USD) and high reward (¥2) for correct judgments in nontarget trials, one-target trials and two-target trials respectively. The encoding phase included 20 nontarget (nonreward) trials, 40 one-target (medium-reward) trials, and 20 two-target (high-reward) trials. This design produced four types of images, each including 40 images (160 images in total): nontarget images in the nontarget trials (nonreward images in the nonreward trials), nontarget images in the one-target trials (nonreward images in the medium-reward trials), target images in the one-target trials (reward images in the medium-reward trials), and target images in the two-target trials (reward images in the high-reward trials).
Figure 2.

Results from Experiment 1. Recognition memory (d′ score) as a function of Group (reward, control) and Image Type (00 = nontarget images in the nontarget trials, 0[1] = nontarget images in the one-target trials, 1[0] = target images in the one-target trials, 11 = target images in the two-target trials) in the delayed test. Error bars represent the standard error.

Results from Experiment 1. Recognition memory (d′ score) as a function of Group (reward, control) and Image Type (00 = nontarget images in the nontarget trials, 0[1] = nontarget images in the one-target trials, 1[0] = target images in the one-target trials, 11 = target images in the two-target trials) in the delayed test. Error bars represent the standard error. Participants returned 1-d later (the mean interval between the encoding phase and the memory test was 23.73 h, SD = 1.51, in the reward group and was 24.00 h, SD = 1.44, in the control group) for a surprise recognition memory test (Fig. 1B) in which the 160 previously presented images and 160 new images were presented one by one. Participants were instructed to judge whether the images were old or new within 3000 msec and to rate their confidence within 3000 msec. Additional procedural details are in the Supplemental Material.
Figure 3.

Results from Experiment 2. Recognition memory (d′ score) as a function of Group (reward, control) and Image Type (00 = nontarget images in the nontarget trials, 0[1] = nontarget images in the one-target trials, 1[0] = target images in the one-target trials, 11 = target images in the two-target trials) in the immediate test. Error bars represent the standard error.

Results from Experiment 2. Recognition memory (d′ score) as a function of Group (reward, control) and Image Type (00 = nontarget images in the nontarget trials, 0[1] = nontarget images in the one-target trials, 1[0] = target images in the one-target trials, 11 = target images in the two-target trials) in the immediate test. Error bars represent the standard error. The analyses on participants’ response during the encoding phase were in the Supplemental Material and we focused on how reward magnitude influenced memory performance in the main text. We analyzed the memory performance of the participants’ correctly judged trials in the encoding phase. The memory performance was indexed by d′ score, calculated by the z score of the hit rates (the percentage of “Old” responses for the “Old” stimuli) minus the z scores of the false alarm rates (the percentage of “Old” responses for the “New” stimuli). The mean hit rates for nontarget images in nontarget trials, nontarget images in one-target trials, target images in one-target trials, and target images in two-target trials were respectively, 0.385, 0.416, 0.505, and 0.443 in the reward group and 0.421, 0.441, 0.434, and 0.436 in the control group. The mean false alarm rates for nontarget images and target images were respectively, 0.258 and 0.269 in the reward group and 0.304 and 0.335 in the control group. A 2 (Group: reward vs. control) × 4 (Image Type: nontarget images in nontarget trials vs. nontarget images in one-target trials vs. target images in one-target trials vs. target images in two-target trials) ANOVA with d′ score as the dependent variable revealed a significant main effect of Group, F(1,57) = 5.100, P = 0.028, η2 = 0.082, and a nonsignificant main effect of Image Type, F(2.28,129.89) = 1.519, P = 0.220, η2 = 0.026. The interaction of Group × Image Type was significant, F(2.28,129.89) = 4.466, P = 0.010, η2 = 0.073 (Fig. 2). Simple main effect analyses revealed that in the control group, there was no significant difference in memory performance across the four types of images, F(3,55) = 0.886, P = 0.454, whereas in the reward group, recognition memory performance was significantly different across the four types of images, F(3,55) = 4.383, P = 0.008. The pairwise comparison analyses revealed that participants in the reward group recognized the reward images in the medium-reward trials significantly better than the nonreward images in the nonreward trials, t = 3.19, P = 0.014, and the reward images in the high-reward trials, t = 2.77, P = 0.041. However, the reward images in the medium-reward trials were not significantly better recognized than the nonreward images in the medium-reward trials, t = 2.23, P = 0.165. The memory performance for the nonreward images in the nonreward and the medium-reward trials and for the reward images in the high-reward trials was not different, Ps > 0.4. That is, the reward images in the medium-reward trials were best recognized, and compared with the medium reward, the high reward did not further improve but instead impaired memory performance of reward images. To directly test the reward effect in the four image types, post-hoc t-tests at each level between the reward group and the control group were also conducted. The target images in the one-target trials and in the two-target trials were better recognized in the reward group than in the control group, t = 3.87, P < 0.001, and t = 2.11, P = 0.040, whereas the memory performance of nontarget images in the nontarget trials and in the one-target trials in the reward group was not significantly different from that in the control group, Ps > 0.3. It seems the high reward could also promote memory performance though it was not optimal. We also analyzed memory performance with participants’ subsequent confidence in their judgments. The overall pattern of the results with confidence was very similar to that without consideration of the confidence, which is presented in the Supplemental Material. Experiment 1 suggested reward magnitude impacted delayed memory in a nonlinear way. We further tested how reward magnitude affected immediate memory in Experiment 2. Experiment 2 was identical to Experiment 1 except that participants took the recognition test immediately after they finished the encoding task. Sixty participants were recruited and randomly assigned to two groups with one participant excluded for not following the task instructions, resulting in a total sample size of 30 (13 males; mean ± SD = 21.3 ± 2.14 yr) in the reward group and 29 (13 males; mean ± SD = 21.0 ± 2.24 yr) in the control group. Identical to that in Experiment 1, the analyses on participants’ response during the encoding phase were in the Supplemental Material and we focused on the reward effect on memory performance in the main text. In the test, the mean hit rates for nontarget images in nontarget trials, nontarget images in one-target trials, target images in one-target trials, and target images in two-target trials were respectively, 0.549, 0.555, 0.598, and 0.601 in the reward group and 0.551, 0.549, 0.550, and 0.547 in the control group. The mean false alarm rates for nontarget images and target images were 0.265 and 0.274 in the reward group and 0.287 and 0.236 in the control group. The results in Experiment 2 showed no significant main effect of Group, F(1,57) = 0.342, P = 0.561, η2 = 0.006, or Image Type, F(2.12,120.63) = 2.923, P = 0.055, η2 = 0.049, or interaction of Group × Image Type, F(2.12,120.63) = 0.016, P = 0.987, η2 = 0.000 (Fig. 3). The pattern of the results with confidence was similar to that without consideration of the confidence, which is also presented in the Supplemental Material. Taken together, our results showed that in the delayed test, the reward images in the medium-reward trials were better recognized than the nonreward images in the nonreward trials and the reward images in the high-reward trials. The memory performance did not differ between the nonreward images in the nonreward and medium-reward trials and the reward images in the high-reward trials. No reward effect on episodic memory was observed in the immediate test. These results demonstrated an inverted U pattern instead of a linear relationship between the reward magnitude and long-term incidental memory. Many previous studies have revealed that rewards facilitate memory and learning (Shohamy and Adcock 2010; Miendlarzewska et al. 2016). Consistent with these findings, the current study found that reward images (in the medium-reward trials) were better recognized than nonreward images. Some researchers have proposed a dopaminergic memory consolidation theory to explain the reward effect on memory, which suggests that a reward would trigger the release of dopamine that modulates the activity of the hippocampus and other memory-related regions in the consolidation (post-encoding) phase (Wittmann et al. 2005; Murayama and Kitagami 2014; Gruber et al. 2016). Other researchers have suggested that a reward could also impact memory by prompting people to use more effortful strategies on reward-related items (Cohen et al. 2014). Although the current study adopted the incidental paradigm to avoid the use of strategy, one may argue that reward images might automatically capture attention (Anderson et al. 2011; Anderson and Halpern 2017), and be encoded deeper than nonreward images (Murayama and Kitagami 2014), thus inducing an enhancement effect on memory performance. However, it should be noted that the effect only occurred in the delayed test, not in the immediate test, which should be explained by dopaminergic memory consolidation rather than the encoding receiving more attention and cognitive control efforts. More importantly, the reward images in the high-reward trials were recognized as worse than those in the medium-reward trials. To our knowledge, the current study was the first to reveal high reward impairment in incidental memory. Previous research has shown the negative effect of a high reward on the performance of other tasks (the “choking on money” phenomenon) and proposed two underlying mechanisms to explain the impairment effect of a high reward on performance: excess arousal (and dopamine release) and attention shift (Mobbs et al. 2009; Aarts et al. 2014; Lee and Grafton 2015; Lee et al. 2019). We suggest that in the current study, this effect was due to excess arousal and dopamine release rather than an attention shift to an internal state or to task details because we used an incidental instead of an intentional paradigm to exclude the attention effect. Again, the attention shift could not explain why this effect occurred only in the delayed test, not in the immediate test. One previous study revealed a negative relationship between arousal and intentional spatial memory in reward conditions across trials and participants and suggested arousal as a potential inhibitor in reward-related intentional learning (Murty et al. 2011). Future studies should investigate the physiological and neural bases underlying high reward impairment in incidental memory and learning. Here, we should caution that high rewards “impaired,” or had a “negative” effect on memory only when it was compared with the medium reward. When we compared the memory performance between the reward and control groups, we could see the positive effect of reward on the target images for both the one-target (medium-reward) and two-target (high-reward) levels. Thus, the reward could overall enhance memory performance regardless of magnitude and it had an especially positive effect on the medium level. Although previous studies suggested that dopamine is sent to the hippocampus in a nontargeted way (Adcock et al. 2006; Spaniol et al. 2014; Murty et al. 2017; Pu and Yu 2019) and hence rewards should enhance memory for stimuli nonselectively (Mather and Schoeke 2011; Murayama and Kitagami 2014), we found that the reward in our study only enhanced the memory for the target images, not for the nontarget images in the medium-reward trials. This lack of the reward effect on nontarget images was evident in both the immediate and delayed tests. We speculated that more attention or some kind of arousal-biased competition (Mather and Sutherland 2011) in favor of goal-relevant stimuli (target images) might have interfered with the processing of the simultaneously presented nontarget images and thus attenuated the reward effect on the nontarget images in the immediate test. In addition, a recent study revealed that the effect of dopamine would tag the worthiest items and reprocess them during sleep intensively (Asfestani et al. 2020). It is possible that although dopamine affected the processing of the target and nontarget images nonselectively during the encoding phase, it only tagged and reprocessed the target images during sleep and thus we observed the reward effect on the target images only in the delayed test. It also should be noted that, although many previous studies revealed the reward effect on the delayed test instead of the immediate test, some studies also reported the latter, for both intentional memory (Madan and Spetch 2012; Spaniol et al. 2014; Wolosin et al. 2013) and incidental memory (Murty and Adcock 2014). In the study by Murty and Adcock (2014), the immediate reward effect was observed on the salient items (the novel images that interrupted participant's expectancy) in the reward anticipation stage. It is possible that either the items’ salience or the participants’ strong expectancy or both might have increased the reward effect in the immediate test. Future studies should investigate the factors that influence the timing of the reward effect. In addition, the reversed effect of a high reward in the current study was inconsistent with the results of some previous studies that also included more than two levels of reward to test reward and memory (Wittmann et al. 2005; Castel et al. 2007; Callan and Schweighofer 2008; Bunzeck et al. 2010; Wittmann et al. 2011; Madan and Spetch 2012; Castel et al. 2013). These studies used different tasks (or paradigms, or intervals between encoding and test) from those in the current study. For example, previous research showed a linear pattern between reward magnitude and intentional memory in the immediate test (Castel et al. 2007, 2013). Thus, we should caution that reward magnitude may have a complex impact on memory and learning and may depend on many factors, such as task, paradigm, the interval between encoding and test, and personality traits (Callan and Schweighofer 2008). Finally, we should add a note about the theoretical accounts of current results. In addition to the arousal or dopamine hypothesis aforementioned, our results were also consistent with the “adaptive scaling of reward” theory that suggested the performance was dependent on the relative reward value rather than the absolute reward value (Bunzeck et al. 2010). That is, in our study, the images with a relatively high reward value (the reward images in the medium-reward trials with the relative value of ¥1) were better recognized than those with a relatively low reward value (the reward images in the high-reward trials and the nonreward images in the nonreward trials with the relative value of ¥0). However, there was a major difference between the study by Bunzeck et al. (2010) study and the current study. Whereas Bunzeck et al. (2010) focused on the effect of the magnitude difference between the expected reward and the actual reward feedback on memory, our study did not lead participants to have an anticipation about the reward before they received the feedback. Future studies should further test the “adaptive scaling of reward” hypothesis in the current task (e.g., to test whether there is the reward effect on the condition of two reward-predicting stimuli, with the cumulative reward magnitude was matched with that of the one-target trials). In conclusion, this study revealed an optimal effect of a medium reward in long-term incidental episodic memory. This result has two implications for reward and memory research. First, the current study supported dopaminergic memory consolidation. Second, the reward magnitude needs to be considered in efforts to optimize memory and learning.
  46 in total

Review 1.  Dopamine in motivational control: rewarding, aversive, and alerting.

Authors:  Ethan S Bromberg-Martin; Masayuki Matsumoto; Okihide Hikosaka
Journal:  Neuron       Date:  2010-12-09       Impact factor: 17.173

2.  Post-learning Hippocampal Dynamics Promote Preferential Retention of Rewarding Events.

Authors:  Matthias J Gruber; Maureen Ritchey; Shao-Fang Wang; Manoj K Doss; Charan Ranganath
Journal:  Neuron       Date:  2016-02-11       Impact factor: 17.173

3.  Emotion enhances remembrance of neutral events past.

Authors:  Adam K Anderson; Peter E Wais; John D E Gabrieli
Journal:  Proc Natl Acad Sci U S A       Date:  2006-01-24       Impact factor: 11.205

4.  From poor performance to success under stress: working memory, strategy selection, and mathematical problem solving under pressure.

Authors:  Sian L Beilock; Marci S Decaro
Journal:  J Exp Psychol Learn Mem Cogn       Date:  2007-11       Impact factor: 3.051

5.  Reward-enhanced memory in younger and older adults.

Authors:  Julia Spaniol; Cécile Schain; Holly J Bowen
Journal:  J Gerontol B Psychol Sci Soc Sci       Date:  2013-05-20       Impact factor: 4.077

Review 6.  Influence of reward motivation on human declarative memory.

Authors:  Ewa A Miendlarzewska; Daphne Bavelier; Sophie Schwartz
Journal:  Neurosci Biobehav Rev       Date:  2015-12-02       Impact factor: 8.989

Review 7.  The hippocampal-VTA loop: controlling the entry of information into long-term memory.

Authors:  John E Lisman; Anthony A Grace
Journal:  Neuron       Date:  2005-06-02       Impact factor: 17.173

8.  Out of control: diminished prefrontal activity coincides with impaired motor performance due to choking under pressure.

Authors:  Taraz G Lee; Scott T Grafton
Journal:  Neuroimage       Date:  2014-10-30       Impact factor: 6.556

9.  Dopamine and the cognitive downside of a promised bonus.

Authors:  Esther Aarts; Deanna L Wallace; Linh C Dang; William J Jagust; Roshan Cools; Mark D'Esposito
Journal:  Psychol Sci       Date:  2014-02-13

10.  Eyes wide open: enhanced pupil dilation when selectively studying important information.

Authors:  Robert Ariel; Alan D Castel
Journal:  Exp Brain Res       Date:  2013-10-27       Impact factor: 1.972

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.