Carolin Sievers1,2, Chris M Bird3, Louis Renoult2. 1. Department of Psychology, University of Cambridge, Cambridge CB2 3EB, United Kingdom. 2. School of Psychology, University of East Anglia, Norwich NR4 7T, United Kingdom. 3. School of Psychology, University of Sussex, Brighton BN1 9QH, United Kingdom.
Abstract
Repeated study typically improves episodic memory performance. Two different types of explanations of this phenomenon have been put forward: (1) reactivating the same representations strengthens and stabilizes memories, or (2) greater encoding variability benefits memory by promoting richer traces. The present experiment directly compared these predictions in a design with multiple repeated study episodes, allowing to dissociate memory for studied items and their context of study. Participants repeatedly encoded names of famous people four times, either in the same task, or in different tasks. During the test phase, an old/new judgment task was used to assess item memory, followed by a source memory judgment about the encoding task. Consistent with predictions from the encoding variability view, encoding stimulus in different contexts resulted in higher item memory. In contrast, consistent with the reactivation view, source memory performance was higher when participants encoded stimuli in the same task repeatedly. Taken together, our findings indicate that encoding variability benefits episodic memory, by increasing the number of items that are recalled. These benefits are however at the expenses of source recollection and memory for details, which are decreased, likely due to interference and generalization across contexts.
Repeated study typically improves episodic memory performance. Two different types of explanations of this phenomenon have been put forward: (1) reactivating the same representations strengthens and stabilizes memories, or (2) greater encoding variability benefits memory by promoting richer traces. The present experiment directly compared these predictions in a design with multiple repeated study episodes, allowing to dissociate memory for studied items and their context of study. Participants repeatedly encoded names of famous people four times, either in the same task, or in different tasks. During the test phase, an old/new judgment task was used to assess item memory, followed by a source memory judgment about the encoding task. Consistent with predictions from the encoding variability view, encoding stimulus in different contexts resulted in higher item memory. In contrast, consistent with the reactivation view, source memory performance was higher when participants encoded stimuli in the same task repeatedly. Taken together, our findings indicate that encoding variability benefits episodic memory, by increasing the number of items that are recalled. These benefits are however at the expenses of source recollection and memory for details, which are decreased, likely due to interference and generalization across contexts.
Behavioral studies have generally shown that stimulus repetition facilitates subsequent processing, as demonstrated by faster reaction times or increased accuracy for repeated as compared to nonrepeated stimuli (Henson 2003). In the domain of memory, encoding the same material repeatedly typically enhances episodic memory (Glenberg et al. 1977; Van Strien et al. 2005; Opitz 2010; see also Crowder 1976; Baddeley 1978), especially when encoding episodes are spaced rather than massed (Cepeda et al. 2008; Smith and Scarf 2017). Two contrasting views have been described in the literature, as to the best predictors of memory formation over multiple study episodes. The first one, the “reactivation view” stipulates that previously encoded episodes can serve as retrieval cues to reactivate and strengthen memories during repeated exposure, making memory representations more stable (Thios and D'Agostino 1976; Benjamin and Tullis 2010). In contrast, the “encoding variability view” posits that each stimulus presentation is encoded differently over time (due to “contextual drift,” Bower 1972), providing multiple traces of the same item (Hintzman 1986; Nadel and Moscovitch 1997) and thus a larger variety of retrieval cues (Martin 1968; Bower 1972), thereby resulting in improved episodic memory performance when encoding variability is increased. Recent functional neuroimaging studies have demonstrated that, consistent with the reactivation view, cortical activation patterns are generally more similar across multiple encoding presentations for subsequently remembered as compared to subsequently forgotten trials in various cortical regions (Xue et al. 2010, 2013; Ward et al. 2013). However, these studies used paradigms in which participants performed the same task repeatedly, thereby not providing an optimal test of the encoding variability view, as the benefits of variability may not be optimized in such conditions.Another line of research has investigated the effects of retroactive interference by presenting stimuli in different contexts, for example, different encoding tasks (e.g., Koen and Rugg 2016; Kim et al. 2018). Retroactive interference is generally measured by using an AB–AC paradigm (Postman and Underwood 1973), where a stimulus A is first presented in a context B, followed by presenting A in another, interfering context C. Consistent with the reactivation view, these paradigms typically report worse context memory compared to stimuli that were presented only once, in a single context (McGovern 1964; Anderson and Neely 1996; Hupbach et al. 2007; Kim et al. 2012; Kim et al. 2018). When the same stimulus is repeated in a different context, it is thought to reactivate the memory associated with the first context (Hintzman 2004, 2010) and integrate the novel context in order to generalize across the two contexts (Shohamy and Wagner 2008; Zeithamova and Preston 2010; Schlichting et al. 2014; Schlichting and Preston 2015; Richter et al. 2016). This generalization, facilitated by reactivation, then weakens subsequent context memory. These observations are in line with the competition trace theory (Yassa and Reagh 2013), which suggests that repetition improves item memory or familiarity at the cost of episodic details, such as context memory, as multiple exposures would result in a competition of nonoverlapping features (i.e., contextual details) of the memories. Similarly, the context binding theory predicts that being in a stable context during encoding enhances the likelihood of episodic recollection (Yonelinas et al. 2019).While a number of studies have compared the respective benefits of encoding stimuli repeatedly in the same versus different tasks (Bird et al. 1978; Hunt and Einstein 1981; Young and Bellezza 1982; Huff and Bodner 2014), these studies have produced mixed findings and typically did not differentiate item memory (and stimulus familiarity) versus recollection (and retrieval of qualitative aspects of study). It is thus possible that simple reactivation, using the same encoding task repeatedly, and encoding variability in different encoding tasks, would differently impact memory for items themselves and for their contexts of study. Indeed, as previously mentioned, theories like the competition trace theory (Yassa and Reagh 2013; see also Kim et al. 2012) would predict that encoding variability in multiple tasks would likely impair memory for contextual details, due to interference between the various contexts. In addition, as noted by Huff and Bodner (2014), the vast majority of previous studies comparing encoding processes in the same versus different tasks only included two study blocks, and thus it is possible that some of the benefits of encoding variability, and/or any interference effects, may only emerge with more studies opportunities and the use of a greater varieties of encoding contexts (Kim et al. 2012).The present experiment was designed to circumvent these limitations and compare the respective benefits of encoding variability and simple reactivation over subsequent memory (item and source memory). We note here that, even though we contrast these two types of processes in the present paradigm and compare their respective benefits, they are not necessarily mutually exclusive views and could combine their effects in a situation-dependent manner. Here we were interested to compare the effects of encoding variability and reactivation on item versus source memory. Names of famous people were presented four times during a study phase. Half of the stimuli were repeated under the same encoding instructions, while the other half was repeated across four different encoding instructions (“Is this person female?,” “Is this person currently active in show business?,” “Is this person British?” and “Do you like this person?”). At test, participants performed an old/new judgment task, assessing item memory, followed by a source memory question probing participants’ memory for the encoding task they performed during the study phase. Based on previous studies on retroactive interference, source memory was predicted to be worse for the different compared to the same encoding task condition, due to generalization across contexts. In other words, consistent with competitive trace theory (Yassa and Reagh 2013) and the reactivation view (e.g., Thios and D'Agostino 1976), correct source memory (and recollection) performance should be higher when participants encode stimuli repeatedly in the same encoding task. In contrast, consistent with the encoding variability view (Huff and Bodner 2014), more variable encoding conditions in the form of being exposed to the same stimuli in different tasks should provide a greater variety of retrieval traces and increase item memory. Results were published in a preprint format (Sievers and Renoult 2019).
Results
The repeated-measures design included the three following factors: memory performance (hits+, hits−, misses), encoding context (same vs. different encoding task) and repetition (presentation 1,2,3,4 for analyses of reaction times at study). Because of a lack of low confidence responses, confidence was not included in further analyses. Participants’ responses to the item memory question indicated that they responded much more often with high (MHC = 96 ± 8%) than with low confidence (MLC = 4 ± 8%), t(20) = 26.912, P < 0.001. Hits+ trials included all correct item and correct source judgments irrespective of confidence ratings. Hits− trials were characterized as old/new hits, irrespective of confidence ratings, followed by an incorrect source memory response or no response, indicating the source could not be retrieved.
Study phase
Reaction times during the study phase (displayed in Fig. 1) were analyzed in a 3 × 2 × 4 repeated-measures ANOVA with the factors memory performance (hits+, hits−, misses), encoding context (same, different), and presentation (1, 2, 3, 4). The ANOVA revealed main effects of encoding context, F(1,37) = 142.339, P < 0.001, η2 = 0.807, and presentation, F(3,111) = 18.752, P < 0.001, η2 = 0.355. No statistically significant interactions involved subsequent memory performance. However, there was an interaction between encoding context and presentation, F(3,111) = 10.905, P < 0.001, η2 = 0.243. Further analyses revealed that the effect of presentation was significant in the same encoding task condition, F(3,111) = 65.098, P < 0.001, η2 = 0.65, but not in the different task condition, F(3,111) = 2.204, P = 0.108, η2 = 0.058. Reaction times (RTs) under the same task condition were best fit by a quadratic distribution (decreasing rapidly from first to second presentation) and then more slowly for subsequent presentations, F(1,37) = 82.932, P < 0.001, η2 = 0.703.
Figure 1.
Mean reaction times (in seconds) for all four presentations during the study phase, separately for subsequent memory performance (hits+, hits−, misses) and encoding contexts (same vs. different tasks). Error bars denote standard error.
Mean reaction times (in seconds) for all four presentations during the study phase, separately for subsequent memory performance (hits+, hits−, misses) and encoding contexts (same vs. different tasks). Error bars denote standard error.
Test phase
Discriminability analysis
Discriminability scores (d′) were calculated based on the frequencies of hits and false alarms (FAs). The normalized probabilities of overall hits and FAs were compared in a paired-samples t-test. The t-test showed that participants’ performance in the recognition memory task was statistically significantly above chance, t(37) = 22.529, P < 0.001. Mean and standard deviations of d′ scores and percentages of hits and FAs are illustrated in Table 1. Those individual d′ scores indicate that recognition memory performance was higher in the different encoding task condition than on the same task condition, driven by a higher hit rate in the former task, t(37) = 7.030, P < 0.001.
Table 1.
Mean d′ scores and mean percentages of Hits and FAs with standard deviations (in parentheses) for overall memory performance and across the two encoding conditions
Mean d′ scores and mean percentages of Hits and FAs with standard deviations (in parentheses) for overall memory performance and across the two encoding conditionsFalse alarm responses were analyzed with respect to confidence judgments and source memory responses. As presented in Table 1, on average 6 ± 4% of new items were incorrectly identified as old (FA). Participants made more high than low confidence FA responses, t(37) = 6.645, P < 0.001.To identify whether participants were biased toward giving a particular task response when making incorrect item and source judgments, frequencies of source responses for FAs were also analyzed in a one-way ANOVA with six levels (“all four tasks,” “gender task,” “show business task,” “British task,” “like task,” “don't know”). The ANOVA revealed a main effect of task response, F(5,222) = 20.585, P < 0.001. Results showed that about half of FAs (51 ± 31%) corresponded to “I don't know” responses, which was more likely than any other source response, P < 0.001. Specific task source responses were less likely for FAs and did not differ from each other in terms of frequencies, except for the show business task (4 ± 8%), which was less often selected than the gender task (13 ± 21%, P = 0.029) and the British task (12 ± 17%, P = 0.014).
Response frequencies
Mean percentages of recognition performance in the two encoding task conditions are displayed in Figure 2. The majority of responses resulted in hits− judgments (correct item memory, incorrect source memory), fewer responses resulted in hits+ judgments (correct item and correct source memory) and the least responses resulted in misses. Encoding under the same task condition was associated with more hits+ judgments (40 ± 17%), than the different encoding task condition (28 ± 16%), t(37) = −3.548, P < 0.001. In contrast, the same encoding task condition was associated with a smaller number of correct hits− judgments (42 ± 11%), than the different encoding task condition (63 ± 16%), t(37) = 5.787, P < 0.001. Finally, the different encoding task condition produced nearly half of the number of misses (9 ± 9%), as compared to the same encoding task condition (17 ± 13%), t(37) = −7.030, P < 0.001.
Figure 2.
Recognition performance. Mean percentages of the three levels of memory performance (hits+, hits−, misses) as a function of encoding context (same vs. different tasks). Error bars denote standard errors.
Recognition performance. Mean percentages of the three levels of memory performance (hits+, hits−, misses) as a function of encoding context (same vs. different tasks). Error bars denote standard errors.In a follow-up analysis, we analyzed frequencies of correct item and source memory judgments in the same task condition with respect to the four different tasks (i.e., Is this person female?,” “Is this person currently active in show business?,” “Is this person British?,” and “Do you like this person?”) that were repeatedly performed, in two separate repeated-measures ANOVAs. Mean percentages of correct item and source memory responses across the tasks are displayed in Figure 3. Both, correct item and correct source memory judgments differed statistically significantly across the four tasks, F(3,111) = 9.054, P < 0.001, F(3,111) = 11.389, P < 0.001, respectively.
Figure 3.
Mean percentages of correct item (A) and correct source memory (B) responses in the same task condition across the four encoding tasks. Error bars denote standard errors.
Mean percentages of correct item (A) and correct source memory (B) responses in the same task condition across the four encoding tasks. Error bars denote standard errors.Simple effects analyses revealed that participants made fewer correct item memory judgments when famous names were presented in the gender task compared to the British and the like task (P < 0.003) and more correct item memory judgments when stimuli were encoded in the like task compared to the other three tasks (P < 0.046). Similarly, fewer correct source judgments were made when stimuli were presented in the gender task compared to the other tasks (P < 0.013) and more correct source judgments were made in the like task compared to all other three (P < 0.008).
Incorrect source responses
Same encoding task
Frequencies of incorrect source responses were first analyzed in the same encoding task condition (means and standard errors are displayed in Fig. 4) to examine whether (1) participants were biased toward a particular task response when making incorrect source memory judgments and (2) whether they were more likely to select any one of the single tasks or the “all four tasks” response. A one-way ANOVA revealed a main effect of task response, F(4,185) = 8.065, P < 0.001. Simple effects analyses showed that “all four tasks” source errors were more often made than single task responses, all P < 0.004, while no difference in frequencies of responses was statistically significant between the single tasks.
Figure 4.
Mean percentages of incorrect source responses (as expressed in percentages of all source responses) in the same encoding task condition across the four single task response options, “Gender task,” “Show business task,” “British task,” and “Like task” and “all four tasks.” Error bars denote standard errors.
Mean percentages of incorrect source responses (as expressed in percentages of all source responses) in the same encoding task condition across the four single task response options, “Gender task,” “Show business task,” “British task,” and “Like task” and “all four tasks.” Error bars denote standard errors.To test whether participants were more likely to select any one of the single tasks or the “all four tasks” response, frequencies of the sum of incorrect single task responses was compared to frequencies of the “all four tasks” response using a paired t-test, revealing that participants were more likely to select any one of the single tasks than the “all four tasks” response when making incorrect source judgments in the same task condition, t(37) = 5.836, P < 0.001.
Different encoding task
Frequencies of incorrect source responses in the different encoding task condition were analyzed to identify: (1) whether participants were biased toward giving a particular task response when making a wrong source judgment (i.e., when they failed to respond that an item had been studied in “all four tasks”); (2) whether there was a link between wrong source judgments and the nature of the task that participants had performed first (i.e., at the first presentation: primacy effect) or last (i.e., at the fourth presentation: recency effect). Response frequencies to the four single task responses are displayed in Figure 5 along with frequencies of primacy and recency responses.
Figure 5.
Mean percentages of incorrect source responses (as expressed in percentages of all source responses) in the different encoding task condition across the four single task response options, “Gender task,” “Show business task,” “British task,” and “Like task” and percentage of primacy and recency responses based on the first and last task that was performed, respectively. Error bars denote standard errors.
Mean percentages of incorrect source responses (as expressed in percentages of all source responses) in the different encoding task condition across the four single task response options, “Gender task,” “Show business task,” “British task,” and “Like task” and percentage of primacy and recency responses based on the first and last task that was performed, respectively. Error bars denote standard errors.
Effect of type of task
A one-way ANOVA revealed differences between the four single tasks with a main effect of task, F(3,111) = 12.603, P < 0.001. Simple effects analyses showed that participants were less likely to select the gender task compared to the other three and more likely to select the like task than the other three, all P < 0.01.
Effect of recency
A paired t-test revealed that participants were more likely to give a recency response than a primacy response, t(37) = 4.342, P < 0.001.
Reaction times at test
RTs measures during the test phase were analyzed in two separate repeated-measures ANOVAs. First, RTs to the item memory (old/new) judgment were examined in a 3 × 2 ANOVA with the factors memory performance (hits+, hits−, misses) and encoding task condition (same, different). In a second analysis, RTs to the source memory task were analyzed. In this 2 × 2 repeated-measures ANOVA, the factors were memory performance (only two levels, as misses were not followed up with a source memory question) and encoding task condition.The ANOVA analyzing item memory RTs at test revealed a main effect of encoding context, F(1,37) = 10.12, P = 0.003, η2 = 0.229. Item memory responses were made faster to items previously encoded under the different encoding task condition (964 ± 259 msec), compared to the same encoding task condition (1026 ± 231 msec; see Fig. 6). No main effect of memory performance on RTs was found for the item memory responses, F(2,74) = 2.299, P = 0.132, η2 = 0.063, nor any interaction with encoding context, F(2,74) = 1.032, P = 0.333, η2 = 0.029. However, simple effects analyses indicated that hits+ judgments to the old/new question were made faster than hits− judgments, P < 0.001.
Figure 6.
Mean reaction times (in seconds) for item memory judgments as a function of the encoding context (same vs. different task). Error bars denote standard errors.
Mean reaction times (in seconds) for item memory judgments as a function of the encoding context (same vs. different task). Error bars denote standard errors.The ANOVA analyzing RTs to the source memory question revealed main effects of source memory performance, F(1,37) = 30.207, P < 0.001, η2 = 0.449, and encoding context, F(1,37) = 10.958, P = 0.002, η2 = 0.228. Correct source responses (3170 ± 159 msec), were given faster than incorrect source responses (3271 ± 137 msec) and, similar to item memory, source memory responses were made faster to items previously encoded under the different encoding task condition (3195 ± 153 msec), compared to the same encoding task condition (3245 ± 143 msec). This difference appeared more pronounced for hits+ than hits− (see Fig. 7), but the interaction between encoding context and memory performance failed to reach standard level of significance F(1,37) = 3.304, P = 0.077, η2 = 0.082.
Figure 7.
Mean reaction times (in seconds) for source memory judgments (hits+ and hits−) as a function of the encoding context (same vs. different task). Error bars denote standard errors.
Mean reaction times (in seconds) for source memory judgments (hits+ and hits−) as a function of the encoding context (same vs. different task). Error bars denote standard errors.
Discussion
The present experiment investigated the best predictors of memory formation over multiple study episodes. Participants repeatedly encoded names of famous people four times, either in the same task (optimal encoding for a reactivation view), or in different tasks (optimal encoding for an encoding variability view). During the test phase, an old/new judgment task was used to assess item memory, followed by a source memory judgment about the encoding task. Consistent with the reactivation view (e.g., Thios and D'Agostino 1976) and with the competitive trace theory (Yassa and Reagh 2013), it was proposed that same task encoding would be associated with the reactivation of the same item and contextual cues across repetitions, leading to superior source memory for the encoding context. In the different task condition, however, source memory performance was expected to be lower, because of those same reactivation processes leading to interference (conflicting contextual information). Results generally supported these predictions. In addition, and consistent with the encoding variability view (e.g., Martin 1968; Bower 1972), item memory was substantially higher when participants encode stimuli in different tasks. Taken together, these results illustrate complementary benefits of reactivation versus encoding variability on episodic memory formation.At study, there were significant effects of stimulus repetition in the same task condition: RTs decreased rapidly between the first and second presentation and more slowly afterwards (following a quadratic distribution), as commonly found in semantic categorization tasks using multiple stimulus presentations (e.g., Renoult et al. 2012). In the different task condition, even though the same stimuli were also presented four times, the processing of these stimuli in different contexts appears to have canceled the facilitatory effects of repetition, consistent with findings from previous studies reporting an absence of repetition priming when words are classified on different tasks (Ratcliff et al. 1985; Vriezen and Moscovitch 1990). The fact that effects of repetition were not significant in this condition suggests that stimuli maintained a certain degree of novelty when processed in different contexts, despite being repeatedly presented.At test, the discriminability index and overall hit scores were higher for the different encoding than the same encoding condition, supporting higher benefits of encoding variability (Martin 1968; Bower 1972). However, looking at memory performance in more detail revealed a more complex picture. Encoding items repeatedly in the same task resulted in higher source memory (hits+) but worse item memory (hits−). In contrast, when participants studied items repeatedly in different tasks, they had lower source memory performance (28% vs. 40%) but much higher item memory (63% vs. 42%).Analyses of reactions times at test confirmed the overall benefits of encoding stimuli in different tasks: reaction times to item and source judgments were systematically faster (across memory conditions) than for stimuli that had been encoded in the same task repeatedly.Despite these distinct task effects on memory performance, detailed analyses of source responses showed that very similar processes appeared to be in play in the two tasks. In the same task condition, the like task was associated with the highest source memory performance and the gender task with the worst performance. The benefits of the subjective judgments of likeness are likely related to a self-reference effect (Symons and Johnson 1997; Sui and Humphreys 2015), while the relatively lower performance in the gender judgment task may reflect that this task is a shallow type of judgment for famous names (Craik 2002). In the different encoding task, even though the correct source response was that participants studied the famous names in all four tasks, analyses of frequencies of incorrect source responses revealed a highly similar pattern to the same task condition: participants were less likely to select the gender task compared to the other three tasks and more likely to select the like task than the other three. Additionally, participants made more recency-based errors than primacy-based errors, that is, they were more likely to select the last task they performed (16% of responses) than the first task they performed (8%).These results add to the existing body of research from retroactive interference paradigms (Anderson and Neely 1996; Hupbach et al. 2007; Kim et al. 2012, 2018) indicating that stimulus occurrence in multiple contexts may cause interference, resulting in higher levels of generalization at the cost of contextual source information. However, we show that this decrease in recollection is accompanied by an important increase in item memory when stimuli are encoded in differing contexts, suggesting that encoding variability is associated with better item memory (Bower 1972; Hintzman 1986). One could argue that the lower item memory performance in the same encoding task condition could be due to participants switching off their attentional resources during repeated encoding. While these trials were less novel for the participants (with clear repetition priming effects on RTs, see above), we think that this possibility is not likely as (1) participants were not aware of the forthcoming recognition memory test (and that some stimuli would be presented in the same task, while others would be presented in multiple tasks), so there would be no reason for them to adopt a strategy to attempt to remember the task or to omit to do so, (2) it would be unclear how such low attentional levels during encoding would have resulted in superior source memory performance in this same task.Note that the increase in source memory performance in the same task condition is unlikely to be due to a lower difficulty of this experimental condition. Indeed, remembering that one has studied a particular item in one specific task only (out of four possibilities) is arguably more complex or difficult, than deciding that it was studied in all four tasks. “All four tasks” was the correct answer for 50% of the old items, whereas individual tasks (“gender task,” “show business task,” “British task,” or “like task”) were the correct answer for only 12.5% (1/8) of the old items. This is concordant with reaction time data showing faster responses for source judgments performed after encoding stimuli in different encoding contexts as compared to the same encoding context. Analyses of source errors in the same encoding task condition revealed that “all four tasks” source responses were more common than any of the individual single tasks. However, participants did not appear to be biased to select “all four tasks,” as this incorrect source response was still selected less often by participants than single task responses. Taken together, this dissociation in item and source memory performance (and the superior source memory performance in the same task condition) is unlikely to be due to the use of four different tasks in our paradigm. Indeed, as participants had to remember the single task in which they studied the items in the same task condition, the use of additional tasks would increase rather than decrease the complexity of source judgments. At the same time, using four encoding tasks allowed us to overcome the limitations of encoding variability paradigms that typically only include two study blocks (Huff and Bodner 2014).Our findings indicate that context variability is beneficial to episodic memory, by increasing the number of items that are recalled. This could be due to: (1) an increased saliency of each presentation at encoding (as indicated by the abolition of repetition priming effects at study) potentially increasing novelty-encoding strategies (Tulving and Kroll 1995; Tulving et al. 1996; van Kesteren et al. 2012), and (2) an integration and generalization across contexts, promoting the creation of multiple traces, resulting in a larger variety of retrieval cues and thereby enhancing item memory performance (Martin 1968; Bower 1972; Hintzman 1986; Nadel and Moscovitch 1997). Note that this interpretation is compatible with the notion that processing the stimuli in different tasks might lead to deeper encoding processes (Craik 2002), as compared to repeated encoding in the same task, which may involve learning of stimulus-response associations and less elaborate processing at each presentation (Jacoby 1978; Henson et al. 2014). The benefits of context variability are however at the expenses of source recollection, which is decreased, likely due to interference of nonoverlapping features and generalization across contexts (Yassa and Reagh 2013). Interestingly, Reagh and Yassa (2014) have recently reported that, even though recognition memory performance was improved for stimuli that were studied multiple times, it also resulted in impaired discrimination of similar lures, as compared to stimuli presented only once. Similar to the findings of the present study, the authors stipulated that this could be due to a tradeoff between gist (item memory in our case) and memory for details (context of the task in our case). Similarly, Opitz (2010) reported that studying picture stimuli in different tasks resulted in greater contribution of familiarity processes at retrieval (increased rate of “know” responses), as compared to studying items in the same task repeatedly. Our results therefore extend these observations, and show that when the same context is maintained across repetitions, memory for contextual details is in fact improved, compared to when context is varied across encoding episodes.One has to note however, that in source memory paradigms, incorrect source judgments do not necessarily mean that no relevant details of the encoding episode can be remembered. Participants may still be able to remember details from the encoding episode which were not assessed by the source memory task, and recollection of noncriterial episodic information may occur during incorrect source memory judgments (Yonelinas and Jacoby 1996; Mulligan and Hirshman 1997). However, it is reasonable to assume that incorrect source memory judgments rely more on familiarity processes than correct judgments (Squire et al. 2007). Nonetheless, it would be important to replicate these findings using another measure of recollection, such as a Remember-Know judgment (Tulving 1985).AB–AC interference paradigms often include a baseline condition were items are presented only once (e.g., Henson et al. 2002). In our paradigm, a condition with no repetition would not allow a direct comparison of source memory performance with the different tasks condition, as by definition it requires two presentations of each item. Such baseline condition could still be used to compare item memory and test whether, as observed with AB–AC paradigms, repeated presentations in interference conditions can make performance actually worse than after a single presentation (e.g., Henson et al. 2002). However, in our case, the lowest item memory was observed in the same encoding condition (mean hit rate: 83%), that is in the condition that is most typically used in studies including multiple repetitions, and that does not include any type of interference (participants essentially study the same items repeatedly in the same task). Results of studies that have compared the respective effects of single versus multiple presentations on memory performance suggest that repeated encoding results in increased memory performance compared to a single presentation. For example, in a recent study in our laboratory where famous faces were presented either one or four times in a dead or alive judgment task, single presentation was clearly associated with a lower hit rate (82%) than the four presentations condition (96%; Lambert et al. 2019). These results obtained with 60 famous faces (compared to 288 here) illustrate that such designs with multiple presentations can produce ceiling effects in memory performance. In a similar study using pictures stimuli, Opitz (2010) also observed higher hit rate for items presented three times as compared to items presented only once. Similar results are obtained in continuous recognition paradigms in which recognition performance increases progressively from the first as compared to the second presentations and from the second as compared to the third, until a plateau in performance is reached (Van Strien et al. 2005). In the context of the present experiment, we were particularly interested in comparing two types of encoding conditions rather than determining the effects of repetition per se. Nonetheless, it would be interesting to conduct follow-up studies contrasting various levels of repetitions. For instance, future studies could be conducted where some stimuli are presented two, three or four times, to investigate whether the respective benefits of multiple encoding tasks on item memory and of same task on source memory (and respective detriments in the other condition) are graded or even linear across repetitions.The stimuli that we have used (names of famous people) are typically associated with a web of semantic (e.g., Pistono et al. 2019) and episodic (e.g., Renoult et al. 2015) associations. For instance, famous names that easily bring to mind episodic memories are associated with superior performance on tests of semantic and episodic memory, as compared to equally famous names that do not bring such recollection to mind (Westmacott and Moscovitch 2003; Westmacott et al. 2004; Renoult et al. 2015). Interestingly, in some of these studies on the effects of autobiographical significance, the famous names were presented repeatedly in four different tasks (Westmacott and Moscovitch 2003; Westmacott et al. 2004), or in the same task repeatedly (Renoult et al. 2015). Even though the magnitude of the effects of autobiographical significance for same versus different task encoding have not been directly compared in the same study, the presence of robust effects in both conditions suggest that encoding stimuli in different tasks is not necessarily detrimental to episodic recollection processes, as long as relevant contextual information is not reactivated together with other, potentially conflicting, contextual details.Our findings can be taken to support benefits of both encoding variability, by increased item memory, but also of reactivation view, via increased source memory. While in certain situations, such as academic study, maximizing encoding variability would certainly show clear benefits via an increased number of concepts remembered (e.g., I remember the concept of “ecphory” that I studied in four different lectures), in other real-life and more personal contexts, one may often benefit from remembering source information (e.g., Endel explained to me four times what “ecphory” meant, I'd better acknowledge that I remember his efforts next time we meet).In future studies, it would be interesting to extend these findings by exploring the consequences of context variability on repeated testing (rather than study) and investigating whether the same respective benefits of reactivation versus variability are observed.
Materials and Methods
Participants
Thirty-eight right-handed adult volunteers (23 females) participated in the experiment. Participants were aged 18 to 36 yr (Mage = 22 ± 4) and had completed an average of 15 ± 2 yr of education. Exclusion criteria consisted of any neurological or medical conditions known to compromise brain function, and active substance abuse. All participants had normal or corrected-to-normal vision, were English native speakers, and were right-handed. The study received ethics approval from the Research Ethics Committee of the School of Psychology at the University of East Anglia.
Materials
Stimuli were a total of 288 (written) names of famous people (e.g., Keith Richards, Michelle Obama). They were selected from 350 famous names based on data from a previous study (Renoult et al. 2015) that was updated for a UK population (by removing any unknown Canadian name and adding names of local celebrities). A group of 13 participants (aged between 18 and 36 yr) who did not participate in the main experiment, filled in an online questionnaire and reported any unknown celebrities. Any name that was rated as unknown by more than two participants was removed. The remaining 288 famous names were matched across all tasks and conditions in accordance with the four encoding tasks (gender; currently active in show business or not; British or not). All stimuli were presented as white written words in Courier New 36 font on a black background.
Task and procedure
In the incidental encoding phase, participants performed four different categorization tasks on the names they were presented with. At the beginning of each block, they were presented with a question they had to answer with regards to the stimuli. The four questions were “Is this person female?,” “Is this person currently active in show business?,” “Is this person British?” and “Do you like this person?”. Task order was pseudorandomized across participants. Participants were encouraged to guess the answer in cases where they were not familiar with the famous name or when they did not know the answer. They were instructed to press one of two buttons corresponding to whether their answer to the question was “yes” or “no.” Stimuli were presented for 1000 msec, followed by a fixation cross of random duration (800–1200 msec) indicating the beginning of the next trial.During the encoding phase, each stimulus was presented four times. Half of the stimuli were presented once in each of the four tasks (different task condition), the other half was presented repeatedly within only one of the four encoding tasks (same task condition). Participants were made aware at the beginning of the task that stimuli may be repeated, but no reference was made in the instructions to the different encoding conditions. The four repetitions per stimulus resulted in a total of 576 encoding trials. The experimental procedure is illustrated in Figure 8. At the end of the encoding phase, participants performed a trail-making distractor task.
Figure 8.
Same versus multiple encoding tasks paradigm. The paradigm included four encoding presentations of each stimulus during the study phase; participants encoded half of the stimuli in a different task condition, that is, performing a different task at each presentation of the stimulus, the other half were encoded in a same task condition, that is, participants repeatedly performed the same encoding task. In the test phase, participants made old/new judgments followed by source judgments.
Same versus multiple encoding tasks paradigm. The paradigm included four encoding presentations of each stimulus during the study phase; participants encoded half of the stimuli in a different task condition, that is, performing a different task at each presentation of the stimulus, the other half were encoded in a same task condition, that is, participants repeatedly performed the same encoding task. In the test phase, participants made old/new judgments followed by source judgments.During the test phase, participants performed an unexpected recognition-source memory test, that is, they did not know their memory was tested for famous names and associated contexts, in which they were encoded. In this task, all old stimuli from the encoding phase were presented along with the remaining set of new stimuli. Both lists were matched in terms of gender, whether famous people were currently active in show business or not and whether they were British or not. Participants were cued with a name and instructed to indicate whether this stimulus had been presented during the encoding phase, by pressing one of eight buttons on the response pad corresponding to the following responses: “definitely old,” “perhaps old,” “perhaps new,” and “definitely new.” “Old” responses were followed by a source memory question asking participants in which task the famous name had been categorized previously with the response options “all four tasks,” “gender task,” “show business task,” “British task,” “like task,” and “I don't know.” Stimuli were presented for 1500 msec, followed by a fixation cross for 1000 msec. Depending on participants’ old/new response, either a fixation cross appeared for 1500 msec or the source memory question appeared for 1500 msec. Another fixation cross of random duration (800–1200 msec) then indicated the beginning of the next trial.