Literature DB >> 26179230

A signal detection theory analysis of behavioral pattern separation paradigms.

Rita E Loiotile1, Susan M Courtney2.   

Abstract

Behavioral pattern separation (BPS) paradigms ask participants to discriminate previously encoded (old) stimuli from highly similar (lure) and categorically distinct (novel) stimuli. The lure-old discrimination, thought to uniquely reflect pattern separation in the hippocampal formation, is typically pitted against the traditional novel-old discrimination. However, BPS paradigms have measured lure-old discrimination neither consistently across studies nor in such a way that allows for accurate comparison to novel-old discrimination. Therefore, we advocate for signal detection theory (SDT) as a unified framework. Moreover, we compare SDT with previously used measures of lure-old discrimination, indicating how other formulas' inaccuracies can lead to erroneous conclusions.
© 2015 Loiotile and Courtney; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Mesh:

Year:  2015        PMID: 26179230      PMCID: PMC4509922          DOI: 10.1101/lm.038141.115

Source DB:  PubMed          Journal:  Learn Mem        ISSN: 1072-0502            Impact factor:   2.460


Tasks measuring behavioral pattern separation (BPS) are distinguished from standard tests of memory recognition by the inclusion of stimuli that are highly similar to those previously encoded. Because each of these “lure” stimuli differs only slightly from its corresponding previously encoded “old” stimulus, successful discrimination of old from lure stimuli is thought to denote particularly detailed memory representations of the old stimuli. In contrast, it has been suggested that successful discrimination of old from “novel” stimuli (i.e., categorically new, as commonly used in tests of recognition) need not rely on detailed representations of the old stimuli, but rather can be achieved even if the old stimuli have only gist-based representations. Consistent with this distinction, several studies imply a dissociation between novel-old and lure-old discrimination. For example, using the BPS paradigm, researchers have argued that only detail-based lure-old discrimination is impaired in healthy aging (Stark et al. 2013) and enhanced by post-study caffeine (Borota et al. 2014). Claims from a more recent study (Reagh and Yassa 2014) support an even stronger, double dissociation: repeated-encoding benefits gist-based, novel-old discrimination (a finding known even to Ebbinghaus (1885)), but surprisingly impairs detail-based, lure-old discrimination. One major concern—particularly with findings such as these, where there appear to be differences between novel-old and lure-old discrimination—is that the measures used for lure-old discrimination vary from each other and from those used for novel-old discrimination. A unified measurement approach is needed if one is to reliably characterize lure-old discrimination as well as properly compare it to novel-old discrimination. Here, we propose the use of signal detection theory (SDT) (Green and Swets 1966) for measuring lure-old discrimination. We reran the aforementioned BPS paradigm (Reagh and Yassa 2014), which tested effects of repeated encoding on both detail and gist-based memory representations. We demonstrate not only that SDT is an appropriate framework for measuring lure-old discrimination but that SDT formulas yield opposite results to those reported: lure-old discrimination, like novel-old discrimination, gets better with repeated encoding. Additionally, since the original, repeated-encoding study used a unique formula for calculating lure-old discrimination, we discuss potential failings of other formulas more commonly used for calculating lure-old discrimination in BPS tasks. Our task followed a very similar structure to the original. The twenty-two healthy volunteers (17 female; age: mean = 20.1, standard deviation = 2.4) provided written consent under a protocol approved by the Institutional Review Board of Johns Hopkins University. Participants incidentally encoded 450 object stimuli (from http://darwin.bio.uci.edu/~cestark/BPSO/bpso.html and http://web.mit.edu/tkonkle/www/stimuli.html), while performing an “indoor” or “outdoor” judgment task during their 2.5-sec presentation time (each with a 0.5-sec interstimulus interval). Of these encoded stimuli, half were presented only once (1-repeat) while the other half were presented three times, nonconsecutively (3-repeat). Repeated stimuli were presented randomly throughout the encoding session so that the lag between presentations differed substantially across items. Participants were then given a surprise, untimed recognition task consisting of 150 old stimuli (half 1-repeat, half 3-repeat), 150 lure stimuli (half similar to untested 1-repeat old stimuli, half similar to untested 3-repeat old stimuli), and 150 novel stimuli. This created five test stimulus types: 1-repeat old, 3-repeat old, 1-repeat lure, 3-repeat lure, and novel. (Because lures were never shown during encoding, the term “repeat,” as in 1-repeat lure, refers to the number of times the corresponding old stimulus was shown during encoding.) Consistent with the initial study, participants first provided an “old” or “new” recognition judgment and, unique to our study, subsequently provided a confidence judgment of “very sure,” “somewhat sure,” or “not at all sure.” Participants were instructed to respond “new” to any stimulus that had not been previously seen during encoding—i.e., lure and novel stimuli. Across subjects, mean (and standard deviation of) response frequencies for each stimulus type are reported in Table 1.
Table 1.

Means and standard deviations of response frequencies per stimulus type

Means and standard deviations of response frequencies per stimulus type First, we sought to replicate results from the original study using their formulas. For novel-old discrimination, we used SDT's d′, defined as the difference between the normalized proportion of old stimuli judged as “old” and the normalized proportion of novel stimuli judged as “old”—i.e., d′ = z(“old”|old) − z(“old”|novel). We found that repetition enhanced novel-old discrimination, t(21) = 18.68, P < 0.001, Cohen's d = 3.98. For lure-old discrimination, we subtracted proportion of old stimuli misses, pooled across repeat conditions, from lure correct rejections—i.e., 1-repeat lure-old discrimination = p(“new”|lure)1-repeat−p(“new”|old)pooled, and 3-repeat lure-old discrimination = p(“new”|lure)3-repeat − p(“new”|old)pooled. Using these formulas, we replicated a novel result of the original experiment: lure-old discrimination appeared to be significantly better for stimuli encoded once, rather than three times, t(21) = 3.22, P < 0.005, Cohen's d = 0.69. Taken together, these results suggest that repeated exposure to a stimulus enhances the stimulus's gist representation and, conversely, degrades the stimulus's detailed representation (Reagh and Yassa 2014). Next, we examined whether it was appropriate to analyze our data using SDT. Because SDT assumes that each stimulus type's probability density function (PDF), with respect to memory recognition, forms a normal distribution, it predicts that plots of zROC—i.e., the receiver operating characteristic in z-space—will form straight lines (Macmillan and Creelman 2005). Therefore, for each subject we plotted zROCs for lure-old discrimination—i.e., 1-repeat lure versus 1-repeat old, 3-repeat lure versus 3-repeat old—and for novel-old discrimination—i.e., novel versus 1-repeat old, novel versus 3-repeat old—from which we could individually test linear fit and hence the assumption of normal PDFs. Our analysis indicated that all zROCs were linear, with no systematic deviations, consistent with SDT's assumption of normal PDFs. Mean (and standard deviation) coefficient of determination (R2) values are as follows: lure-old 1-repeat = 0.94 (0.08), lure-old 3-repeat = 0.88 (0.12), novel-old 1-repeat = 0.85 (0.18), novel-old 3-repeat = 0.74 (0.24). On average, the assumptions of SDT fit lure-old discrimination just as well, if not better, than novel-old discrimination. This is worth noting as SDT has been the preferred framework for calculating novel-old memory sensitivity, independently of response-criteria, since its seminal introduction by Green and Swets (1966). Therefore, it seems more than reasonable to use SDT for calculating lure-old discrimination. Moreover, SDT has been used to measure lure-old discrimination in non-BPS paradigms (e.g., see Wixted and Stretch 2000). We subsequently measured both lure-old and novel-old discrimination using SDT. We chose ; Simpson and Fitter 1973) as an appropriate metric for discriminative sensitivity over the more commonly used d′. d′ assumes equal variances of the normal PDFs, an assumption that empirical data often fails to validate (Macmillan and Creelman 2005; see Dougal and Rotello 2007 for an example of d′ leading to a different, erroneous, conclusion from da). Therefore, da, which accounts for both the distance of the PDFs as well as their relative variances, is a more accurate measure of relative overlap between two stimulus type PDFs and hence of a participant's ability to discriminate between those two stimulus types. (Note that d′ = da when PDF variances are equal.) Using slopes and intercepts of the best-fit lines from our zROC graphs for each participant, we were able to calculate da as a measure of novel-old and lure-old discrimination for both 1-repeat and 3-repeat conditions. For example, 1-repeat lure-old da for each participant was calculated using the participant's ordinary least squares line of best fit to 1-repeat lure-old zROC plots. Our analysis indicated significantly better discrimination of novel stimuli from old 3-repeat, rather than old 1-repeat stimuli, t(21) = 13.09, P < 0.001, Cohen's d = 2.79. Corresponding lure stimuli were also better discriminated from old 3-repeat than old 1-repeat stimuli, t(21) = 9.72, P < 0.001, Cohen's d = 2.07. The latter result is particularly remarkable as it differs in direction from that found in the original study, and reproduced by us, using the original paper's formula. Therefore, encoding repetition was found to either enhance or degrade lure-old discrimination dependent on whether SDT or the original paper's formulas, respectively, were used (Fig. 1).
Figure 1.

Repeated encodings' effect on memory discrimination. Graphs A and B analyze the same data in two different ways. (A) Gist-based novel-old discrimination, as measured by d′, is significantly enhanced by repeated exposure to old stimuli. Conversely, detailed lure-old discrimination, as measured by a formula specific to Reagh and Yassa (2014), is significantly impaired by repeated exposure to old stimuli. (B) Using SDT's da, both novel-old and lure-old discrimination demonstrate significant benefit from repeated exposure to old stimuli. (C) Results from an independent 2AFC test on the same participants. Enhanced lure-old discrimination in the 3-repeat condition corroborates our findings from SDT analysis of the old/new recognition test. All error bars represent standard error of the mean.

Repeated encodings' effect on memory discrimination. Graphs A and B analyze the same data in two different ways. (A) Gist-based novel-old discrimination, as measured by d′, is significantly enhanced by repeated exposure to old stimuli. Conversely, detailed lure-old discrimination, as measured by a formula specific to Reagh and Yassa (2014), is significantly impaired by repeated exposure to old stimuli. (B) Using SDT's da, both novel-old and lure-old discrimination demonstrate significant benefit from repeated exposure to old stimuli. (C) Results from an independent 2AFC test on the same participants. Enhanced lure-old discrimination in the 3-repeat condition corroborates our findings from SDT analysis of the old/new recognition test. All error bars represent standard error of the mean. As an “adjudicator” between the conflicting formulas, we gave participants a two-alternative forced choice (2AFC) recognition task immediately following the aforementioned old/new recognition task, for the stimuli that had not been tested in the old/new task. In each of the 150 trials (half 1-repeat, half 3-repeat), we displayed two stimuli: an old stimulus (not tested in the old/new recognition task) and a corresponding lure stimulus. We asked participants to pick the stimulus they had seen at encoding and to then indicate their confidence in that choice (i.e., “very sure,” “somewhat sure,” or “not sure”). We thought that the 2AFC paradigm was particularly appropriate for assessing lure-old discrimination since participants were quite literally tasked with discriminating each old stimulus from its corresponding lure stimulus. Moreover, because a 2AFC enables participants to make a relative recognition decision (i.e., which one of the two has more memory strength?) rather than an absolute recognition decision (i.e., does the recognition strength for this one stimulus exceed my criterion for calling it “old?”), 2AFC responses do not require SDT formulas for extracting memory sensitivity from criterion-confounded responses of “old.” Rather, the commonly accepted formula for measuring memory sensitivity in a 2AFC task is a simple proportion correct (= no. of old stimuli correctly chosen/total no. of trials).[4] Consistent with our SDT analyses, repetition at encoding produced significantly better lure-old discrimination, t(21) = 9.81, P < 0.001, Cohen's d = 2.09. Thus, the 2AFC task provides an independent and compelling confirmation of the SDT results demonstrating enhanced gist- and detail-based memory for repeatedly encoded items. Then, what contributed to the original formula's “flipped” effect of enhanced detail-based memory for items that were only encoded once? For each condition, the original formula measures lure-old discrimination as the difference between memory strength for that condition's lure stimuli and both conditions’ old stimuli. The face validity of this measurement seems questionable. For example, for 1-repeat lure-old discrimination, the formula equals the difference between 1-repeat lure memory strength and an average of 1- and 3-repeat old memory strength, rather than the arguably more informative difference between 1-repeat lure and 1-repeat old memory strength. Because this pooled response to old stimuli is identical for both 1-repeat and 3-repeat lure-old discrimination measures, the original formula reduces to measuring lure “correct rejections” = p(“new”|lure) for each stimulus type. On average, the memory strength distribution for the 3-repeat lures has a larger variance than that for 1-repeat lures (1-repeat σ = 1.29, 3-repeat σ = 1.64), although the distributions have roughly equivalent means (1-repeat μ = 0.88, 3-repeat μ = 0.86). That is, the 3-repeat condition has a higher proportion of lures with slightly greater (false) memory recognition than the 1-repeat condition (see Fig. 2). Therefore, any measure of lure-old discrimination that primarily characterizes the memory strength of lure stimuli—rather than the difference in memory strength between lure and old stimuli—would produce a “flipped” result for this particular data set and, possibly, other BPS paradigms that use such a formula.
Figure 2.

A SDT modeling of recognition memory: probability density functions with respect to memory recognition strength for each of the five test stimulus types, averaged across participants. (Novel PDFs are standardized at μ = 0, σ = 1. Parameters for each of the other four PDFs are determined by the line best fit to the novel-stimulus zROC; μ = intercept/slope, σ = 1/slope.) The vertical line in both graphs represents participants’ new/old decision criterion, such that all stimuli exceeding the criterion's recognition strength were deemed “old” at test. Comparing the two graphs, one can see that 1-repeat lures are more often labeled “new” than 3-repeat lures—i.e., the portion of the 1-repeat lure PDF falling to the left of the decision criterion is larger in area than that of the 3-repeat's. However, it is also evident that the 3-repeat lure and old distributions are substantially less overlapping than the 1-repeat lure and old distributions. This degree of separation is reflected in SDT's memory discrimination measure da. Therefore, repeated encoding of old items enhances their subsequent discrimination from highly similar items.

A SDT modeling of recognition memory: probability density functions with respect to memory recognition strength for each of the five test stimulus types, averaged across participants. (Novel PDFs are standardized at μ = 0, σ = 1. Parameters for each of the other four PDFs are determined by the line best fit to the novel-stimulus zROC; μ = intercept/slope, σ = 1/slope.) The vertical line in both graphs represents participants’ new/old decision criterion, such that all stimuli exceeding the criterion's recognition strength were deemed “old” at test. Comparing the two graphs, one can see that 1-repeat lures are more often labeled “new” than 3-repeat lures—i.e., the portion of the 1-repeat lure PDF falling to the left of the decision criterion is larger in area than that of the 3-repeat's. However, it is also evident that the 3-repeat lure and old distributions are substantially less overlapping than the 1-repeat lure and old distributions. This degree of separation is reflected in SDT's memory discrimination measure da. Therefore, repeated encoding of old items enhances their subsequent discrimination from highly similar items. It is important to underscore that one might reach different conclusions if one uses a formula that interrogates memory for lure stimuli, rather than memory for the lure stimuli relative to the old stimuli. In the present study, 1-repeat lures are more likely than 3-repeat lures to be correctly identified as “new.” However, this increase in lure false alarming for the 3-repeat condition does not imply deteriorated memory for items that were repeatedly encoded. Indeed, both SDT analysis of the old/new recognition task and results from the independent 2AFC task demonstrate better discrimination of 3-repeat, rather than 1-repeat, old stimuli from novel and corresponding lure stimuli. Examining the SDT PDFs reveals the reason behind the seemingly counterintuitive dissociation between lure false alarm rate and discrimination ability: while repetition at encoding engenders slightly greater false recognition of analogous lure items, the much-increased recognition strength of the repeatedly encoded old items results in a net decrease in overlap between the 3-repeat lure and 3-repeat old PDFs. It is this decrease in lure-old overlap for the 3-repeat condition that engenders enhanced, rather than degraded, discrimination. Ultimately the formulas must reflect what one hopes to measure. For BPS studies, the question of interest is not typically “recognition strength” for lures. Rather, one wishes to assess the precision of memory representation for old stimuli by comparing that representation to that of lure stimuli. Exactly why repeated encoding increases false alarming to lures is an interesting theoretical issue. Memory strength for lure items is based on encoding success of the corresponding old items. Therefore, a participant would likely respond “new” to a lure stimulus for two very different reasons: (1) if the corresponding old stimulus was encoded with very low fidelity (i.e., if the old item had not been encoded at all) and (2) if the corresponding old stimulus was encoded with very high fidelity (i.e., if recalled details of the old item allow the participant to reject the lure item as “old,” commonly known as “recall-to-reject” (Kirwan and Stark 2007)). As such, the increased variance of the 3-repeat lure PDF relative to that of the 1-repeat lure PDF makes perfect sense: repeated encoding allows for decreased encoding-failures of old stimuli (in turn, begetting stronger memory for some lure stimuli) and for increasingly detailed encoding of the old stimuli (in turn, begetting weaker memory for other lure stimuli). The fact that SDT cannot distinguish between these two alternative reasons for lure memory strength does not preclude its usage. Rather, in allowing one to measure the “difference” in memory strength between lure and old stimuli (themselves an undoubtable mixture of different types of representations), SDT allows one to make conclusions on the precision of the underlying old stimulus representations. For example, if “new” responses to lures primarily reflect encoding failure, this encoding failure would also be evidenced in many “new” responses to the old stimuli and would, therefore, produce a small lure-old da with both lure and old PDFs highly overlapped with the novel PDF. Therefore, SDT can help to characterize these nuanced aspects of memory strength. Because the “pooled” formula is unique to the original study, it behooves us to examine formulas more commonly used to determine lure-old discrimination in BPS paradigms. The majority of BPS studies (e.g., Toner et al. 2009; Yassa et al. 2010; Stark et al. 2013; Borota et al. 2014) ask participants to characterize tested stimuli as “old,” “similar,” or “new” rather than just “old” or “new.” Often lure-old discrimination is measured as p(“similar”|lure) or p(“similar”|lure) − p(“old”|lure) (e.g., Yassa, Mattfeld, et al. 2011). These tasks assume that a participant only ever responds “similar” when she/he is successfully performing recall-to-reject. Yet this cannot possibly be true, as “similar” is also given in response to both old and novel stimuli. It is likely, therefore, that a handful of “similar” responses are given as a kind of low-confidence “old” judgment. On the surface, our inability to characterize the “similar” response given to lures in an old/similar/new recognition task might seem analogous to the varied reasons why a participant might respond “new” to lures in an old/new recognition task. However, there are noteworthy differences that render the former problematic and the latter manageable. As discussed above, exactly why a participant responds “new” to lures becomes somewhat irrelevant as SDT allows one to compare lure with old stimuli. Critically, in the old/similar/new recognition task, no single formula captures both pieces of information. (Sometimes these analyses include independent estimates of responses to old items, but a direct comparison to lure items is lacking.) Therefore, as with the pooled and “correct rejection” formulas, “similar” formulas only use participants’ responses to lure items as a measure of lure-old discrimination. To address this concern, one might be tempted to try a SDT analysis on an old/similar/new recognition task. However, this would require the researcher to decide a priori whether to treat “similar” as indicative of recall-to-reject (and hence group the responses with “new”) or to treat “similar” as a low-confidence “old” judgment. Thus, while using the old/similar/new response set may be intuitively appealing, it is highly problematic. Of these BPS lure-old discrimination formulas, one of the closest to our preferred measure of da is a type of response-bias corrected proportion correct Pr = p(“old”|old) − p(“old”|lure) (e.g., in Leal et al. 2014). Like SDT's da and d′, and unlike the aforementioned BPS formulas, Pr accounts for the relative overlap between lure and old memory strengths. This is a strength of the formula. Unlike SDT's da and d′, Pr assumes that lure and old PDFs have boxcar, rather than normal, distributions with respect to recognition strength. Moreover, like d′, Pr is a one-point measurement which assumes that lure and old distributions have equal-variances (though d′ and Pr differ on the assumed shape of those distributions). As seen in Figure 3, the data better reflect the assumptions of da than Pr: da assumes curvilinear ROCs and linear zROCs (of slope = 1 and y-intercept = d′), while Pr assumes a linear ROC (of slope = 1 and y-intercept = Pr) and a concave-up curvilinear zROC (Macmillan and Creelman 2005). Across subjects, the ROC models implied by SDT's da fit the data significantly better than those implied by Pr; root-mean-square error is significantly lower for ROC values predicted by da in the 1-repeat, t(21) = 5.99, P < 0.001, Cohen's d = 1.31, and 3-repeat conditions, t(21) = 3.41, P < 0.005, Cohen's d = 0.74. This worse fit of ROCs implied by Pr might be somewhat trivially expected, as Pr utilizes only one of the five cumulative proportions to characterize memory discrimination. Therefore, it may be more appropriate to compare Pr with SDT's d′. d′ also provides a significantly better fit than Pr for both conditions: 1-repeat, t(21) = 2.65, P < 0.05, Cohen's d = 0.57, and 3-repeat, t(21) = 3.17, P < 0.005, Cohen's d = 0.68. It should be noted that using the Pr formula does not yield different results from our SDT analyses in this particular case: 3-repeat lure-old discrimination is significantly better than 1-repeat, t(21) = 6.76, P < 0.001, Cohen's d = 1.44. However, this does not mean that Pr will always yield the correct result. Ultimately, the implied (z)ROC models reflect each formula's measure of memory discrimination across varying decision criteria. As such, a poor fit can lead to conclusions of differing memory discrimination between two conditions for which memory discrimination is the same or, conversely, to equivalent memory discrimination between two conditions for which there is a difference in memory discrimination. Therefore, although p(“old”|old) − p(“old”|lure) is one of the better formulas used in BPS paradigms to measure lure-old discrimination, it is also not without its flaws and can lead to erroneous conclusions.
Figure 3.

Lure-Old ROCs and zROCs for 1-repeat (A,B) and 3-repeat (C,D) conditions. ROC (zROC) data points are cumulative proportions (z-scored cumulative proportions) of each response type—e.g., “very sure old”—for lure and old stimuli, across all subjects. Dark curves/lines represent the fitted SDT model, allowing variances between lure and old PDFs to be unequal; light lines/curves represent the Pr’s implied model of memory discrimination. Better discrimination between lure and old stimuli in the 3-repeat condition is evidenced by the increased distance of the 3-repeat points (and models) from the line y = x. Comparing the fit from models assumed by the da and Pr formulas, it is evident that the former fits the data better for both the 1-repeat and 3-repeat conditions and, therefore, more accurately characterizes the magnitude of memory discrimination. More generally, the data appear consistent with SDT predictions, and inconsistent with high-threshold model predictions: the lure-old ROC data points appear curvilinear, rather than linear, and the zROC data points appear linear, rather than curvilinear and concave-up.

Lure-Old ROCs and zROCs for 1-repeat (A,B) and 3-repeat (C,D) conditions. ROC (zROC) data points are cumulative proportions (z-scored cumulative proportions) of each response type—e.g., “very sure old”—for lure and old stimuli, across all subjects. Dark curves/lines represent the fitted SDT model, allowing variances between lure and old PDFs to be unequal; light lines/curves represent the Pr’s implied model of memory discrimination. Better discrimination between lure and old stimuli in the 3-repeat condition is evidenced by the increased distance of the 3-repeat points (and models) from the line y = x. Comparing the fit from models assumed by the da and Pr formulas, it is evident that the former fits the data better for both the 1-repeat and 3-repeat conditions and, therefore, more accurately characterizes the magnitude of memory discrimination. More generally, the data appear consistent with SDT predictions, and inconsistent with high-threshold model predictions: the lure-old ROC data points appear curvilinear, rather than linear, and the zROC data points appear linear, rather than curvilinear and concave-up. It is worth mentioning that Pr is a specific instantiation of a high-threshold model for recognition memory. Dual-process theories (Yonelinas 1994) state that recognition memory is a combination of signal-detection and high-threshold processes (i.e., familiarity and recollection, respectively). Therefore, they predict that, because novel-old discrimination utilizes both processes, novel-old ROCs will be less curvilinear and more linear in parts of the ROC-plot corresponding to high confidence old judgments. Additionally, because lure-old discrimination is thought to utilize only recollection, dual-process accounts predict a completely linear lure-old ROC. This dual-process account is currently well-debated. We, as others (e.g., Wixted and Stretch 2004), do not find evidence of linearity in ROCs (and curvilinearity in zROCs) and, as such, favor a signal-detection account for both novel-old and lure-old discrimination. If one wishes to remain neutral to this debate or agnostic to the assumption of normally distributed recognition strength for Novel, Lure, and Old items, one can simply take the area under the ROC (AUC), in lieu of da, for both novel-old and lure-old discrimination. Unlike da, AUC is calculated without fitting a normally distributed model to the data (Pollack and Hsieh 1969). When the assumption of normal distribution is met, , where Φ−1 represents the inverse normal cumulative distribution function. (Note that in practice, even when the data are normally distributed, methods for calculating AUC frequently underestimate the true area.) Using AUC calculated via the trapezoidal rule, our da results were confirmed: novel-old discrimination, t(21) = 11.11, P < 0.001, Cohen's d = 2.37, and lure-old discrimination, t(21) = 8.60, P < 0.001, Cohen's d = 1.83, were both enhanced with repeated exposure. In considering how lure-old discrimination is measured in BPS paradigms, it is imperative that we also reflect on the original motivation behind the paradigm and the measurement's implied construct. As the task name suggests, behavioral lure-old discrimination is thought to measure the success of the hippocampal process of “pattern separation”—i.e., the orthogonalization of memory representations for similar stimuli (Yassa and Stark 2011). Many in the field have acknowledged that it is difficult to make a direct link between a behavioral outcome in a complex task and a specific neural computation in one brain structure (e.g., Yassa and Stark 2011; Santoro 2013; Stark et al. 2013; Reagh and Yassa 2014;). There has been some evidence, however, that aspects of performance in BPS paradigms do correlate with activation in the hippocampus, and that this varies in aging and disease (e.g., Toner et al. 2009; Yassa et al. 2010, 2011; Holden et al. 2012). In order to more precisely link behavioral performance to neural computations, a better understanding is first needed of the underlying behavioral representations. Pattern separation is measured by the amount of neural representational overlap between lure and old stimuli (Vazdarjanova and Guzowski 2004; Leutgeb et al. 2007). Therefore, at a minimum, the behavioral data should also be measured by a formula that characterizes the amount of recognition overlap between lure and old stimuli. This is the essence of SDT's da (or even proportion correct in a 2AFC recognition task). Other formulas have either considered only the lure representation, and not the relationship between lure and old representations, or have poorly characterized this relationship. Though da does reliably measure the degree of lure-old overlap, more conditions must be met for one to conclude that less behaviorally measured overlap in these representations is specifically due to the hippocampal process of pattern separation during processing of lure stimuli. For example, our data demonstrate increased discrimination, and hence less lure-old overlap, in the 3-repeat than 1-repeat condition. But, can we conclude that this increased separation in behavioral recognition strength is specifically driven by increased hippocampal pattern separation? In beginning to answer this question, it is first important to note that the representation of old stimuli—as measured by novel-old discrimination—is different between the 1-repeat and 3-repeat conditions. Therefore, it seems that increased discrimination between lure and old stimuli with repeated encodings is driven not by increased pattern separation of the lures but by differences in the representation of the old stimuli. More generally, one might argue that strong evidence for greater behavioral pattern separation in condition A than condition B should satisfy two conditions: lure-old overlap should be smaller in condition A, while novel-old overlap stays the same for both conditions. Simply put: increased behavioral pattern separation should shift only the lure distribution. As mentioned above regarding dual-process theories of memory recognition, a broader consideration when using these paradigms, however, is that there may be multiple processes contributing to the behavior (Molitor et al. 2014). Therefore, behavioral paradigms alone cannot usually suffice for conclusions about neural processes (Santoro 2013), but a better behavioral measure can help to test whether there is indeed a relationship. In conclusion, the BPS paradigm uniquely allows one to examine both gist- and detail-based recognition. However, the many current measures of lure-old discrimination in BPS paradigms fail to accurately characterize the fidelity of memory representation for the old stimuli. These formulas have either neglected to compare memory strength between old and lure stimuli, focusing only on the latter, or have imprecisely modeled the overlap in old and lure representations. Moreover, when BPS studies use different formulas for measuring novel-old and lure-old discrimination, it is impossible to directly compare the two types of discrimination. Therefore, SDT for old/new recognition tasks—or proportion correct for nAFC tasks (see Brady et al. 2011 for a good example)—provides an appropriate and unified framework for measuring both novel-old and lure-old discrimination, allowing stronger tests of hypotheses that predict differences between gist- and detail-based recognition. Our analyses demonstrate that a formula used in a particular BPS paradigm (Reagh and Yassa 2014) lead to an erroneous conclusion. It is important to realize that this one result, as well as the shortcomings of other BPS formulas, do not ipso facto invalidate all conclusions thus far derived from BPS paradigms. Future BPS paradigms using SDT or other theoretically justified analysis methods will have to bear this out. More generally, researchers should remember that conclusions are contingent upon dependent measures, which are themselves contingent upon theory-laden measurement assumptions that can, and should be tested whenever possible.
  21 in total

1.  The case against a criterion-shift account of false memory.

Authors:  J T Wixted; V Stretch
Journal:  Psychol Rev       Date:  2000-04       Impact factor: 8.934

2.  In defense of the signal detection interpretation of remember/know judgments.

Authors:  John T Wixted; Vincent Stretch
Journal:  Psychon Bull Rev       Date:  2004-08

3.  Pattern separation in the dentate gyrus and CA3 of the hippocampus.

Authors:  Jill K Leutgeb; Stefan Leutgeb; May-Britt Moser; Edvard I Moser
Journal:  Science       Date:  2007-02-16       Impact factor: 47.728

4.  Visual object pattern separation deficits in nondemented older adults.

Authors:  Chelsea K Toner; Eva Pirogovsky; C Brock Kirwan; Paul E Gilbert
Journal:  Learn Mem       Date:  2009-04-29       Impact factor: 2.460

5.  Spatial pattern separation in cognitively normal young and older adults.

Authors:  Heather M Holden; Calhuei Hoebel; Kelly Loftis; Paul E Gilbert
Journal:  Hippocampus       Date:  2012-03-30       Impact factor: 3.899

6.  The contribution of familiarity to recognition memory is a function of test format when using similar foils.

Authors:  Ellen Migo; Daniela Montaldi; Kenneth A Norman; Joel Quamme; Andrew Mayes
Journal:  Q J Exp Psychol (Hove)       Date:  2008-12-16       Impact factor: 2.143

7.  Differences in hippocampal neuronal population responses to modifications of an environmental context: evidence for distinct, yet complementary, functions of CA3 and CA1 ensembles.

Authors:  Almira Vazdarjanova; John F Guzowski
Journal:  J Neurosci       Date:  2004-07-21       Impact factor: 6.167

8.  Yes/no recognition, forced-choice recognition, and the human hippocampus.

Authors:  P J Bayley; J T Wixted; R O Hopkins; L R Squire
Journal:  J Cogn Neurosci       Date:  2008-03       Impact factor: 3.225

9.  Memory-related eye movements challenge behavioral measures of pattern completion and pattern separation.

Authors:  Robert J Molitor; Philip C Ko; Erin P Hussey; Brandon A Ally
Journal:  Hippocampus       Date:  2014-02-18       Impact factor: 3.899

10.  Reassessing pattern separation in the dentate gyrus.

Authors:  Adam Santoro
Journal:  Front Behav Neurosci       Date:  2013-07-30       Impact factor: 3.558

View more
  8 in total

1.  Retrieval of high-fidelity memory arises from distributed cortical networks.

Authors:  Peter E Wais; Sahar Jahanikia; Daniel Steiner; Craig E L Stark; Adam Gazzaley
Journal:  Neuroimage       Date:  2017-01-31       Impact factor: 6.556

2.  It's All in the Details: Relations Between Young Children's Developing Pattern Separation Abilities and Hippocampal Subfield Volumes.

Authors:  Kelsey L Canada; Chi T Ngo; Nora S Newcombe; Fengji Geng; Tracy Riggins
Journal:  Cereb Cortex       Date:  2019-07-22       Impact factor: 5.357

3.  Age-related impairment on a forced-choice version of the Mnemonic Similarity Task.

Authors:  Derek J Huffman; Craig E L Stark
Journal:  Behav Neurosci       Date:  2016-12-22       Impact factor: 1.912

Review 4.  Mnemonic Similarity Task: A Tool for Assessing Hippocampal Integrity.

Authors:  Shauna M Stark; C Brock Kirwan; Craig E L Stark
Journal:  Trends Cogn Sci       Date:  2019-10-06       Impact factor: 20.229

Review 5.  Measuring memory is harder than you think: How to avoid problematic measurement practices in memory research.

Authors:  Timothy F Brady; Maria M Robinson; Jamal R Williams; John T Wixted
Journal:  Psychon Bull Rev       Date:  2022-10-19

6.  Exploring the use of phonological and semantic representations in working memory.

Authors:  Nelson Cowan; Dominic Guitard; Nathaniel R Greene; Sylvain Fiset
Journal:  J Exp Psychol Learn Mem Cogn       Date:  2022-01-10       Impact factor: 3.140

7.  Repeated encoding fosters retention of perceptual detail in visual recognition memory.

Authors:  Mengting Zhang; Almut Hupbach
Journal:  Learn Mem       Date:  2020-10-15       Impact factor: 2.460

8.  Rest-related consolidation protects the fine detail of new memories.

Authors:  Michael Craig; Michaela Dewar
Journal:  Sci Rep       Date:  2018-05-01       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.