| Literature DB >> 30265012 |
Craig Hedge1, Georgina Powell1, Aline Bompas1, Solveiga Vivian-Griffiths1, Petroc Sumner1.
Abstract
The underpinning assumption of much research on cognitive individual differences (or group differences) is that task performance indexes cognitive ability in that domain. In many tasks performance is measured by differences (costs) between conditions, which are widely assumed to index a psychological process of interest rather than extraneous factors such as speed-accuracy trade-offs (e.g., Stroop, implicit association task, lexical decision, antisaccade, Simon, Navon, flanker, and task switching). Relatedly, reaction time (RT) costs or error costs are interpreted similarly and used interchangeably in the literature. All of this assumes a strong correlation between RT-costs and error-costs from the same psychological effect. We conducted a meta-analysis to test this, with 114 effects across a range of well-known tasks. Counterintuitively, we found a general pattern of weak, and often no, association between RT and error costs (mean r = .17, range -.45 to .78). This general problem is accounted for by the theoretical framework of evidence accumulation models, which capture individual differences in (at least) 2 distinct ways. Differences affecting accumulation rate produce positive correlation. But this is cancelled out if individuals also differ in response threshold, which produces negative correlations. In the models, subtractions between conditions do not isolate processing costs from caution. To demonstrate the explanatory power of synthesizing the traditional subtraction method within a broader decision model framework, we confirm 2 predictions with new data. Thus, using error costs or RT costs is more than a pragmatic choice; the decision carries theoretical consequence that can be understood through the accumulation model framework. (PsycINFO Database Record (c) 2018 APA, all rights reserved).Entities:
Mesh:
Year: 2018 PMID: 30265012 PMCID: PMC6195302 DOI: 10.1037/bul0000164
Source DB: PubMed Journal: Psychol Bull ISSN: 0033-2909 Impact factor: 17.737
Pearson’s r and Spearman’s rho Correlations Between Reaction Time Costs and Error Costs in Cognitive Tasks
| Study | Task/effect | Trial | Pearsons’ | Spearman’s rho | ||
|---|---|---|---|---|---|---|
| † Dataset combines two groups of participants who underwent the same procedure with different trial numbers. ‡ Trial numbers varied between participants due to task design (e.g. randomized trials). Averages are reported. The correlations in | ||||||
| Our data and unpublished data | Flanker (arrows) | 104 | 480/480 | .28 | .27 | |
| Stroop | 103 | 480/480 | .27 | .29 | ||
| SNARC | 40 | 640/640 | .20 | .20 | ||
| Navon – local conflict | 40 | 320/320 | .41 | .32 | ||
| Navon – global conflict | 40 | 320/320 | −.11 | −.06 | ||
| Navon – global precedence | 40 | 320/320 | −.25 | −.23 | ||
| Other data | Flanker (arrows) | 50 | 336/336 | .23 | .21 | |
| Simon | 50 | 336/336 | .47 | .54 | ||
| Antisaccade | 48 | 200|400/300|400† | −.13 | −.18 | ||
| Illogical rule task | 44 | 200/200 | −.20 | −.12 | ||
| Distractor | 48 | 200/200 | .38 | .30 | ||
| Antisaccade | 21 | 400/400 | −.13 | −.15 | ||
| New analysis of published data | Antisaccade | 502 | 60/60 | .20 | .22 | |
| Lexical decision task (English) | 809 | 1,686/1,686‡ | .34 | .37 | ||
| Task switching | 49 | 78/78 | −.07 | .02 | ||
| Exp. 1 Task switching | 52 | 185/199† | .10 | −.08 | ||
| Exp. 1 Rule congruency | 52 | 192/192 | .11 | .23 | ||
| Exp. 1 List congruency | 52 | 192/192 | −.13 | .18 | ||
| Exp. 2 Task switching | 32 | 225/223† | .24 | .30 | ||
| Exp. 2 Rule congruency | 32 | 224/224 | .18 | .32 | ||
| Exp. 2 List congruency | 32 | 224/224 | −.37 | −.09 | ||
| Exp. 3 Task switching | 32 | 123/131‡ | .22 | .26 | ||
| Exp. 3 Rule congruency | 32 | 380/126‡ | .33 | .26 | ||
| Exp. 3 Incentive | 32 | 46/47‡ | .07 | .02 | ||
| Exp. 3 Mixed task | 32 | 252/123 | .43 | .26 | ||
| Flanker | 42 | 120/120 | .14 | .25 | ||
| Task switching (antisaccade) | 18 | 104/104 | −.14 | −.21 | ||
| Flanker (colour) | 58 | 120/120 | .09 | .13 | ||
| Simon | 216 | 192/192 | .33 | .38 | ||
| Stroop | 216 | 192/192 | .21 | .19 | ||
| Numeric Stroop | 216 | 192/192 | .24 | .19 | ||
| Navon (conflict) | 216 | 192/192 | .52 | .27 | ||
| Task switching (animacy/size) | 216 | 128/128 | −.03 | .01 | ||
| Task switching (shape/colour) | 216 | 128/128 | .02 | −.01 | ||
| Task switching (parity/magnitude) | 216 | 128/128 | .02 | .04 | ||
| Task switching (fill/frame) | 216 | 128/128 | .07 | .05 | ||
| Task mixing (animacy/size) | 216 | 512/128 | .15 | .20 | ||
| Task mixing (shape/colour) | 216 | 512/128 | .08 | .15 | ||
| Task mixing (parity/magnitude) | 216 | 512/128 | .19 | .18 | ||
| Task mixing (fill/frame) | 216 | 512/128 | .05 | .21 | ||
| Stroop | 3,305 | 21/42 | .15 | .12 | ||
| Task switching | 21 | 878/438‡ | .11 | .08 | ||
| Lexical decision task (French) | 868 | 1,000/1,000 | .55 | .55 | ||
| Stroop (picture/word) | 95 | 600/600 | .28 | .31 | ||
| Flanker | 142 | 192/192 | −.45 | −.08 | ||
| Simon | 142 | 192/192 | .38 | .43 | ||
| Stroop | 142 | 192/192 | .46 | .37 | ||
| Task switching (animacy/size) | 142 | 128/128 | .15 | .12 | ||
| Task switching (shape/colour) | 142 | 128/128 | .09 | .16 | ||
| Task switching (parity/magnitude) | 142 | 128/128 | .18 | .13 | ||
| Single vs. mixed task (animacy/size) | 142 | 512/128 | −.13 | −.09 | ||
| Single vs. mixed task (shape/colour) | 142 | 512/128 | .26 | .16 | ||
| Single vs. mixed task (parity/magnitude) | 142 | 512/128 | .02 | .04 | ||
| Flanker | 73 | 60/60 | −.02 | .02 | ||
| Flanker | 64 | 72/72 | .04 | .13 | ||
| Flanker | 26 | 24/24 | .29 | .16 | ||
| Lexical decision task (Dutch) | 39 | 14,089/14,089 | .73 | .71 | ||
| Lexical decision task (English) | 79 | 14,365/14,365 | .78 | .81 | ||
| Flanker (gratings) | 18 | 48/48 | −.24 | −.12 | ||
| Flanker | 120 | 50/50 | .22 | .26 | ||
| Flanker | 197 | 50/50 | .24 | .22 | ||
| Simon | 21 | 96/96 | .29 | .21 | ||
| Manual “antisaccade” | 44 | 256/256 | −.03 | −.11 | ||
| Task mixing | 44 | 256/256 | .03 | .03 | ||
| SNARC | 17 | 56/56 | .31 | .28 | ||
| Stroop | 57 | 90/30 | .48 | .49 | ||
| Task switching | 62 | 130/148‡ | .21 | .21 | ||
| Reward | 62 | 140/140 | .41 | .50 | ||
| Flanker | 56 | 250/250 | .28 | .36 | ||
| Flanker | 58 | 250/250 | .58 | .59 | ||
| Stroop | 217 | 288/288 | .28 | .29 | ||
| Flanker | 2,249 | 50/50 | .51 | .39 | ||
| Flanker | 120 | 48/48 | −.05 | −.08 | ||
| Simon | 120 | 150/50 | .15 | .22 | ||
| Numerical Stroop | 120 | 48/48 | .36 | .31 | ||
| Task switching (animacy/size) | 120 | 64/64 | −.08 | −.09 | ||
| Task switching (colour/shape) | 120 | 64/64 | .07 | −.02 | ||
| Task switching (parity/size) | 120 | 64/64 | −.18 | −.17 | ||
| Single vs. mixed task (animacy/size) | 120 | 256/64 | .09 | .10 | ||
| Single vs. mixed task (colour/shape) | 120 | 256/64 | .14 | .12 | ||
| Single vs. mixed task (parity/size) | 120 | 256/64 | .03 | .11 | ||
| Flanker | 23 | 80/80 | −.13 | .08 | ||
| Simon | 23 | 320/120 | .35 | .26 | ||
| Age implicit association test (IAT) | 98,1873 | 40/40 | .27 | .38 | ||
| Arab IAT | 33,8103 | 40/40 | .07 | .17 | ||
| Asian IAT | 37,4882 | 40/40 | .18 | .26 | ||
| Disability IAT | 30,9792 | 40/40 | .26 | .37 | ||
| Gender-career IAT | 85,2861 | 40/40 | .18 | .28 | ||
| Gender-science IAT | 63,6003 | 40/40 | .19 | .28 | ||
| Native American IAT | 21,7444 | 40/40 | .21 | .28 | ||
| President IAT | 37,9465 | 40/40 | .21 | .31 | ||
| Race IAT | 3,339,097 | 40/40 | .30 | .40 | ||
| Religion IAT | 169,247 | 40/40 | .18 | .28 | ||
| Sexuality IAT | 1,452,795 | 40/40 | .24 | .35 | ||
| Skin color IAT | 872,781 | 40/40 | .26 | .36 | ||
| Weapons IAT | 534,563 | 40/40 | .21 | .32 | ||
| Weight IAT | 969,372 | 40/40 | .26 | .36 | ||
| Flanker | 160 | 64/64 | −.08 | −.09 | ||
| Simon | 160 | 92/92 | .43 | .42 | ||
| Previously reported correlations | Task switching (antisaccade) – schizophrenia | 21 | 104/104 | −.05 | ||
| Task switching (antisaccade) – controls | 16 | 104/104 | .22 | |||
| Task switching | 552 | 96/96 | −.08 | |||
| Task switching | 1,902 | 46/98 | .01 | |||
| Task switching | 46 | 264/120 | .21 | |||
| Stroop | 87 | 36/36 | −.02 | |||
| Stroop | 88 | 36/36 | .17 | |||
| Stroop | 138 | 36/36 | .10 | |||
| Attention networks test: Alerting | 1,129 | 72/72 | −.10 | |||
| Attention networks test: Orienting | 1,129 | 72/72 | .05 | |||
| Attention networks test: Executive | 1,129 | 96/96 | .21 | |||
| Manual “antisaccade” | 117 | 30/60 | .29 | |||
| Stroop | 35 | 54/54 | .20 | |||
| Flanker (Parkinson’s patients) | 50 | 103/103 | −.39 | |||
Figure 1PRISMA flow diagram illustrating our process for identifying eligible articles and datasets. N refers to records (articles or records on data repositories), K refers to correlations identified. Manual searches refers to records obtained through reference lists, Google, and manually searching data repositories (e.g. OSF.io).
Figure 2Funnel plot of observed effect sizes (Pearson’s r) for correlations between RT costs and error costs with associated standard errors. Larger values on the y-axis reflect larger sample sizes. Solid black line indicates weighted mean effect from a random effects model. Grey area indicates 95% confidence region. Dashed black lines show 95% confidence intervals of the mean effect estimated from a random-effects model. Red line indicates an effect size of zero. The lexical decision task effects are shown in black circles, all other tasks are shown in gray (see text for details).
Figure 3Schematic of two sequential sampling models. i) The drift-diffusion model (Ratcliff, 1978; Ratcliff & McKoon, 2008) consists of a single accumulator accruing evidence from a starting point (z) to one or the other response threshold (a and 0). The drift rate on each simulated trial is taken from a distribution that has a mean (v) and standard deviation (η) across trials, and is subject to within-trial noise (s). ii) The LBA model consists of an accumulator for each response option, accruing evidence to a common response threshold (b). On each simulated trial, drift rates are taken from distributions which have a mean (vc, ve) and standard deviation (s), and begin accumulating evidence from a starting point selected from a uniform distribution (A-0). The models also normally add non-decision time to capture sensory and motor delays, but here we simply assume this is a constant, as variance in non-decision time is not needed for our discussion.
Figure 4Schematic of two sequential sample models for conflict tasks. i) The diffusion model for conflict tasks, DMC (Ulrich et al., 2015), an extension of the drift-diffusion model to accommodate the flanker and Simon tasks. The DMC adds a transient input for the irrelevant competing information (black gamma function in the lower panel) to the sustained linear process for the correct information (μc: grey line in the lower panel). The gamma function, defined by the parameters A, a and τ, provides an impulse function, so that the irrelevant features (e.g. the flankers) initially have a large input, which diminishes rapidly within the trial. ii) ALIGATER is an extension of LATER (Carpenter and Williams, 1995) originally tested in the context of saccadic interference effects (Bompas & Sumner, 2011). Two LATER units, one for the target and one for the distractor, attempt to rise to threshold while mutually inhibiting each other. To produce goal-directed selectivity ALIGATER includes reactive inhibition instead of altering drift rates. This inhibition attenuates the activation in the distractor node by a specified amount (Iendo) after a delay (δendo) (lower right panel).
Figure 5Pattern of RT costs and Error costs produced by variation in response caution and selection in the drift diffusion model. Straight, solid lines show condition averages, faint lines show example individual trials. Black lines show drift rates in congruent/baseline condition, coloured lines show incongruent condition. A. Response caution: Individuals who are low in response caution will set a lower threshold (e.g. grey dotted line) than highly cautious individuals (black dotted line). This means not only that their RTs will be faster, but also the difference between conditions will be smaller, leading to smaller RT costs, noted by grey arrows compared to black arrows. However, the lower threshold will lead to more errors due to noise in the accumulation process, which can be overcome with higher thresholds (example trial in purple reaches the grey error threshold, but not the black error threshold). Note that this will predominantly affect the incongruent or more difficult condition, as errors are rare in congruent/baseline conditions, leading to higher relative error costs. B. Response selection: Individuals who have high selection efficiency will have relatively higher drift rates in incongruent conditions (red solid lines) compared to individuals with lower selection efficiency (blue solid lines), leading to smaller RT costs (noted by red arrows compared to blue arrows). Moreover, the higher drift rate means noise is less likely to cause the incorrect response (illustrated with blue example trial that reaches the error threshold). Note that individuals could also vary in their average drift rates in congruent conditions, and the conclusions would remain the same, since the same difference in drift rate between conditions creates larger costs if average drift rates are lower. For simplicity we keep average congruent drift rates constant in our simulations.
Parameters Used for Model Simulations
| Model | Response selection | Response caution | Other parameters | ||||
|---|---|---|---|---|---|---|---|
| DDM | Congruent drift rate (v1) | Variability in drift rates (η) | Start point bias (a/z) | Within-trial noise (s) | |||
| .45 | .1 | .5 | .1 | ||||
| LBA | Congruent drift rate (v1) | Variability in drift rates (s) | Variability in start points (A) | ||||
| 1 | .27 | 250 | |||||
| DMC | Controlled process drift rate (μc) | Time-to-peak of automatic activation (τ) | Start point shape (α) | Within-trial noise (σ) | Automatic activation shape (a) | ||
| .63 | 90 | 2 | 4 | 2 | |||
| ALIGATER | Drift rates (μc, μi) | Variability in drift rates (η) | Mutual inhibition strength (w) | Mutual inhibition delay (δw) | Reactive inhibition delay (δendo) | ||
| .0078 | .0039 | .01 | 1 ms | 70 ms | |||
Figure 6Simulated error costs and RT costs produced by four decision models. DDM = Drift-diffusion model, LBA = Linear ballistic accumulator model, DMC = Diffusion model for conflict tasks, ALIGATER = Approximately linear rise to threshold with ergodic rate. The first and second columns show the patterns of error costs and RT costs, respectively, as a function of variation in both caution and response selection as implemented in the different models (see main text for details). The third column shows the correlation between RT costs and error costs that arise from holding response selection constant and allowing caution to vary (purple line and crosses), and for allowing response selection to vary while caution is held constant (grey line and circles). Though the simulated data are often non-linear, linear trend lines are plotted for illustrative purposes since most studies of individual differences would calculate linear correlations. Note some changes of scale between plots, due to the range of parameters used, as guided by previous literature (see text). Trials with decision times longer than 2000 ms were excluded from the plots.
Sample Sizes and Pearson’s r Correlations Between RT and Error Costs from Studies 1 and 2
| Dataset | Instruction condition | Speed-standard | |||
|---|---|---|---|---|---|
| Speed | Standard | Accuracy | |||
| Flanker 1 Session 1 | 55 | .56 | .36 | .31 | .24 |
| Flanker 1 Session 2 | 47 | .40 | .34 | −.01 | .07 |
| Stroop 1 Session 1 | 52 | .19 | .19 | .21 | .00 |
| Stroop 1 Session 2 | 46 | .33 | .15 | .21 | .19 |
| Flanker 2 | 81 | .46 | .23 | .01 | .26 |
| Dot-motion 2 | 73 | .22 | −.07 | −.04 | .28 |
Spearman’s Correlations Between RT Costs and Error Costs in the Simon Task in Study 3 (N = 102)
| Measure/condition | RT cost–mixed | Error cost–mixed | RT cost–blocked | Error cost–blocked |
|---|---|---|---|---|
| * | ||||
| RT cost–mixed | .10 | .10 | ||
| Error cost–mixed | −.02 | .14 | ||
| RT cost–blocked | ||||
| Mean | 20 ms*** | 3.4%*** | 46 ms*** | 3.6%*** |
| Std. dev | 17 ms | 5.0% | 24 ms | 5.0% |