| Literature DB >> 24793447 |
Mads Dyrholm1, Signe Vangkilde, Claus Bundesen.
Abstract
Conventional wisdom on psychological experiments has held that when one or more independent variables are manipulated it is essential that all other conditions are kept constant such that confounding factors can be assumed negligible (Woodworth, 1938). In practice, the latter assumption is often questionable because it is generally difficult to guarantee that all other conditions are constant between any two trials. Therefore, the most common way to check for confounding violations of this assumption is to split the experimental conditions in terms of "trial types" to simulate a reduction of unintended trial-by-trial variation. Here, we pose a method which is more general than the use of trial types: use of mathematical models treating measures of potentially confounding factors and manipulated variables as equals on the single-trial level. We show how the method can be applied with models that subsume under the generalized linear item response theory (GLIRT), which is the case for most of the well-known psychometric models (Mellenbergh, 1994). As an example, we provide a new analysis of a single-letter recognition experiment using a nested likelihood ratio test that treats manipulated and measured variables equally (i.e., in exactly the same way) on the single-trial level. The test detects a confounding interaction with time-on-task as a single-trial measure and yields a substantially better estimate of the effect size of the main manipulation compared with an analysis made in terms of trial types.Entities:
Mesh:
Year: 2014 PMID: 24793447 PMCID: PMC4366569 DOI: 10.1007/s00426-014-0570-8
Source DB: PubMed Journal: Psychol Res ISSN: 0340-0727
Fig. 1Experimental procedure. a Time course of a single trial. An initial fixation cross was presented. Then a brief cue appeared, to remind the participant of the hazard rate condition (high vs. low). The fixation cross then reappeared in a variable foreperiod before the single target letter was presented either above (as shown) or below the fixation cross before being masked. The participant then reported the letter identity if known. b Foreperiod distributions. These were defined to be geometric and such that, in the high hazard rate condition the expected foreperiod was 0.75 s, and in the low hazard rate condition it was 4.5 s
Testing with a single-trial measure of time-on-task
| Variable | Coefficient (as % difference) | |||
|---|---|---|---|---|
| Model 1 | Model 2 | Model 3† | Model 4 | |
| In terms of trial types | ||||
| Foreperiod | ||||
| =1.0 s | 5.24*** | 4.91*** | 7.28*** | 7.46** |
| =1.5 s | −2.21 | |||
| ≥1.5 s | −5.21 | |||
| ≥2.0 s | −4.46 | |||
| Hazard Rate | ||||
| =high | 25.30* | 24.17* | 28.38*** | 45.59*** |
| Beyond trial types | ||||
| Time-on-task | ||||
| | −16.52*** | |||
| Interactions | ||||
| | −3.93* | −4.01* | −3.73* | |
| | −26.46*** | −26.64*** | −26.68*** | |
Estimated differences were given by GLIRT coefficients represented as percentage change in v value (perceptual processing speed) per explanatory variable unit increase on average across subjects and sessions. From Model 1 onwards, the foreperiod (FP) coefficients were not significant beyond the FP of 1.0 s. Model 2 was designed as an alternative to simply eliminating the nonsignificant FP coefficients beyond 1.0 s. The step from Model 1 to Model 2 could not be rejected, −2lnΛ = 66.7, p [> χ 2(64)] = .383. Model 3 was designed to test elimination of FP coefficients beyond 1.0 s, and the step from Model 2 to Model 3 could not be rejected, −2lnΛ = 58.3, p[> χ 2(64)] = .677. Model 4 was designed to test whether the time-on-task (T) effects were independent of the hazard rate (HR) conditions, but this model was rejected in favor of Model 3, −2lnΛ = 92.1, p[> χ 2(64)] = .012. Model 3 won the model selection as further nesting to Model 4 was rejected
HR = hazard rate; T = time-on-task
†Model 3 wins the model selection. Further nesting to Model 4 was rejected, p < .05
*p < 0.05, ** p < 0.01, *** p < 0.005
Fig. 2Expected value of the perceptual processing speed v given the trial types and the target onset times of an exemplary session. Model coefficients were set to the sample average. The trial types were trials with high hazard rate (green and yellow) versus trials with low hazard rate (red and blue) and trials with a foreperiod of 1 s (circled dots) versus trials with other foreperiods (simple dots). a The output of a conventional analysis, Model 7, where time-on-task is represented in terms of early and late trial types. b The output of Model 3, which differs from the conventional analysis by treating time-on-task and manipulated variables equally on the single-trial level. The divergence over trials between the results from the two hazard rate conditions (yellow vs. blue) shows very clearly the interaction between time-on-task and hazard rate
Testing time-on-task in terms of trial types
| Variable | Coefficient (as % difference) | |||
|---|---|---|---|---|
| Model 5 | Model 6 | Model 7† | Model 8 | |
| In terms of trial types | ||||
| Foreperiod | ||||
| =1.0 s | 5.25*** | 4.91*** | 7.16*** | 7.22** |
| =1.5 s | −2.42 | |||
| ≥1.5 s | −4.97 | |||
| ≥2.0 s | −4.04 | |||
| Hazard rate | ||||
| =high | 37.79*** | 36.41*** | 40.75*** | 49.19*** |
| Time-on-task trial type | ||||
| { | −7.35 | |||
| Interactions | ||||
| { | −1.24** | −1.12* | −.97** | |
| { | −12.49* | −12.56* | −12.72* | |
Estimated differences were given by GLIRT coefficients represented as percentage change in v value (perceptual processing speed) per explanatory variable unit increase on average across subjects and sessions. Time-on-task is represented in terms of early and late trial types
†Model 7 wins the model selection. Further nesting to Model 8 was rejected, p < .005
*p < .05, **p < .01, ***p < .005