| Literature DB >> 29379867 |
Phillip M Alday1, Matthias Schlesewsky2, Ina Bornkessel-Schlesewsky2.
Abstract
The recent trend away from ANOVA-based analyses places experimental investigations into the neurobiology of cognition in more naturalistic and ecologically valid designs within reach. Using mixed-effects models for epoch-based regression, we demonstrate the feasibility of examining event-related potentials (ERPs), and in particular the N400, to study the neural dynamics of human auditory language processing in a naturalistic setting. Despite the large variability between trials during naturalistic stimulation, we replicated previous findings from the literature: the effects of frequency, animacy, and word order and find previously unexplored interaction effects. This suggests a new perspective on ERPs, namely, as a continuous modulation reflecting continuous stimulation instead of a series of discrete and essentially sequential processes locked to discrete events.Entities:
Keywords: ecological validity; mixed-effects models; naturalistic stimuli; predictive coding
Mesh:
Year: 2017 PMID: 29379867 PMCID: PMC5779117 DOI: 10.1523/ENEURO.0311-16.2017
Source DB: PubMed Journal: eNeuro ISSN: 2373-2822
Figure 1.Single trial and average ERPs from electrode CPz from a single subject for unambiguous accusatives placed before a nominative. In the upper part, single trials are displayed stacked and sorted from top to bottom in decreasing orthographic length as a weak proxy for acoustic length, while the lower part displays the average ERP. Amplitude is given by color in the upper part and by the y-axis in the lower part. The dashed vertical lines indicate the boundaries of the N400 time window, 300 and 500 ms after stimulus onset.
“Design” matrix for the sentence processing cues
| inanimate | accusative | noninitial | 89 |
| inanimate | accusative | initial | 4 |
| inanimate | nominative | noninitial | 8 |
| inanimate | nominative | initial | 13 |
| inanimate | ambiguous | noninitial | 99 |
| inanimate | ambiguous | initial | 52 |
| animate | accusative | noninitial | 22 |
| animate | accusative | initial | 7 |
| animate | nominative | noninitial | 8 |
| animate | nominative | initial | 16 |
| animate | ambiguous | noninitial | 39 |
| animate | ambiguous | initial | 86 |
Count represents the number of “trials.” The extreme lack of balance reflects natural language statistics and can only be appropriately modeled by methods using variance pooling, such as LMMs.
Power calculations were performed via simulation with 1000 iterations via the simr package (Green and MacLeod, 2016)
| Model | Predictor | Data Structure | Type of test | Lower | Upper |
|---|---|---|---|---|---|
| a | chan[cz] | Asymptotically normal | Wald | 0.000 | 0.004 |
| a | chan[pz] | Asymptotically normal | Wald | 0.966 | 0.985 |
| a | index | Asymptotically normal | Wald | 0.854 | 0.896 |
| a | freq.class | Asymptotically normal | Wald | 0.000 | 0.004 |
| a | index:freq.class | Asymptotically normal | Wald | 0.000 | 0.004 |
| b | chan[cz] | Asymptotically normal | Wald | 0.000 | 0.004 |
| b | chan[pz] | Asymptotically normal | Wald | 0.966 | 0.985 |
| b | index | Asymptotically normal | Wald | 0.231 | 0.286 |
| b | rel.freq.class | Asymptotically normal | Wald | 0.000 | 0.004 |
| b | index:rel.freq.class | Asymptotically normal | Wald | 0.000 | 0.007 |
| c | chan[cz] | Asymptotically normal | Wald | 0.002 | 0.012 |
| c | chan[pz] | Asymptotically normal | Wald | 0.663 | 0.721 |
| c | animacy[−] | Asymptotically normal | Wald | 0.009 | 0.026 |
| c | morphology[−] | Asymptotically normal | Wald | 0.991 | 0.999 |
| c | morphology[+] | Asymptotically normal | Wald | 0.000 | 0.004 |
| c | position[−] | Asymptotically normal | Wald | 0.000 | 0.004 |
| c | animacy[−]:morphology[−] | Asymptotically normal | Wald | 0.010 | 0.027 |
| c | animacy[−]:morphology[+] | Asymptotically normal | Wald | 0.086 | 0.125 |
| c | animacy[−]:position[−] | Asymptotically normal | Wald | 0.000 | 0.004 |
| c | morphology[−]:position[−] | Asymptotically normal | Wald | 0.147 | 0.195 |
| c | morphology[+]:position[−] | Asymptotically normal | Wald | 0.000 | 0.004 |
| c | animacy[−]:morphology[−]:position[−] | Asymptotically normal | Wald | 0.006 | 0.020 |
| c | animacy[−]:morphology[+]:position[−] | Asymptotically normal | Wald | 0.000 | 0.006 |
| d | chan | Asymptotically normal | Type-II Wald χ2 | 0.610 | 0.671 |
| d | animacy | Asymptotically normal | Type-II Wald | 0.179 | 0.230 |
| d | morphology | Asymptotically normal | Type-II Wald | 0.996 | 1.000 |
| d | position | Asymptotically normal | Type-II Wald | 0.964 | 0.985 |
| d | animacy:morphology | Asymptotically normal | Type-II Wald | 0.173 | 0.223 |
| d | animacy:position | Asymptotically normal | Type-II Wald | 0.182 | 0.233 |
| d | morphology:position | Asymptotically normal | Type-II Wald | 0.917 | 0.949 |
| d | animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.178 | 0.229 |
| e | chan | Asymptotically normal | Type-II Wald | 0.611 | 0.672 |
| e | index | Asymptotically normal | Type-II Wald | 0.594 | 0.655 |
| e | freq.class | Asymptotically normal | Type-II Wald | 0.991 | 0.999 |
| e | animacy | Asymptotically normal | Type-II Wald | 0.037 | 0.065 |
| e | morphology | Asymptotically normal | Type-II Wald | 0.994 | 1.000 |
| e | position | Asymptotically normal | Type-II Wald | 0.922 | 0.953 |
| e | index:freq.class | Asymptotically normal | Type-II Wald | 0.888 | 0.925 |
| e | index:animacy | Asymptotically normal | Type-II Wald | 0.178 | 0.228 |
| e | freq.class:animacy | Asymptotically normal | Type-II Wald | 0.064 | 0.099 |
| e | index:morphology | Asymptotically normal | Type-II Wald | 0.461 | 0.523 |
| e | freq.class:morphology | Asymptotically normal | Type-II Wald | 0.980 | 0.994 |
| e | animacy:morphology | Asymptotically normal | Type-II Wald | 0.169 | 0.219 |
| e | index:position | Asymptotically normal | Type-II Wald | 0.264 | 0.321 |
| e | freq.class:position | Asymptotically normal | Type-II Wald | 0.025 | 0.049 |
| e | animacy:position | Asymptotically normal | Type-II Wald | 0.071 | 0.107 |
| e | morphology:position | Asymptotically normal | Type-II Wald | 0.718 | 0.773 |
| e | index:freq.class:animacy | Asymptotically normal | Type-II Wald | 0.037 | 0.065 |
| e | index:freq.class:morphology | Asymptotically normal | Type-II Wald | 0.358 | 0.419 |
| e | index:animacy:morphology | Asymptotically normal | Type-II Wald | 0.884 | 0.922 |
| e | freq.class:animacy:morphology | Asymptotically normal | Type-II Wald | 0.753 | 0.805 |
| e | index:freq.class:position | Asymptotically normal | Type-II Wald | 0.386 | 0.448 |
| e | index:animacy:position | Asymptotically normal | Type-II Wald | 0.763 | 0.815 |
| e | freq.class:animacy:position | Asymptotically normal | Type-II Wald | 0.345 | 0.406 |
| e | index:morphology:position | Asymptotically normal | Type-II Wald | 0.269 | 0.326 |
| e | freq.class:morphology:position | Asymptotically normal | Type-II Wald | 0.146 | 0.194 |
| e | animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.129 | 0.175 |
| e | index:freq.class:animacy:morphology | Asymptotically normal | Type-II Wald | 0.259 | 0.316 |
| e | index:freq.class:animacy:position | Asymptotically normal | Type-II Wald | 0.435 | 0.497 |
| e | index:freq.class:morphology:position | Asymptotically normal | Type-II Wald | 0.414 | 0.476 |
| e | index:animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.208 | 0.262 |
| e | freq.class:animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.932 | 0.961 |
| e | index:freq.class:animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.488 | 0.550 |
| f | chan | Asymptotically normal | Type-II Wald | 0.611 | 0.672 |
| f | rel.freq.class | Asymptotically normal | Type-II Wald | 0.846 | 0.889 |
| f | freq.class | Asymptotically normal | Type-II Wald | 0.891 | 0.927 |
| f | animacy | Asymptotically normal | Type-II Wald | 0.192 | 0.244 |
| f | morphology | Asymptotically normal | Type-II Wald | 0.996 | 1.000 |
| f | position | Asymptotically normal | Type-II Wald | 0.677 | 0.734 |
| f | rel.freq.class:freq.class | Asymptotically normal | Type-II Wald | 0.837 | 0.881 |
| f | rel.freq.class:animacy | Asymptotically normal | Type-II Wald | 0.994 | 1.000 |
| f | freq.class:animacy | Asymptotically normal | Type-II Wald | 0.334 | 0.395 |
| f | rel.freq.class:morphology | Asymptotically normal | Type-II Wald | 0.084 | 0.122 |
| f | freq.class:morphology | Asymptotically normal | Type-II Wald | 0.983 | 0.996 |
| f | animacy:morphology | Asymptotically normal | Type-II Wald | 0.666 | 0.724 |
| f | rel.freq.class:position | Asymptotically normal | Type-II Wald | 0.905 | 0.939 |
| f | freq.class:position | Asymptotically normal | Type-II Wald | 0.225 | 0.280 |
| f | animacy:position | Asymptotically normal | Type-II Wald | 0.406 | 0.468 |
| f | morphology:position | Asymptotically normal | Type-II Wald | 0.978 | 0.993 |
| f | rel.freq.class:freq.class:animacy | Asymptotically normal | Type-II Wald | 0.067 | 0.102 |
| f | rel.freq.class:freq.class:morphology | Asymptotically normal | Type-II Wald | 0.276 | 0.334 |
| f | rel.freq.class:animacy:morphology | Asymptotically normal | Type-II Wald | 0.242 | 0.298 |
| f | freq.class:animacy:morphology | Asymptotically normal | Type-II Wald | 0.895 | 0.931 |
| f | rel.freq.class:freq.class:position | Asymptotically normal | Type-II Wald | 0.361 | 0.422 |
| f | rel.freq.class:animacy:position | Asymptotically normal | Type-II Wald | 0.085 | 0.124 |
| f | freq.class:animacy:position | Asymptotically normal | Type-II Wald | 0.696 | 0.752 |
| f | rel.freq.class:morphology:position | Asymptotically normal | Type-II Wald | 0.061 | 0.095 |
| f | freq.class:morphology:position | Asymptotically normal | Type-II Wald | 0.405 | 0.467 |
| f | animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.195 | 0.247 |
| f | rel.freq.class:freq.class:animacy:morphology | Asymptotically normal | Type-II Wald | 0.281 | 0.340 |
| f | rel.freq.class:freq.class:animacy:position | Asymptotically normal | Type-II Wald | 0.054 | 0.087 |
| f | rel.freq.class:freq.class:morphology:position | Asymptotically normal | Type-II Wald | 0.066 | 0.101 |
| f | rel.freq.class:animacy:morphology:position | Asymptotically normal | Type-II Wald | 0.336 | 0.397 |
“Lower” and “upper” are the bounds of the 95% confidence interval on the power estimates. No power estimates are provided for model comparisons, because it is not entirely clear which model to use as the simulation basis, especially for non-nested models. We note moreover that observed power calculations are problematic (Hoenig and Heisey, 2001) and indeed closely follows the observed significance (as implemented here: |t| > 2 or p < 0.05).
Figure 2.Time course of regression coefficients for the interaction between morphology and position (at the head noun of the NP), first calculated within and then averaged over participants (following the traditional grand-average methodology) with only the predictors shown for computational tractability. This is equivalent to the traditional difference wave (Smith and Kutas, 2015a). Note that already at word onset, the effects have begun to diverge; the effects at a given word in a naturalistic context reflect the sum of the context and word-local, complex interactions. Large variances in word length enhance this effect.
Figure 3.Time course of regression coefficients for the effect of frequency (logarithmic class), first calculated within and then averaged over participants (following the traditional grand-average methodology) with only the predictors shown for computational tractability. This is analogous to the traditional difference wave (Smith and Kutas, 2015a), but instead of the difference between binary classes represents the average difference between frequency classes, i.e., the average difference in the wave form for every order-of-magnitude reduction in frequency. Note that already at word onset, the effects have begun to diverge; the effects at a given word in a naturalistic context reflect the sum of the context and word-local, complex interactions. Large variances in word length enhance this effect.
Figure 4.Grand average plot for the upper and lower tertiles of frequency (logarithmic class). Note that already at word onset, the effects have begun to diverge; the effects at a given word in a naturalistic context reflect the sum of the context and word-local, complex interactions. Large variances in word length enhance this effect. Nonetheless, the overall effect of frequency is so large that the change overcomes the initial offsets. This is visible as the change in sign for the regression coefficients in Figure 3.
Summary of model fit for (corpus) frequency class and index (ordinal position) in the time window 300–500 ms from stimulus onset using all content wordsa
| Linear mixed model fit by maximum likelihood | ||||
|---|---|---|---|---|
| AIC | BIC | logLik | Deviance | |
| 2043327 | 2043410 | −1021655 | 2043311 | |
| Scaled residuals: | ||||
| Min | 1Q | Median | 3Q | Max |
| −24.19 | −0.49 | −0.01 | 0.49 | 12.54 |
| Random effects: | ||||
| Groups | Name | Variance | SD | |
| subj | (Intercept) | 0.04 | 0.19 | |
| Residual | 141.06 | 11.88 | ||
| Number of obs: 262392, groups: subj, 52. | ||||
| Fixed effects: | ||||
| Estimate | SE | |||
| (Intercept) | 0.037 | 0.13 | 0.28 | |
| chan[cz] | −0.029 | 0.033 | −0.89 | |
| chan[pz] | 0.13 | 0.033 | 4 | |
| index | 0.00043 | 0.00014 | 3.1 | |
| corpus | −0.02 | 0.0093 | −2.2 | |
| index:corpus | −2.7e−05 | 9.9e−06 | −2.7 | |
Summary of model fit for relative frequency class and index (ordinal position) in the time window 300–500 ms from stimulus onset using all content wordsb
| Linear mixed model fit by maximum likelihood | ||||
|---|---|---|---|---|
| AIC | BIC | logLik | Deviance | |
| 2043374 | 2043457 | −1021679 | 2043358 | |
| Scaled residuals: | ||||
| Min | 1Q | Median | 3Q | Max |
| −24.2 | −0.49 | −0.01 | 0.49 | 12.55 |
| Random effects: | ||||
| Groups | Name | Variance | SD | |
| subj | (Intercept) | 0.04 | 0.19 | |
| Residual | 141.09 | 11.88 | ||
| Number of obs: 262392, groups: subj, 52. | ||||
| Fixed effects: | ||||
| Estimate | SE | |||
| (Intercept) | 0.17 | 0.17 | 0.98 | |
| chan[cz] | −0.029 | 0.033 | −0.89 | |
| chan[pz] | 0.13 | 0.033 | 4 | |
| index | 0.00023 | 0.00018 | 1.3 | |
| relative | −0.068 | 0.028 | −2.4 | |
| index:relative | −2.5e−05 | 3e−05 | −0.86 | |
Comparison of models for corpus and relative frequency
| df | AIC | BIC | logLik | |
|---|---|---|---|---|
| m.rel.index | 8 | 2043373 | 2043457 | −1021678 |
| m.freq.index | 8 | 2043326 | 2043410 | −1021655 |
Both models yield similar fits as evidenced by log-likelihood, AIC, and BIC. Model names reflect the predictor used; ‘rel’ refers to relative frequency and ‘freq’ refers to corpus frequency.
Figure 5.Plot of effects for corpus frequency interacting with index (ordinal position in the story). Shaded areas indicate 95% confidence intervals. Light points are grand averages by participants over all trials; the corresponding lines are standard error of the (grand) mean. Index is divided into tertiles and plotted in an overlap to show the interaction. There is an increasing negativity with decreasing frequency (higher logarithmic class), which is weakly affected by position in the story.
Figure 6.Plot of effects for relative frequency interacting with index. Shaded areas indicate 95% confidence intervals. Light points are grand averages by participants over trials; the corresponding lines are standard error of the (grand) mean. Index is divided into tertiles.
Summary of model fit for linguistic cues (animacy, morphology, linear position) known to elicit N400-like effectsc
| Linear mixed model fit by maximum likelihood | ||||
|---|---|---|---|---|
| AIC | BIC | logLik | Deviance | |
| 538127 | 538273 | −269047 | 538095 | |
| Scaled residuals: | ||||
| Min | 1Q | Median | 3Q | Max |
| −18.56 | −0.5 | −0.01 | 0.49 | 10.65 |
| Random effects: | ||||
| Groups | Name | Variance | SD | |
| subj | (Intercept) | 0.15 | 0.39 | |
| Residual | 140.86 | 11.87 | ||
| Number of obs: 69108, groups: subj, 52. | ||||
| Fixed effects: | ||||
| Estimate | SE | |||
| (Intercept) | −0.15 | 0.093 | −1.6 | |
| chan[cz] | −0.05 | 0.064 | −0.78 | |
| chan[pz] | 0.16 | 0.064 | 2.5 | |
| animacy[−] | −0.0068 | 0.075 | −0.091 | |
| morphology[−] | 0.53 | 0.12 | 4.5 | |
| morphology[+] | −0.35 | 0.11 | −3.1 | |
| position[−] | −0.36 | 0.075 | −4.8 | |
| animacy[−]:morphology[−] | −0.026 | 0.12 | −0.22 | |
| animacy[−]:morphology[+] | 0.084 | 0.11 | 0.74 | |
| animacy[−]:position[−] | −0.13 | 0.075 | −1.7 | |
| morphology[−]:position[−] | 0.12 | 0.12 | 0.99 | |
| morphology[+]:position[−] | −0.37 | 0.11 | −3.2 | |
| animacy[−]:morphology[−]:position[−] | −0.022 | 0.12 | −0.19 | |
| animacy[−]:morphology[+]:position[−] | −0.091 | 0.11 | −0.8 | |
Dependent variable are single-trial means in the time window 300–500 ms from stimulus onset using only subjects and (direct) objects. For animacy and position, the coefficients are named for the dispreferred condition (note the minus sign) and represent the contrast dispreferred > mean.” Morphology also has an additional “neutral” level for ambiguous case marking, and so the coefficients represent the contrast from the respective marked conditions (note the minus and plus signs for dispreferred/unambiguous accusative and preferred/unambiguous nominative) to the (grand) mean.
Type-II Wald tests for the model presented in Table 6d
| df | Pr(> | ||
| chan | 6.66 | 2 | 0.0357 |
| animacy | 1.34 | 1 | 0.248 |
| morphology | 31.48 | 2 | <0.001 |
| position | 15.17 | 1 | <0.001 |
| animacy:morphology | 1.75 | 2 | 0.416 |
| animacy:position | 1.23 | 1 | 0.267 |
| morphology:position | 14.62 | 2 | <0.001 |
| animacy:morphology:position | 2.00 | 2 | 0.368 |
Model comparison for linguistic-cue based models, extended with (1) index and corpus frequency or (2) corpus and relative frequency
| df | AIC | BIC | logLik | Deviance | χ2 | χ2 df | Pr(>χ2) | |
|---|---|---|---|---|---|---|---|---|
| prom | 16 | 538126 | 538273 | −269047 | 538094 | |||
| prom.rel.freq | 50 | 538042 | 538499 | −268971 | 537942 | 152.68 | 34 | <0.001 |
| prom.freq.index | 52 | 538034 | 538509 | −268965 | 537930 | 11.77 | 2 | 0.00278 |
Note that the basic model is nested within both of the larger models, but the larger models are not nested and so the results of the likelihood-ratio test must be carefully interpreted. Model names reflect the predictor used; ‘rel’ refers to relative frequency and ‘freq’ refers to corpus frequency, while ‘prom’ indicates ‘prominence’, i.e. linguistic cues.
Figure 7.Interaction of position, morphology, and corpus frequency from the full sentence-feature model with index and frequency class. Shaded areas indicate 95% confidence intervals. Light gray points are grand averages by participants over all trials; the corresponding lines are standard error of the (grand) mean. Interactions with position show themselves as differences between the top and bottom rows, while interactions with morphology show themselves as differences between columns.
Figure 8.Interaction of animacy, morphology and position from the full sentence-feature model with index and frequency class. Bars indicate 95% confidence intervals. Light red points are grand averages by participants over all trials; the corresponding lines are standard error of the (grand) mean. Interactions with position show themselves as differences between the top and bottom rows, while interactions with animacy show themselves as differences between columns.
Type-II Wald tests for the clearest effects in the model combining index, (corpus) frequency, and linguistic cuese
| χ2 | df | Pr(>χ2) | |
|---|---|---|---|
| chan | 6.68 | 2 | 0.0355 |
| index | 4.94 | 1 | 0.0262 |
| corpus | 20.47 | 1 | <0.001 |
| morphology | 28.25 | 2 | <0.001 |
| position | 11.98 | 1 | <0.001 |
| index:corpus | 10.68 | 1 | 0.00108 |
| corpus:morphology | 19.64 | 2 | <0.001 |
| morphology:position | 8.85 | 2 | 0.012 |
| index:animacy:morphology | 13.21 | 2 | 0.00135 |
| corpus:animacy:morphology | 9.13 | 2 | 0.0104 |
| index:animacy:position | 8.02 | 1 | 0.00462 |
| corpus:animacy:morphology:position | 14.81 | 2 | <0.001 |
Type-II Wald tests for the clearest effects in the model combining linguistic cues with both corpus and relative frequencyf
| χ2 | df | Pr(>χ2) | |
|---|---|---|---|
| chan | 6.68 | 2 | 0.0355 |
| relative | 9.46 | 1 | 0.0021 |
| corpus | 11.49 | 1 | <0.001 |
| morphology | 34.44 | 2 | <0.001 |
| position | 6.20 | 1 | 0.0128 |
| relative:corpus | 9.65 | 1 | 0.00189 |
| relative:animacy | 24.73 | 1 | <0.001 |
| corpus:morphology | 20.13 | 2 | <0.001 |
| animacy:morphology | 7.44 | 2 | 0.0242 |
| relative:position | 10.88 | 1 | <0.001 |
| morphology:position | 21.47 | 2 | <0.001 |
| corpus:animacy:morphology | 12.40 | 2 | 0.00203 |
| corpus:animacy:position | 6.77 | 1 | 0.00926 |
| corpus:animacy:morphology:position | 11.96 | 2 | 0.00253 |