Literature DB >> 23761778

Random effects structure for testing interactions in linear mixed-effects models.

Dale J Barr1.   

Abstract

Entities:  

Year:  2013        PMID: 23761778      PMCID: PMC3672519          DOI: 10.3389/fpsyg.2013.00328

Source DB:  PubMed          Journal:  Front Psychol        ISSN: 1664-1078


× No keyword cloud information.
In a recent paper on mixed-effects models for confirmatory analysis, Barr et al. (2013) offered the following guideline for testing interactions: “one should have by-unit [subject or item] random slopes for any interactions where all factors comprising the interaction are within-unit; if any one factor involved in the interaction is between-unit, then the random slope associated with that interaction cannot be estimated, and is not needed” (p. 275). Although this guideline is technically correct, it is inadequate for many situations, including mixed factorial designs. The following new guideline is therefore proposed: models testing interactions in designs with replications should include random slopes for the highest-order combination of within-unit factors subsumed by each interaction. Designs with replications are designs where there are multiple observations per sampling unit per cell. Psychological experiments typically involve replicated observations, because multiple stimulus items are usually presented to the same subjects within a single condition. If observations are not replicated (i.e., there is only a single observation per unit per cell), random slope variance cannot be distinguished from random error variance and thus random slopes need not be included. This new guideline implies that a model testing AB in a 2 × 2 design where A is between and B within should include a random slope for B. Likewise, a model testing all two- and three- way interactions in a 2 × 2 × 2 design where A is between and B, C are within should include random slopes for B, C, and BC. The justification for the guideline comes from the logic of mixed-model ANOVA. In an ANOVA analysis of the 2 × 2 design described above, the appropriate error term for the test of AB is MS, the mean squares for the unit-by-B interaction (e.g., the subjects-by-B or items-by-B interaction). For the 2 × 2 × 2 design, the appropriate error term for ABC and BC is MS, the unit-by-BC interaction; for AB, it is MS; and for AC, it is MS. To what extent is this ANOVA logic applicable to tests of interactions in mixed-effects models? To address this question, Monte Carlo simulations were performed using R (R Core Team, 2013). Models were estimated using the lmer() function of lme4 (Bates et al., 2013), with p-values derived from model comparison (α = 0.05). The performance of mixed-effects models (in terms of Type I error and power) was assessed over two sets of simulations, one for each of two different mixed factorial designs. The first set focused on the test of the AB interaction in a 2 × 2 design with A between and B within; the second focused on the test of the ABC interaction in a 2 × 2 × 2 design with A between and B, C within. For simplicity all datasets included only a single source of random effect variance (e.g., by-subject but not by-item variance). The number of replications per cell was 4, 8, or 16. Predictors were coded using deviation (−0.5, 0.5) coding; identical results were obtained using treatment coding. In the rare case (~2%) that a model did not converge, it was removed from the analysis. Power was reported with and without adjustment for Type I error rate, using the adjustment method reported in Barr et al. (2013). For each set of simulations at each of the three replication levels, 10,000 datasets were randomly generated, each with 24 sampled units (e.g., subjects). The dependent variable was continuous and normally distributed, with all data-generating parameters drawn from uniform distributions. Fixed effects were either between −2 and −1 or between 1 and 2 (with equal probability). The error variance was fixed at 6, and the random effects variance/covariance matrix had variances ranging from 0 to 3 and covariances corresponding to correlations ranging from −0.9 to 0.9. For the 2 × 2 design, mixed-effects models with two different random effects structures were fit to the data: (1) by-unit random intercept but no random slope for B (“RI”), and (2) a maximal model including a slope for B in addition to the random intercept (“Max”). For comparison purposes, a test of the interaction using mixed-model ANOVA (“AOV”) was performed using R's aov() function. Results for the test of the AB interaction in the 2 × 2 design are in Tables 1 and 2. As expected, the Type I error rate for ANOVA and maximal models were very close to the stated α-level of 0.05. In contrast, models lacking the random slope for B (“RI”) showed unacceptably high Type I error rates, increasing with the number of replications. Adjusted power was comparable for all three types of analyses (Table 2), albeit with a slight overall advantage for RI.
Table 1

Type I error rate for the test of .

RepsRIMaxAOV
40.1700.0630.050
80.2670.0640.052
160.3950.0630.049
Table 2

Power for the test of .

RepsRIMaxAOV
40.495 (0.704)0.469 (0.507)0.471 (0.471)
80.594 (0.847)0.558 (0.604)0.558 (0.565)
160.649 (0.922)0.619 (0.657)0.619 (0.619)
Type I error rate for the test of . Power for the test of . The test of the ABC interaction in the 2 × 2 design was evaluated under four different random effects structures, all including a random intercept but varying in which random slopes were included. The models were: (1) random intercept only (“RI”); (2) slopes for B and C but not for BC (“nBC”); (3) slope for BC but not for B nor C (“BC”); and (4) maximal (slopes for B, C, and BC; “Max”). For the test of the ABC interaction, ANOVA and maximal models both yielded acceptable Type I performance (Table 3); the model with the BC slope alone (“BC”) was comparably good. However, the model excluding the BC slope had unacceptably high Type I error rates; surprisingly, omitting this random slope may be even worse than a random-intercept-only model. Adjusted power was comparable across all analyses (Table 4).
Table 3

Type I error rate for test of .

RepsRInBCBCMaxAOV
40.0690.1020.0500.0460.046
80.1240.1590.0590.0570.051
160.1970.2410.0630.0620.052
Table 4

Power for test of .

RepsRInBCBCMaxAOV
40.422 (0.478)0.418 (0.546)0.396 (0.397)0.412 (0.412)0.405 (0.405)
80.562 (0.711)0.567 (0.753)0.552 (0.575)0.564 (0.578)0.554 (0.557)
160.649 (0.866)0.651 (0.889)0.653 (0.690)0.657 (0.687)0.656 (0.661)
Type I error rate for test of . Power for test of . To summarize: when testing interactions in mixed designs with replications, it is critical to include the random slope corresponding to the highest-order combination of within-subject factors subsumed by each interaction of interest. It is just as important to attend to this guideline when one seeks to simplify a non-converging model as when one is deciding on what structure to fit in the first place. Failing to include the critical slope in the test of an interaction can yield unacceptably high Type I error rates. Indeed, a model that includes all relevant random slopes except for the single critical slope may perform just as badly as (or possibly even worse than) a random-intercepts-only model, even though such a model is nearly maximal. Finally, note that including only the critical random slope in the model was sufficient to obtain acceptable performance, as illustrated by the “BC” model in the 2 × 2 × 2 design. Although the current simulations only considered interactions between categorical variables, the guideline applies whenever there are replicated observations, regardless of what types of variables are involved in an interaction (e.g., continuous only, or a mix of categorical and continuous). For example, consider a design with two independent groups of subjects, where there are observations at multiple time points for each subject. When testing the time-by-group interaction, the model should include a random slope for the continuous variable of time; if time is modeled using multiple terms of a polynomial, then there should be a slope for each of the terms in the polynomial that interact with group. For instance, if the effect of time is modeled as Y = β0 + β1 t + β2 t2 and the interest is in whether the β0 and β1 parameters vary across group, then the random effects structure should include slopes for both the group-by-t and group-by-t2 interactions.
  1 in total

1.  Random effects structure for confirmatory hypothesis testing: Keep it maximal.

Authors:  Dale J Barr; Roger Levy; Christoph Scheepers; Harry J Tily
Journal:  J Mem Lang       Date:  2013-04       Impact factor: 3.059

  1 in total
  112 in total

1.  Testosterone reduces the threat premium in competitive resource division.

Authors:  Shawn N Geniole; Valentina Proietti; Brian M Bird; Triana L Ortiz; Pierre L Bonin; Bernard Goldfarb; Neil V Watson; Justin M Carré
Journal:  Proc Biol Sci       Date:  2019-05-29       Impact factor: 5.349

2.  Revisiting the incremental effects of context on word processing: Evidence from single-word event-related brain potentials.

Authors:  Brennan R Payne; Chia-Lin Lee; Kara D Federmeier
Journal:  Psychophysiology       Date:  2015-08-27       Impact factor: 4.016

3.  The effects of simulated acuity and contrast sensitivity impairments on detection of pedestrian hazards in a driving simulator.

Authors:  Garrett Swan; Maha Shahin; Jacqueline Albert; Joseph Herrmann; Alex R Bowers
Journal:  Transp Res Part F Traffic Psychol Behav       Date:  2019-07

4.  The Activation of Embedded Words in Spoken Word Recognition.

Authors:  Xujin Zhang; Arthur G Samuel
Journal:  J Mem Lang       Date:  2015 February-April       Impact factor: 3.059

5.  Catecholaminergic challenge uncovers distinct Pavlovian and instrumental mechanisms of motivated (in)action.

Authors:  Jennifer C Swart; Monja I Froböse; Jennifer L Cook; Dirk Em Geurts; Michael J Frank; Roshan Cools; Hanneke Em den Ouden
Journal:  Elife       Date:  2017-05-15       Impact factor: 8.140

6.  Primate archaeology reveals cultural transmission in wild chimpanzees (Pan troglodytes verus).

Authors:  Lydia V Luncz; Roman M Wittig; Christophe Boesch
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-11-19       Impact factor: 6.237

7.  Common neural basis of motor sequence learning and word recognition and its relation with individual differences in reading skill.

Authors:  Yi-Hui Hung; Stephen J Frost; Peter Molfese; Jeffrey G Malins; Nicole Landi; W Einar Mencl; Jay G Rueckl; Louisa Bogaerts; Kenneth R Pugh
Journal:  Sci Stud Read       Date:  2018-03-26

8.  Thermal facial reactivity patterns predict social categorization bias triggered by unconscious and conscious emotional stimuli.

Authors:  Giorgia Ponsi; Maria Serena Panasiti; Giulia Rizza; Salvatore Maria Aglioti
Journal:  Proc Biol Sci       Date:  2017-08-30       Impact factor: 5.349

9.  Distance-decay effect in stone tool transport by wild chimpanzees.

Authors:  Lydia V Luncz; Tomos Proffitt; Lars Kulik; Michael Haslam; Roman M Wittig
Journal:  Proc Biol Sci       Date:  2016-12-28       Impact factor: 5.349

10.  Contributions of nonlinguistic task-shifting to language control in bilingual children.

Authors:  Megan Gross; Margarita Kaushanskaya
Journal:  Biling (Camb Engl)       Date:  2016-10-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.