| Literature DB >> 34027594 |
Simon Grund1,2, Oliver Lüdtke3,4, Alexander Robitzsch3,4.
Abstract
Multilevel models often include nonlinear effects, such as random slopes or interaction effects. The estimation of these models can be difficult when the underlying variables contain missing data. Although several methods for handling missing data such as multiple imputation (MI) can be used with multilevel data, conventional methods for multilevel MI often do not properly take the nonlinear associations between the variables into account. In the present paper, we propose a sequential modeling approach based on Bayesian estimation techniques that can be used to handle missing data in a variety of multilevel models that involve nonlinear effects. The main idea of this approach is to decompose the joint distribution of the data into several parts that correspond to the outcome and explanatory variables in the intended analysis, thus generating imputations in a manner that is compatible with the substantive analysis model. In three simulation studies, we evaluate the sequential modeling approach and compare it with conventional as well as other substantive-model-compatible approaches to multilevel MI. We implemented the sequential modeling approach in the R package mdmb and provide a worked example to illustrate its application.Entities:
Keywords: Interaction effects; Missing data; Multilevel analysis; Multiple imputation
Mesh:
Year: 2021 PMID: 34027594 PMCID: PMC8613130 DOI: 10.3758/s13428-020-01530-0
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Fig. 1Schematic representation of the substantive analysis models in study 1 (a), study 2 (b), and study 3 (c). =
Simulated conditions in studies 1, 2, and 3
| Design factor | Study 1 | Study 2 | Study 3 |
|---|---|---|---|
| Level 1 sample size | 10, 20 | 10, 20 | 20 |
| Level 2 sample size | 50, 100, 200, 500, 1000 | 50, 100, 200, 500, 1000 | 1000 |
| ICCs of | .10, .20, .50 | .20 | .20 |
| Correlation of | .20 | .20 | |
| Total | .50 | ||
| Nonlinear proportion of | 0, .25, .50, .75, 1 | ||
| Effect of | .40 | .15 | |
| Effect of | .40 | ||
| Effect of | 0 | ||
| Effect of | .20 | .20 | .15 |
| Effect of | .20 | .15 | |
| Effect of | .20 | ||
| Effect of | 0 | ||
| Effect of | .15 | ||
| Effect of | .15 | ||
| Proportion missing | 30% | 30% | 30% |
| Effect of | 0, .35, .70 | 0, .35, .70 | 0, .35, .70 |
Note. ICC = intraclass correlation
Fig. 2Bias (in %) of the estimated regression coefficients for the overall effect of x (β1), the CLI (β3), and the slope variance () in conditions with small samples at level 1 (n = 10) and moderate ICCs (ρ = ρ = .20) in study 1. J = level 2 sample size; CD = complete data; LD = listwise deletion; FCS = multilevel MI (fully conditional specification); JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.1% to 1.5% (median 0.4%)
Bias (in %) of the estimated regression coefficient for the overall effect of x (β1), the CLI (β3), and the slope variance () in study 1
| MCAR ( | MAR ( | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Par. | CD | LD | FCS | JOMO | BLIMP | MDMB | CD | LD | FCS | JOMO | BLIMP | MDMB | ||
| 10 | 200 | 0.1 | 0.1 | − 4.0 | 0.0 | − 0.1 | 0.1 | − 0.1 | − 8.7 | − 2.3 | − 0.9 | − 0.1 | ||
| 0.5 | 0.3 | 0.3 | 0.2 | 0.2 | 0.4 | − 1.8 | − 0.4 | 0.0 | ||||||
|
| − 0.2 | − 0.4 | − 1.0 | 0.2 | − 3.7 | − 0.1 | − 3.0 | 2.7 | − 4.2 | |||||
| 1000 | − 0.0 | 0.0 | − 3.4 | − 0.0 | − 0.1 | 0.1 | 0.1 | − 7.0 | − 1.9 | − 0.4 | 0.2 | |||
| 0.1 | − 0.1 | − 0.1 | − 0.1 | − 0.2 | − 0.6 | − 2.7 | − 1.5 | − 0.9 | ||||||
|
| 0.1 | 0.4 | 0.3 | 0.5 | − 1.8 | − 0.1 | − 3.1 | 1.8 | − 3.2 | |||||
| 20 | 200 | 0.6 | 0.6 | − 3.4 | 0.6 | 0.5 | 0.6 | − 0.1 | − 7.6 | − 2.0 | − 0.9 | − 0.4 | ||
| − 0.2 | − 0.2 | − 9.9 | − 0.3 | − 0.2 | − 0.3 | 0.9 | − 1.1 | − 0.9 | 0.3 | |||||
|
| 0.1 | 0.0 | − 0.3 | 0.1 | − 0.5 | − 0.8 | − 4.0 | 2.1 | − 1.0 | |||||
| 1000 | − 0.2 | − 0.2 | − 3.9 | − 0.2 | − 0.2 | − 0.1 | − 0.0 | − 6.8 | − 1.8 | − 0.7 | − 0.1 | |||
| 0.1 | 0.2 | − 9.6 | 0.2 | 0.1 | 0.1 | 0.0 | − 2.0 | − 1.9 | − 0.6 | |||||
|
| 0.1 | 0.1 | 0.1 | 0.2 | − 0.3 | − 0.3 | − 3.2 | 2.7 | − 0.3 | |||||
| 10 | 200 | − 0.4 | − 0.4 | − 4.3 | − 0.4 | − 0.6 | − 0.3 | 0.3 | − 8.0 | − 1.9 | − 0.6 | 0.2 | ||
| − 0.1 | − 0.3 | − 0.4 | − 0.5 | − 0.5 | − 0.1 | − 2.4 | − 1.1 | − 0.8 | ||||||
|
| 0.2 | − 0.3 | − 1.0 | 0.3 | − 4.0 | − 0.1 | − 2.8 | 3.5 | − 4.6 | |||||
| 1000 | 0.2 | 0.1 | − 3.2 | 0.1 | 0.1 | 0.3 | 0.1 | − 7.0 | − 2.0 | − 0.6 | 0.2 | |||
| 0.1 | 0.1 | 0.1 | 0.1 | 0.1 | 0.3 | − 1.8 | − 0.5 | − 0.1 | ||||||
|
| − 0.0 | − 0.2 | − 0.4 | − 0.1 | − 4.1 | 0.1 | − 2.6 | 2.1 | − 5.0 | |||||
| 20 | 200 | 0.1 | 0.0 | − 3.9 | 0.0 | − 0.0 | 0.1 | 0.1 | − 7.5 | − 1.9 | − 1.1 | − 0.5 | ||
| 0.2 | 0.0 | − 9.4 | 0.1 | 0.2 | 0.1 | 0.7 | − 1.7 | − 1.7 | − 0.7 | |||||
|
| − 0.2 | − 0.4 | − 0.6 | 0.1 | − 1.4 | − 0.3 | − 2.6 | 4.4 | − 0.0 | |||||
| 1000 | − 0.0 | − 0.0 | − 3.7 | − 0.0 | − 0.0 | 0.1 | − 0.1 | − 7.0 | − 2.0 | − 1.1 | − 0.4 | |||
| 0.1 | 0.1 | − 9.3 | 0.2 | 0.1 | 0.1 | − 0.2 | − 2.3 | − 2.4 | − 1.3 | |||||
|
| 0.1 | 0.1 | 0.1 | 0.2 | − 1.0 | − 0.3 | − 2.6 | 3.8 | − 0.8 | |||||
| 10 | 200 | 0.5 | 0.5 | − 3.6 | 0.3 | 0.3 | 0.1 | 0.1 | − 7.8 | − 2.2 | − 1.2 | − 0.5 | ||
| 0.2 | 0.2 | − 10.0 | 0.3 | 0.3 | − 0.5 | − 0.1 | − 3.2 | − 2.2 | − 3.0 | |||||
|
| − 0.3 | 0.0 | − 1.0 | 0.3 | 0.1 | − 0.2 | 5.6 | |||||||
| 1000 | 0.0 | − 0.1 | − 3.4 | − 0.0 | − 0.0 | − 0.1 | − 0.0 | − 7.0 | − 2.1 | − 0.9 | − 0.3 | |||
| − 0.0 | − 0.1 | − 0.0 | − 0.1 | − 0.9 | 0.2 | − 2.1 | − 1.1 | − 2.1 | ||||||
|
| − 0.0 | − 0.2 | − 0.1 | 0.3 | 0.2 | − 0.2 | 3.7 | |||||||
| 20 | 200 | − 0.3 | − 0.2 | − 4.3 | − 0.3 | − 0.3 | − 0.5 | 0.1 | − 7.3 | − 2.6 | − 2.1 | − 3.1 | ||
| − 0.4 | − 0.1 | − 9.5 | − 0.2 | − 0.2 | − 0.9 | − 0.3 | − 4.1 | − 5.2 | − 6.9 | |||||
|
| 0.8 | 0.9 | 0.3 | 1.0 | − 4.2 | − 0.3 | 2.0 | 8.7 | − 3.3 | |||||
| 1000 | 0.0 | 0.1 | − 3.8 | 0.1 | 0.1 | − 0.2 | 0.0 | − 6.8 | − 2.4 | − 1.7 | − 2.7 | |||
| 0.1 | 0.1 | − 9.4 | 0.1 | 0.0 | − 0.6 | 0.3 | − 3.0 | − 3.5 | − 5.3 | |||||
|
| 0.0 | 0.1 | − 0.0 | 0.1 | − 5.0 | 0.1 | 2.0 | 7.3 | − 4.1 | |||||
Note. Biases larger than ± 10% are printed in bold. n = level 1 sample size; J = level 2 sample size; ρ = ρ = intraclass correlations of x and y; CD = complete data; LD = listwise deletion; FCS = multilevel MI (fully conditional specification); JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.1% to 0.8% (median 0.2%)
Coverage of the 95% confidence intervals for the CLI (β3) in conditions with moderate ICCs (ρ = ρ = .20) in study 1
| MCAR ( | MAR ( | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CD | LD | FCS | JOMO | BLIMP | MDMB | CD | LD | FCS | JOMO | BLIMP | MDMB | ||
| 10 | 50 | 94.0 | 93.7 | 94.5 | 94.0 | 94.3 | 93.6 | 93.8 | 94.2 | 94.1 | 94.7 | ||
| 100 | 96.0 | 96.0 | 94.2 | 95.3 | 95.8 | 95.1 | 94.1 | 95.1 | 93.7 | 93.5 | |||
| 200 | 93.6 | 93.4 | 93.1 | 93.5 | 93.4 | 95.1 | 96.6 | 95.3 | 94.8 | ||||
| 500 | 94.6 | 94.7 | 94.6 | 94.9 | 94.0 | 94.8 | 97.4 | 94.7 | 94.1 | ||||
| 1000 | 94.8 | 95.2 | 96.1 | 95.4 | 95.5 | 96.4 | 99.1 | 95.6 | 95.3 | ||||
| 20 | 50 | 93.5 | 92.9 | 92.6 | 93.3 | 93.6 | 93.1 | 93.8 | 95.4 | 95.1 | 94.1 | ||
| 100 | 94.5 | 94.1 | 94.8 | 93.9 | 94.5 | 94.9 | 96.1 | 95.5 | 94.5 | ||||
| 200 | 95.2 | 95.9 | 96.1 | 96.2 | 95.9 | 93.7 | 95.9 | 93.2 | 93.8 | ||||
| 500 | 95.2 | 94.9 | 94.8 | 94.9 | 94.2 | 96.0 | 98.1 | 94.1 | 95.4 | ||||
| 1000 | 94.7 | 94.6 | 94.8 | 94.6 | 94.2 | 94.9 | 98.7 | 93.9 | |||||
Note. Coverage rates below 92.5% are printed in bold. n = level 1 sample size; J = level 2 sample size; CD = complete data; LD = listwise deletion; FCS = multilevel MI (fully conditional specification); JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.2% to 1.1% (median 0.5%)
Fig. 3Bias (in %) of the estimated regression coefficients for the effect of x at level 1 (β1), the CLI (β4), and the slope variance () in conditions with small samples at level 1 (n = 10) and moderate ICCs (ρ = ρ = .20) in study 2. J = level 2 sample size; CD = complete data; LD = listwise deletion; FCS = multilevel MI (fully conditional specification); JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.1% to 1.5% (median 0.5%)
Coverage of the 95% confidence intervals for the cross-level-interaction (CLI, β4) in conditions with moderate ICCs (ρ = ρ = .20) in study 2
| MCAR ( | MAR ( | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CD | LD | FCS | JOMO | BLIMP | MDMB | CD | LD | FCS | JOMO | BLIMP | MDMB | ||
| 10 | 50 | 93.5 | 93.0 | 94.7 | 93.4 | 93.6 | 93.8 | 94.5 | 94.5 | 93.5 | 93.9 | 94.3 | |
| 100 | 95.3 | 94.1 | 94.6 | 94.2 | 94.5 | 95.0 | 95.1 | 94.6 | 94.7 | ||||
| 200 | 95.2 | 95.1 | 95.0 | 95.6 | 94.5 | 95.8 | 95.9 | 95.4 | 95.1 | ||||
| 500 | 95.6 | 94.9 | 94.6 | 94.1 | 95.1 | 95.3 | 96.1 | 95.5 | 96.2 | ||||
| 1000 | 95.5 | 95.5 | 94.5 | 95.6 | 95.4 | 94.5 | 94.7 | 93.4 | 94.4 | ||||
| 20 | 50 | 95.2 | 93.9 | 93.7 | 94.2 | 94.1 | 94.7 | 94.5 | 93.4 | 95.1 | 94.8 | 95.0 | |
| 100 | 94.2 | 94.2 | 93.5 | 94.7 | 94.5 | 94.4 | 94.8 | 95.1 | 94.7 | 94.6 | |||
| 200 | 95.9 | 95.3 | 95.3 | 95.1 | 94.9 | 94.1 | 96.4 | 94.7 | 94.7 | ||||
| 500 | 95.5 | 95.5 | 95.3 | 95.1 | 95.4 | 95.6 | 96.9 | 94.6 | 94.8 | ||||
| 1000 | 94.2 | 94.1 | 93.7 | 94.1 | 93.4 | 96.0 | 97.8 | 95.7 | 95.7 | ||||
Note. Coverage rates below 92.5% are printed in bold. n = level 1 sample size; J = level 2 sample size; CD = complete data; LD = listwise deletion; FCS = multilevel MI (fully conditional specification); JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.2% to 1.2% (median 0.5%)
Fig. 4Bias (in %) of the estimated regression coefficients for the linear effect of x (β1), the linear effect of z (β2), the CLI (β3), the quadratic effect of x (β4), and the quadratic effect of z (β5) in study 3. w = relative weight of the nonlinear effect of z on x; CD = complete data; LD = listwise deletion; JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling). The Monte Carlo error ranged from 0.1% to 0.4% (median 0.2%)
Results from the empirical example
| LD | JOMO | BLIMP | MDMB | |||||
|---|---|---|---|---|---|---|---|---|
| Parameter | Est. | Est. | Est. | Est. | ||||
| Intercept ( | -0.048 | 0.054 | -0.171** | 0.053 | -0.173** | 0.053 | -0.202*** | 0.052 |
| Mood ( | 0.576*** | 0.128 | 0.737*** | 0.138 | 0.715*** | 0.145 | 0.621*** | 0.138 |
| Stress ( | -0.084* | 0.042 | -0.078 | 0.044 | -0.074 | 0.045 | -0.076 | 0.042 |
| Mood × Stress ( | 0.381** | 0.144 | 0.470** | 0.152 | 0.489** | 0.166 | 0.428** | 0.151 |
| Mood ( | 1.933*** | 0.223 | 1.568*** | 0.235 | 1.650*** | 0.233 | 1.579*** | 0.216 |
| Stress ( | 0.183 | 0.201 | 0.242 | 0.204 | 0.244 | 0.206 | 0.082 | 0.204 |
| Gender ( | 0.570*** | 0.069 | 0.559*** | 0.068 | 0.553*** | 0.068 | 0.548*** | 0.067 |
| Age ( | 0.014*** | 0.004 | 0.015*** | 0.004 | 0.015*** | 0.004 | 0.015*** | 0.004 |
| Mood × Stress ( | 0.780** | 0.254 | 0.688** | 0.257 | 0.711** | 0.258 | 0.645* | 0.253 |
| Mood × Gender ( | -0.015 | 0.264 | 0.086 | 0.268 | 0.079 | 0.267 | 0.058 | 0.266 |
| Stress × Gender ( | 1.728*** | 0.393 | 1.941*** | 0.473 | 2.321*** | 0.479 | 1.604*** | 0.468 |
| Gender × Age ( | -0.001 | 0.005 | -0.000 | 0.005 | -0.000 | 0.005 | 0.000 | 0.005 |
| Mood × Stress ( | 0.275* | 0.136 | 0.422** | 0.152 | 0.435** | 0.164 | 0.408** | 0.154 |
| Mood × Gender ( | -0.075 | 0.056 | -0.098 | 0.059 | -0.085 | 0.061 | -0.086 | 0.056 |
| Stress × Mood ( | 0.549* | 0.262 | 0.611* | 0.308 | 0.724* | 0.323 | 0.481 | 0.305 |
| Stress × Gender ( | 0.196 | 0.116 | 0.297* | 0.139 | 0.114 | 0.138 | -0.130 | 0.108 |
| Intercept ( | 1.535 | 1.390 | 1.396 | 1.398 | ||||
| Slope (Mood, | 1.535 | 1.399 | 1.801 | 1.106 | ||||
| Residual ( | 1.659 | 1.797 | 1.785 | 1.823 | ||||
∗p < .05, ∗∗p < .01, ∗∗∗p < .001 (two-sided)
For the cross-level effects, level 1 variables are named first (before the “×”) and level 2 variables second (after the “×”)
Note. LD = listwise deletion; JOMO = substantive-model-compatible multilevel MI (joint modeling); BLIMP = substantive-model-compatible multilevel MI (fully conditional specification); MDMB = substantive-model-compatible multilevel MI (sequential modeling)