| Literature DB >> 34128810 |
Simon B Goldberg1,2, Daniel M Bolt3, Richard J Davidson2,4,5.
Abstract
BACKGROUND: Missing data are common in mobile health (mHealth) research. There has been little systematic investigation of how missingness is handled statistically in mHealth randomized controlled trials (RCTs). Although some missing data patterns (ie, missing at random [MAR]) may be adequately addressed using modern missing data methods such as multiple imputation and maximum likelihood techniques, these methods do not address bias when data are missing not at random (MNAR). It is typically not possible to determine whether the missing data are MAR. However, higher attrition in active (ie, intervention) versus passive (ie, waitlist or no treatment) conditions in mHealth RCTs raise a strong likelihood of MNAR, such as if active participants who benefit less from the intervention are more likely to drop out.Entities:
Keywords: differential attrition; missing data; mobile phone; randomized controlled trial; sensitivity analysis; statistical methodology
Mesh:
Year: 2021 PMID: 34128810 PMCID: PMC8277392 DOI: 10.2196/26749
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Attrition rates and study design characteristics.
| Study | Txa ITTb | Tx drop | WLc ITT | WL drop | Diffd | Methode | Multiple imputation | Maximum likelihood |
| Bakker et al [ | 234 | 146 | 78 | 25 | N/Af | ANOVAg | Yes | No |
| Bidargaddi et al [ | 192 | 106 | 195 | 88 | Yes, higher in active | Yes | No | |
| Bostock et al [ | 128 | 5 | 110 | 4 | N/A | ANOVA | No | No |
| Carissoli et al [ | 20 | 0 | 18 | 0 | N/A | ANOVA | N/A | N/A |
| Champion et al [ | 38 | 9 | 36 | 3 | No | MLMh | Yes | Yes |
| Enock et al [ | 206 | 38 | 36 | 0 | N/A | MLM | No | Yes |
| Faurholt-Jepsen et al [ | 39 | 6 | 39 | 5 | N/A | MLM | No | Uncleari |
| Hall et al [ | 76 | 34 | 25 | 13 | N/A | MLM | No | Unclear |
| Horsch et al [ | 74 | 29 | 77 | 15 | N/A | MLM | Yes | Unclear |
| Ivanova et al [ | 101 | 20 | 51 | 4 | N/A | MLM | No | Yes |
| Kahn et al [ | 80 | 1 | 80 | 0 | N/A | No | No | |
| Krafft et al [ | 67 | 15 | 31 | 5 | N/A | MLM | No | Yes |
| Kristjansdottir et al [ | 70 | 23 | 70 | 33 | N/A | No | No | |
| Kuhn et al [ | 62 | 11 | 58 | 6 | No | ANOVA | Yes | No |
| Lee and Jung [ | 102 | 25 | 104 | 18 | N/A | ANOVA | No | No |
| Levin et al [ | 12 | 0 | 11 | 0 | N/A | MLM | No | Unclear |
| Levin et al [ | 59 | 13 | 28 | 5 | No | MLM | No | Unclear |
| Lüdtke et al [ | 45 | 10 | 45 | 6 | No | ANOVA | Yes | No |
| Lukas and Berking [ | 16 | 2 | 15 | 2 | N/A | ANOVA | No | No |
| Ly et al [ | 36 | 3 | 37 | 2 | N/A | MLM | No | Yes |
| Ly et al [ | 14 | 0 | 14 | 0 | N/A | MLM | No | Yes |
| Marx [ | 46 | 2 | 50 | 0 | N/A | ANOVA | No | No |
| Miner et al [ | 25 | 2 | 24 | 3 | N/A | ANOVA | Yes | No |
| Moëll et al [ | 29 | 3 | 28 | 1 | N/A | ANOVA | No | No |
| Oh et al [ | 39 | 1 | 20 | 4 | N/A | ANOVA | No | No |
| Pham et al [ | 31 | 14 | 32 | 7 | N/A | ANOVA | No | No |
| Proudfoot et al [ | 242 | 116 | 230 | 32 | Yes, higher in active | MLM | Yes | Yes |
| Roepke et al [ | 190 | 152 | 93 | 57 | Yes, higher in active | MLM | No | Yes |
| Rosen et al [ | 57 | 17 | 55 | 7 | Yes, higher in active | MLM | No | Yes |
| Schlosser et al [ | 22 | 3 | 21 | 0 | N/A | ANOVA | No | No |
| Stjernsward and Hansson [ | 196 | 60 | 202 | 42 | N/A | ANOVA | Yes | No |
| Stolz et al [ | 60 | 18 | 30 | 7 | No | MLM | Yes | yes |
| Tighe et al [ | 31 | 2 | 30 | 0 | N/A | ANOVA | No | No |
| van Emmerik et al [ | 191 | 111 | 186 | 45 | Yes, higher in active | MLM | Yes | Unclear |
| Versluis et al [ | 46 | 9 | 42 | 3 | Yes, higher in active | MLM | No | Unclear |
| Yang et al [ | 45 | 3 | 43 | 4 | N/A | ANOVA | No | No |
aTx: active treatment conditions.
bITT: intention-to-treat sample size; drop=attrition at posttreatment assessment.
cWL: waitlist (or no treatment control condition).
dWhether differential attrition was tested and, if so, whether a between-group difference was detected.
ePrimary data analysis method.
fN/A: not applicable (because of lack of missing data or differential attrition test not conducted).
gANOVA: analysis of variance or related method (eg, analysis of covariance).
hMLM: multilevel model.
iUnclear whether multiple imputation estimator was used.
Figure 1Forest plot displaying results of the meta-analysis. Effects sizes are in log-odds units, with larger values indicating higher attrition in active conditions relative to passive conditions. The size of points indicates relative weight in the meta-analysis (ie, inverse variance). RE: random effects.
Figure 2Results of meta-regression indicating that larger studies are associated with higher rates of differential attrition (ie, higher attrition in active vs passive conditions). Points are displayed relative to their weight in the meta-regression model (ie, inverse variance).
Results of pattern-mixture model sensitivity analysis based on multiple imputationa.
| Model | Estimateb | |
| MARc | −0.34 | .002 |
| 0.20 | −0.31 | .004 |
| 0.50 | −0.28 | .01 |
| 0.80 | −0.24 | .03 |
| 1.10 | −0.20 | .08 |
| 1.40 | −0.17 | .16 |
aModels are based on varying assumptions regarding the meaning of missingness. Multiply imputed posttest values based on 100 imputations are offset [23] by varying amounts (ie, 0.20, 0.50, and residual SD).
bCoefficient for active group status (vs waitlist) predicting posttest distress scores controlling for pretest distress scores pooled across imputed data sets.
cMAR: missing at random (with no offset applied to posttest values).
Figure 3Pre- and posttreatment scores for active and passive conditions with varying constant offset parameters added to multiply imputed values for missing outcomes under conditions of missing not at random (ie, Missing). Values are in z-score units, scaled by distress at baseline (mean 0, SD 1). Panels illustrate trajectories with offsets ranging from 0.2 to 1.4 residual SD. The missing at random panel represents values derived using multiple imputation with no offset applied. MAR: missing at random; WL: waitlist.
Figure 4Pre- and posttreatment scores for active and passive conditions under varying missing not at random conditions using fixed-value replacement of missings. Pretreatment values represent z-scaled distress at baseline (mean 0, SD 1). Posttreatment values vary across plots. For Comp Raw, posttreatment values are posttreatment distress scaled based on baseline distress. Subsequent plots display residualized change scores z-transformed at posttreatment to aid in visual interpretation of relative, between-group pre-post change. Comp Resid computed posttreatment as baseline plus residualized change scores for completers only. Worst Resid replaced missing posttreatment Comp Resid values with the lowest improvement in distress. Subsequent figures (0.2, 0.5, 0.8) replaced missing posttreatment Comp Resid values with values 0.2, 0.5, and 0.8 SD worse than the mean residual. Comp: completer; Resid: residualized change; WL: waitlist.
Results of fixed-value replacement sensitivity analysis.
| Group and model | Sample size, n (%) | Mean rank (SD) | SE | ||
|
| |||||
|
| Compb | 91 (39.9) | 69.11 (43.01) | 4.51 | <.001 |
|
| Worstc | 228 (100) | 178.1 (93.05) | 6.16 | .08 |
|
| 0.20d | 228 (100) | 161.89 (83.25) | 5.51 | .004 |
|
| 0.50d | 228 (100) | 165.19 (84.25) | 5.58 | .05 |
|
| 0.80d | 228 (100) | 168.52 (86.01) | 5.70 | .32 |
|
| |||||
|
| Comp | 67 (58.3) | 93.61 (45.9) | 5.61 | N/Ae |
|
| Worst | 115 (100) | 159.9 (85.62) | 7.98 | N/A |
|
| 0.20d | 115 (100) | 192.05 (102.27) | 9.54 | N/A |
|
| 0.50d | 115 (100) | 185.5 (102.26) | 9.54 | N/A |
|
| 0.80d | 115 (100) | 178.9 (100.34) | 9.36 | N/A |
aP value from Wilcoxon signed-rank test comparing active and passive conditions across varying missingness assumptions.
bComp: completer sample.
cWorst: worst-case scenario, which assumed missing values are equivalent to the worst outcome (ie, smallest change in distress).
d0.20, 0.50, 0.80: missing values assumed to be 0.20, 0.50, or 0.80 SDs worse than the mean residualized change score.
eN/A: not applicable.
Figure 5Results of Wilcoxon signed-rank test using a fixed-value replacement sensitivity analysis across varying missing not at random conditions. A lower mean rank indicates larger relative decreases in distress. Comp: completer sample; Worst: worst-case scenario which assumed missing values are equivalent to the worst outcome (ie, the smallest change in distress); 0.2, 0.5, 0.8: missing values assumed to be 0.2, 0.5, or 0.8 SD worse than the mean residualized change score; error bars: 1.96×SE; WL: waitlist. *P<.05, **P<.01, ***P<.001.