| Literature DB >> 28557022 |
Anower Hossain1,2, Karla DiazOrdaz1, Jonathan W Bartlett3.
Abstract
Missing outcomes are a commonly occurring problem for cluster randomised trials, which can lead to biased and inefficient inference if ignored or handled inappropriately. Two approaches for analysing such trials are cluster-level analysis and individual-level analysis. In this study, we assessed the performance of unadjusted cluster-level analysis, baseline covariate-adjusted cluster-level analysis, random effects logistic regression and generalised estimating equations when binary outcomes are missing under a baseline covariate-dependent missingness mechanism. Missing outcomes were handled using complete records analysis and multilevel multiple imputation. We analytically show that cluster-level analyses for estimating risk ratio using complete records are valid if the true data generating model has log link and the intervention groups have the same missingness mechanism and the same covariate effect in the outcome model. We performed a simulation study considering four different scenarios, depending on whether the missingness mechanisms are the same or different between the intervention groups and whether there is an interaction between intervention group and baseline covariate in the outcome model. On the basis of the simulation study and analytical results, we give guidance on the conditions under which each approach is valid.Entities:
Keywords: baseline covariate-dependent missingness; cluster randomised trials; complete records analysis; missing binary outcome; multiple imputation
Mesh:
Year: 2017 PMID: 28557022 PMCID: PMC5518290 DOI: 10.1002/sim.7334
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Simulation results for risk difference (RD). The columns represent the four scenarios considered in the simulation studies. The first and second rows represent the average estimates of RD and coverage rates for nominal 95% confidence interval, respectively, using unadjusted cluster‐level analysis. The third and fourth rows represent the similar estimates using adjusted cluster‐level analysis. Results are shown for complete records analysis (•) and multilevel multiple imputation ( ) over 1000 simulation runs. The lines (—) correspond to the true value.
Figure 2Simulation results for risk ratio (RR). The columns represent the four scenarios considered in the simulation studies. The first and second rows represent the average estimates of log(RR) and coverage rates for nominal 95% confidence interval, respectively, using unadjusted cluster‐level analysis. The third and fourth rows represent the similar estimates using adjusted cluster‐level analysis. Results are shown for complete records analysis (•) and multilevel multiple imputation ( ) over 1000 simulation runs. The lines (—) correspond to the true value.
Average estimates of log(OR), their average estimated SEs and coverage rates for nominal 95% confidence intervals over 1000 simulation runs, using RELR and GEEs with full data, CRA and MMI. Monte Carlo errors for average estimates and average estimated SEs are all less than 0.016 and 0.003, respectively. The true value of conditional log(OR) in RELR is 1.36. The true value of population‐averaged log(OR) for GEE was empirically estimated using full data.
|
| Average estimate | Average estimated SE | Coverage (%) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Full | CRA | MMI | Full | CRA | MMI | Full | CRA | MMI | |||||||||||
| RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | RELR | GEE | ||
| S1 | 5 | 1.363 | 1.321 | 1.360 | 1.320 | 1.384 | 1.328 | 0.341 | 0.363 | 0.364 | 0.382 | 0.391 | 0.372 | 94.6 | 95.2 | 94.4 | 94.7 | 97.7 | 96.5 |
| 10 | 1.365 | 1.321 | 1.368 | 1.323 | 1.392 | 1.329 | 0.252 | 0.258 | 0.268 | 0.271 | 0.284 | 0.272 | 94.6 | 95.2 | 94.4 | 95.1 | 96.1 | 96.0 | |
| 20 | 1.361 | 1.315 | 1.363 | 1.317 | 1.385 | 1.322 | 0.182 | 0.184 | 0.193 | 0.192 | 0.201 | 0.195 | 94.7 | 95.0 | 95.0 | 94.7 | 95.8 | 95.5 | |
| 50 | 1.359 | 1.310 | 1.361 | 1.310 | 1.380 | 1.316 | 0.118 | 0.117 | 0.125 | 0.122 | 0.129 | 0.124 | 94.4 | 95.1 | 94.8 | 95.4 | 94.8 | 95.0 | |
| S2 | 5 | 1.345 | 1.311 | 1.368 | 1.333 | 1.402 | 1.335 | 0.336 | 0.320 | 0.405 | 0.417 | 0.456 | 0.438 | 94.7 | 94.8 | 95.5 | 94.9 | 98.6 | 98.6 |
| 10 | 1.350 | 1.309 | 1.356 | 1.313 | 1.384 | 1.308 | 0.250 | 0.258 | 0.298 | 0.301 | 0.330 | 0.317 | 93.2 | 94.4 | 94.7 | 95.4 | 97.0 | 97.1 | |
| 20 | 1.358 | 1.311 | 1.352 | 1.305 | 1.376 | 1.301 | 0.184 | 0.185 | 0.215 | 0.213 | 0.232 | 0.224 | 94.8 | 95.8 | 95.0 | 94.9 | 96.7 | 96.4 | |
| 50 | 1.366 | 1.316 | 1.367 | 1.318 | 1.389 | 1.316 | 0.118 | 0.117 | 0.138 | 0.135 | 0.146 | 0.141 | 95.3 | 95.7 | 95.0 | 95.0 | 95.8 | 96.0 | |
| S3 | 5 | 1.391 | 1.353 | 1.407 | 1.367 | 1.434 | 1.374 | 0.343 | 0.358 | 0.392 | 0.400 | 0.414 | 0.389 | 94.8 | 94.1 | 95.2 | 94.4 | 97.7 | 97.4 |
| 10 | 1.352 | 1.307 | 1.359 | 1.314 | 1.385 | 1.320 | 0.254 | 0.259 | 0.284 | 0.286 | 0.299 | 0.285 | 92.8 | 94.1 | 94.0 | 94.5 | 95.4 | 95.0 | |
| 20 | 1.372 | 1.326 | 1.370 | 1.325 | 1.395 | 1.330 | 0.183 | 0.184 | 0.204 | 0.202 | 0.212 | 0.203 | 93.2 | 94.4 | 93.2 | 94.1 | 94.1 | 94.1 | |
| 50 | 1.363 | 1.313 | 1.363 | 1.313 | 1.386 | 1.317 | 0.118 | 0.117 | 0.132 | 0.127 | 0.135 | 0.129 | 95.1 | 95.1 | 94.8 | 95.5 | 95.4 | 95.4 | |
| S4 | 5 | 1.375 | 1.336 | 1.413 | 1.378 | 1.476 | 1.390 | 0.346 | 0.366 | 0.497 | 0.493 | 0.535 | 0.505 | 94.5 | 95.2 | 97.0 | 94.0 | 98.6 | 98.5 |
| 10 | 1.366 | 1.325 | 1.377 | 1.334 | 1.431 | 1.342 | 0.252 | 0.258 | 0.353 | 0.351 | 0.375 | 0.357 | 94.6 | 95.3 | 95.3 | 94.6 | 96.5 | 96.6 | |
| 20 | 1.376 | 1.328 | 1.387 | 1.339 | 1.432 | 1.346 | 0.183 | 0.184 | 0.252 | 0.247 | 0.266 | 0.251 | 94.7 | 94.8 | 94.3 | 94.4 | 94.5 | 94.8 | |
| 50 | 1.360 | 1.312 | 1.362 | 1.313 | 1.397 | 1.317 | 0.118 | 0.117 | 0.160 | 0.156 | 0.167 | 0.157 | 95.4 | 95.7 | 94.8 | 94.5 | 94.4 | 94.2 | |
SEs: standard errors; RELR: random effects logistic regression; GEE: generalised estimation equations; CRA: complete records analysis; MMI: multilevel multiple imputation.
Risk difference, risk ratio and odds ratio estimates using CRA and MMI for the IST intervention trial data.
| Analysis approach |
|
| Risk difference | Risk ratio | Odds ratio | |
|---|---|---|---|---|---|---|
| Estimate (95% CI) | Estimate (95% CI) | Estimate (95% CI) | ||||
| Cluster‐level analysis | ||||||
| CRA | ||||||
| Unadjusted | 2027 | 2173 | 0.019 (−0.040, 0.077) | 1.047 (0.908, 1.208) | ||
| Adjusted | 1935 | 2027 | 0.022 (−0.033, 0.077) | 1.037 (0.908, 1.185) | ||
| MMI | ||||||
| Unadjusted | 2373 | 2451 | 0.021 (−0.038, 0.080) | 1.053 (0.911, 1.218) | ||
| Adjusted | 2373 | 2451 | 0.017 (−0.035, 0.070) | 1.040 (0.910, 1.189) | ||
| Individual‐level analysis | ||||||
| CRA | ||||||
| RELR | ||||||
| Unadjusted | 2027 | 2173 | — | 1.090 (0.841, 1.414) | ||
| Adjusted | 1935 | 2027 | — | 1.088 (0.839, 1.409) | ||
| GEE | ||||||
| Unadjusted | 2027 | 2173 | 1.048 (0.908, 1.209) | 1.082 (0.850, 1.378) | ||
| Adjusted | 1935 | 2027 | 1.019 (0.911, 1.141) | 1.070 (0.842, 1.359) | ||
| MMI | ||||||
| RELR | ||||||
| Unadjusted | 2373 | 2451 | — | 1.101 (0.849, 1.428) | ||
| Adjusted | 2373 | 2451 | — | 1.089 (0.841, 1.413) | ||
| GEE | ||||||
| Unadjusted | 2373 | 2451 | 1.053 (0.912, 1.215) | 1.090 (0.856, 1.389) | ||
| Adjusted | 2373 | 2451 | 1.019 (0.911, 1.140) | 1.072 (0.843, 1.363) | ||
Cluster‐level analysis was used to estimate the risk difference and the risk ratio.
GEE was used to estimate the risk ratio using log link and to estimate the marginal odds ratio using logit link.
CRA, complete records analysis; MMI, multilevel multiple imputation; RELR, random effects logistic regression; GEE, generalised estimation equation; IST, intermittent screening and treatment; CI, confidence interval.