| Literature DB >> 30314463 |
Jane Candlish1, M Dawn Teare2, Munyaradzi Dimairo2, Laura Flight2, Laura Mandefield2, Stephen J Walters2.
Abstract
BACKGROUND: In individually randomised trials we might expect interventions delivered in groups or by care providers to result in clustering of outcomes for participants treated in the same group or by the same care provider. In partially nested randomised controlled trials (pnRCTs) this clustering only occurs in one trial arm, commonly the intervention arm. It is important to measure and account for between-cluster variability in trial design and analysis. We compare analysis approaches for pnRCTs with continuous outcomes, investigating the impact on statistical inference of cluster sizes, coding of the non-clustered arm, intracluster correlation coefficient (ICCs), and differential variance between intervention and control arm, and provide recommendations for analysis.Entities:
Keywords: Clustering; Individually randomised cluster trial; Individually randomised group treatment; Intervention studies; Partially clustered; Partially nested; Randomised controlled trial; Therapist effects
Mesh:
Year: 2018 PMID: 30314463 PMCID: PMC6186141 DOI: 10.1186/s12874-018-0559-x
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Summary of relevant literature on analysis of pnRCTs
| Paper | Relevant themes | Range of valuesa | Findings |
|---|---|---|---|
| Schweig & Pane [ | Describe and compare models for pnRCTs with non-compliance using a simulation study. | Simulation for two levels of clustering, exact cluster sizes ( | Clustering and non-compliance may have a substantial impact on statistical inference about intention-to-treat effects. Provide methods that may accommodate pnRCT with non-compliance, recommend using complier average causal effect estimate (CACE) and scaling by the proportion of compliers. No mention of degrees of freedom, we have assumed they used default degrees of freedom method available in R lme packages. |
| Flight et al. [ | Compare models applied to four examples of pnRCTs. Compare three different methods for classifying the non-clustered control arm in pnRCTs, including: singleton clusters, one large cluster and pseudo clusters. | Examples with { | Recommend use of the heteroscedastic model, recommendations based only on re-analysis of case studies. Methods for classifying the non-clustered control arm in pnRCTs had a large impact in fully clustered mixed effects models and no measurable impact in partially nested mixed-effects models. ICCs in four examples were small. |
| Sterba [ | Review of modelling developments for pnRCTs, focused on those particularly relevant to psychotherapy trials. | Recommend the inclusion of cluster variability in analysis model as it provides insight into treatment process (rather than treating it as a nuisance). Annotated Mplus commands for models | |
| Lohr, Schochet & Sanders [ | Report presenting a guide to design and analysis issues for pnRCTs in education research, using example trials. Discussion of degrees of freedom issue in Appendix. | Guidance document, defines pnRCT in context of education research and show methods to analyse these using SAS. Provide SAS commands for model fitting in examples. | |
| Korendijk [ | Compare models for pnRCTs using simulation study, investigate mis-specification for the estimation of the parameters and their standard errors. | Simulation study with | All models perform comparably with respect to fixed effect estimates. Recommend use of partially nested mixed-effects model. Simulations were under null and ICC always greater than zero. No mention of degrees of freedom, we have we assumed default degrees of freedom used from MLwiN software, and homoscedasticity was assumed for ndividual variances between the two arms. |
| Sanders [ | Compare models for pnRCTs using simulation study in terms of Type I error and power | Simulation study with { | Type I error rate increased as ICC increased, Satterthwaite degrees of freedom had better control than Kenward-Roger degrees of freedom. Found using mixed-effects model for pnRCT when ICC is zero likely leads to never detecting intervention effects, observed Type I error rates nearly non-existent under all scenarios with ICC equal to zero. Recommend to evaluate if ICC is significantly different from zero prior to selecting analysis method. Homoscedasticity was assumed for individual variances between the two arms. |
| Baldwin et al. [ | Compare analysis models for pnRCT simulation study, comparing three degrees of freedom calculations, and a pnRCT example. | Simulation for | Recommend pnRCTs take account of heteroscedasticity. Satterthwaite and Kenward-Roger degrees of freedom control Type I error rate. The heteroscedastic model provides an unbiased estimate and little reduction in power compared to the homoscedastic model. Argue that using a partially nested mixed-effects model only a problem for statistical inference when the number of clusters is small. The number of clusters has greater impact on power in pnRCTs. At least eight, preferably 16 clusters, to maintain Type I error rate. |
| Bauer et al. [ | Review of RCTs to ascertain the prevalence of pnRCTS in four public health and clinical research journals. Analysis models for pnRCTs extended to include pre-test measures as covariates, individual and group level covariates, and example of pnRCT | Example with clustering in one arm | Out of 94 RCTs, 32% were pnRCTs, 40% iRCTs and 27% cRCT. None used methods specific to pnRCTs. Example pnRCT data could be analysed using mixed-effects models. Argue pnRCTs “often increase external validity at the expense of internal validity” (p.20). |
| Roberts & Roberts [ | Examine the case of pnRCTs, heterogeneity, comparison of analysis methods for simulation study and present an example. | Simulation for m = 6, | Recommend pnRCTs take account of heteroscedasticity. Satterthwaite unequal variances t-test gave robust to heteroscedasticity. The heteroscedastic model gives slightly inflated test size for large ρ: suggest Satterthwaite degrees of freedom as a solution. |
| Lee & Thompson [ | Describe analysis models for iRCTs with clustering and apply to two examples (using Bayesian approach) | Show that ignoring clustering may underestimate uncertainty, leading to incorrect conclusions. | |
| Hoover [ | Statistical tests for RCTs with clustering that differ across trial arms. | Example with clustering in both arms with | Provide an adjustment for the independent samples t-test for pnRCTs. Statistical impact of heterogeneity effect increases as the cluster size increases, and as heterogeneity increases. The test does not allow for the inclusion of covariates, multiple treatments, baseline measures, or non-normally distributed outcomes. |
am = cluster size, c = number of clusters, ρ= ICC, d = standardised effect size, ω2= Omega Squared effect size percent of variability accounted for by treatment condition, λ= ratio of total variance in control arm compared to clustered, λ= ratio of individual variance in control arm compared to clustered arm. Ordered by year of publication
Models for the analysis of pnRCTs
| Model description | Statistical model | Heteroscedastic residuals | |
|---|---|---|---|
| Model 1 | Linear regression (ignore clustering) | No | |
| Model 2 | Fully clustered (impose clustering) | No | |
| Model 3 | Partially nested homoscedastic | No | |
| Model 4 | Partially nested heteroscedastic | Yes | |
Options for imposing clustering in the non-clustered control arm
| Option | Cluster | Intervention |
|---|---|---|
| 1 | ||
| 2 | ||
| 3 |
Fig. 1Flowchart representing the simulation study steps
Simulation input scenario values (total 1440 scenarios)
| Variable | Notation | Values |
|---|---|---|
| Number of clusters |
| 3, 6, 12, 24 |
| Cluster size |
| 5, 10, 20, 30 |
| Intervention effect |
| 0, 0.2, 0.5 |
| ICC |
| 0, 0.01, 0.05, 0.1, 0.2a, 0.3a |
| Ratio of individual variance between control and cluster trial arms |
| 0.25a, 0.5, 1, 2, 4a |
aConsidered extreme values to occur in rare scenarios
Fig. 2Example of simulated partially nested trial dataset, ρ = 0.1, γ = 1, c = 12, and m = 10
Fig. 3Mean Type I error rate by γ and ρ over all scenarios, for each model
Fig. 4Type I error rate of models 1, 3 and 4, by ρ, γ, c, and m
Fig. 5Mean coverage of 95% confidence interval, by ρ and γ over all scenarios
Fig. 6Power when θ = 0.5, by ρ, γ, c, and m
Fig. 7Power with standardised intervention effect of 0.5 (θ = 0.5 and γ = 1)
Mean and SD of power of model 4 versus model 1 under ρ = 0 over all scenarios
| Intervention effect (θ) | Model | Power |
|---|---|---|
| Mean (SD) | ||
| 0 | 1 | 0.050 (0.007) |
| 4 | 0.033 (0.014) | |
| 0.2 | 1 | 0.388 (0.276) |
| 4 | 0.327 (0.286) | |
| 0.5 | 1 | 0.803 (0.254) |
| 4 | 0.740 (0.298) |
Fig. 8Mean estimated ICC by γ and ρ over all scenarios, for each model
Fig. 9ICC estimation of heteroscedastic partially nested model, by γ, ρ, m and c
Summary of simulation results by different models split by ρ, m, and c averaged over all γ
*Model 1: simple linear regression; Model 3: homoscedastic partially nested mixed effects model; Model 4: heteroscedastic partially nested mixed effects model. Green highlighted ≤ than expected, red highlighted > than expected