| Literature DB >> 26846897 |
James Martin1, Monica Taljaard2, Alan Girling1, Karla Hemming1.
Abstract
BACKGROUND: Stepped-wedge cluster randomised trials (SW-CRT) are increasingly being used in health policy and services research, but unless they are conducted and reported to the highest methodological standards, they are unlikely to be useful to decision-makers. Sample size calculations for these designs require allowance for clustering, time effects and repeated measures.Entities:
Keywords: CONSORT; cluster; randomised trial
Mesh:
Year: 2016 PMID: 26846897 PMCID: PMC4746455 DOI: 10.1136/bmjopen-2015-010166
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Schematic illustration of the stepped-wedge cluster randomised trial.
Figure 2Flow chart showing studies identified by the systematic review. SW, stepped-wedge.
Basic trial demographics of included SW-CRTs, values are numbers (percentages) unless stated otherwise
| Total | Protocols | Full reports | |
|---|---|---|---|
| Year of publication | |||
| 1987–2012 | 28 (46.7) | 12 (42.9) | 16 (50.0) |
| 2013–2014 | 32 (53.3) | 16 (57.1) | 16 (50.0) |
| Journal Impact Factor | |||
| Median (IQR) | 2.6 (2.0–3.5) | 2.3 (2.1–4.8) | 3.3 (2.0–4.8) |
| Country of study | |||
| Australia | 7 (11.7) | 6 (21.4) | 1 (3.1) |
| Canada or USA | 15 (25.0) | 4 (14.3) | 11 (34.4) |
| UK or Ireland | 11 (18.3) | 3 (10.7) | 8 (25.0) |
| Other higher income country | 15 (25.0) | 9 (32.1) | 6 (18.8) |
| Middle-income country | 9 (15.0) | 4 (14.3) | 5 (15.6) |
| Low-income country | 3 (5.0) | 2 (7.1) | 1 (3.1) |
| Type of setting | |||
| Healthcare | 50 (83.3) | 25 (89.3) | 25 (78.1) |
| Non-healthcare | 10 (16.7) | 3 (10.7) | 7 (21.9) |
| Cluster | |||
| General practice | 7 (11.7) | 6 (21.4) | 1 (3.1) |
| Hospital/ward/specialties | 12 (20.0) | 5 (17.9) | 7 (21.9) |
| Other health cluster | 20 (33.3) | 9 (32.1) | 11 (34.4) |
| Geographical unit | 11 (18.3) | 5 (17.9) | 6 (18.8) |
| Other/unclear | 10 (16.7) | 3 (10.7) | 7 (21.9) |
| Number of study arms | |||
| Two | 56 (93.3) | 25 (89.3) | 31 (96.9) |
| Three or more | 4 (6.7) | 3 (10.7) | 1 (3.1) |
| Randomisation type | |||
| Simple | 35 (58.3) | 15 (53.6) | 20 (62.5) |
| Paired | 4 (6.7) | 0 (0) | 4 (12.5) |
| Stratified | 14 (23.3) | 10 (35.7) | 4 (12.5) |
| Other/unclear | 7 (11.7) | 3 (10.7) | 4 (12.5) |
| Primary outcome type | |||
| Continuous | 15 (25.0) | 10 (35.7) | 5 (15.6) |
| Binary | 34 (55.7) | 13 (4.4) | 21 (65.6) |
| Other | 5 (8.3) | 2 (7.1) | 3 (9.4) |
| Unclear/not reported | 6 (10.0) | 3 (10.7) | 3 (9.4) |
| Published protocol | NA | 5 (15.6) | |
NA, not available; SW-CRT, stepped-wedge cluster randomised trial.
Summary of the realised design features of the included stepped-wedge cluster randomised trial. Values are numbers (percentages) unless stated otherwise
| Full trial report | |
|---|---|
| Number of steps* | |
| Two | 9 (28.1) |
| Three or four | 8 (25.0) |
| More than four | 14 (43.8) |
| Not reported | 1 (3.1) |
| Median (IQR) | 4.0 (2.0–6.0) |
| Number of clusters | |
| Less than 10 | 9 (28.1) |
| 10 or more | 22 (68.8) |
| Not reported | 1 (3.1) |
| Median (IQR) | 17.0 (8.0–38.0) |
| Total cluster size† | |
| Median (IQR) | 55.0 (24.0–326.0) |
| Number of clusters randomised per step | |
| Median (IQR) | 3.0 (1.0–8.0) |
| Number of measurement points‡ | |
| Median (IQR) | 5.0 (3.0–7.5) |
| Study duration (months), median (IQR) | 16.0 (8.0–24.0) |
| Step duration (months), median (IQR) | 2.0 (1.0–4.0) |
| Design type§ | |
| Cross-sectional | 5 (15.6) |
| Cohort | 12 (37.5) |
| Open cohort | 10 (31.3) |
| Unclear | 5 (15.6) |
| Variations on design | |
| Transition periods | 1 (3.1) |
| Extended pre-period or postperiod | 11 (34.4) |
| Other | 6 (18.8) |
*Steps are points at which clusters are randomised.
†For cohort studies this is the total number of observations made within the cluster, it includes the size of clusters in which there was lack of clarity of cluster size and cluster size per measurement period but for which a judgement was made.
‡Measurement points are the number of separate periods or points in time in which outcome data are collected.
§Design type includes those for which there was lack of clarity but for which a judgement was made.
Quality of reporting of basic sample size elements from the CONSORT 2010 statement and the Cluster 2012 extension to the CONSORT statement
| All studies | 1987–2012 | 2013–2014 | Absolute difference (95% CI) | p Value | |
|---|---|---|---|---|---|
| Sample size justification | |||||
| Reported | 45 (75.0) | 18 (64.3) | 27 (84.4) | 20.1 (−1.7 to 41.8) | 0.073 |
| Item 1 | |||||
| Level of significance | 39 (65.0) | 16 (57.1) | 23 (71.8) | 14.7 (−9.3 to 38.8) | 0.233 |
| Item 2 | |||||
| Power | 45 (75.0) | 18 (64.3) | 27 (84.4) | 20.1 (−1.7 to 41.8) | 0.073 |
| Item 3 | |||||
| Treatment effect† | 33 (55.0) | 15 (53.6) | 18 (56.3) | 2.7 (−22.6 to 27.9) | 0.835 |
| Item 4 | |||||
| Consistency with primary outcome | 38 (63.3) | 14 (50.0) | 24 (75.0) | 25.0 (1.2 to 48.8) | 0.045 |
| Item 5 | |||||
| Allowance for attrition | 18 (30.0) | 7 (25.0) | 11 (34.4) | 9.4 (−13.6 to 32.4) | 0.429 |
| Item 6 | |||||
| Number of clusters | 58 (96.7) | 27 (96.4) | 31 (96.9) | 0.4 (−8.7 to 9.6) | 0.923 |
| Median cluster size | 39 (65.0) | 15 (53.6) | 24 (75.0) | 21.4 (−2.4 to 45.2) | 0.083 |
| Item 7 | |||||
| Variation in cluster size* | 6 (10.0) | 1 (3.6) | 5 (15.6) | 12.1 (−2.3 to 26.4) | 0.201 |
| Item 8 | |||||
| Variation in outcome across clusters (ie, ICC) | 33 (55.0) | 11 (39.3) | 22 (68.8) | 29.5 (5.3 to 53.7) | 0.022 |
| Item 9 | |||||
| Uncertainty of ICC (or equivalent)* | 8 (13.3) | 3 (10.7) | 5 (15.6) | 4.9 (−12.1 to 21.9) | 0.712 |
| All ItemS | |||||
| Number items reported median (IQR) | 5.0 (2.5–6.0) | 4.0 (1.0–6.0) | 6.0 (5.0–6.0) | 1.22 (0.07 to 2.36) | 0.067 |
| Reporting all nine items | 0 (0) | 0 (0) | 0 (0) | ||
Values are numbers (percentages) unless stated. p Value is for the comparison of 1987–2012 publications and 2013–2014 publications using a χ2 test for proportions (categorical outcomes) or Mann-Whitney U test (where medians are reported), or (*) using Fisher's exact test.
†A sufficient reporting of the treatment effect consists of either a standardised effect size; a mean difference and SD; means in both arms and SD; proportions in both arms; proportion in one arm and a difference.
ICC, intracluster correlation.
Reporting of stepped-wedge cluster randomised trial sample size elements according to the proposed modification to the Cluster 2012 extension for cluster randomised trials
| All reports | 1987–2012 | 2013–2014 | Absolute difference (95% CI) | p Value | |
|---|---|---|---|---|---|
| Number of steps | |||||
| Explicitly reported | 54 (90.0) | 23 (82.1) | 31 (96.9) | 14.7 (−0.7 to 30.1) | 0.058 |
| Reported or deducible | 59 (98.3) | 27 (96.4) | 32 (100.0) | 3.6 (−3.3 to 10.4) | 0.281 |
| Number clusters randomised per step | |||||
| Reported | 56 (93.3) | 25 (89.3) | 31 (96.9) | 7.6 (−5.4 to 20.5) | 0.240 |
| Schematic representation | |||||
| Reported | 46 (76.7) | 20 (71.4) | 26 (81.3) | 9.8 (−11.7 to 31.3) | 0.370 |
| Design type (ie, cross-sectional/cohort) | |||||
| Explicitly reported | 16 (26.7) | 6 (21.4) | 10 (31.3) | 9.8 (−12.3 to 31.9) | 0.391 |
| Reported or deducible | 43 (71.7) | 19 (67.9) | 24 (75.0) | 7.1 (−15.8 to 30.0) | 0.540 |
| Clarity of cluster size† | |||||
| Total cluster size reported | 17 (28.3) | 8 (28.6) | 9 (28.1) | −0.4 (−23.3 to 22.4) | 0.969 |
| Cluster size per measurement period reported | 25 (41.7) | 10 (35.7) | 15 (46.9) | 11.2 (−13.6 to 35.9) | 0.382 |
| Unclear/not reported | 29 (48.3) | 15 (53.6) | 14 (43.8) | −9.8 (−35.1 to 15.4) | 0.448 |
Values are numbers (percentages) unless stated otherwise.
p Value is for the comparison of 1987–2012 publications and 2013–2014 publications using a χ2 test for proportions.
†Some studies reported both total cluster size and cluster size per measurement period.
Methodological assessment sample size calculations and trial justification in SW-CRTs, among those studies reporting a sample size calculation
| All reports | 1987–2012 | 2013–2014 | Absolute difference (95% CI) | p Value | |
|---|---|---|---|---|---|
| Allowance for clustering | |||||
| Number (%) | 33 (73.3) | 11 (61.1) | 22 (81.5) | 20.4 (−6.5 to 47.2) | 0.130 |
| Allowance for time effects | |||||
| Number (%)* | 15 (33.3) | 3 (16.7) | 12 (44.4) | 27.8 (2.3 to 53.2) | 0.063 |
| Allowance for repeated measurements†* | |||||
| Number (%) | 3/24 (12.5) | 2/11 (18.2) | 1/13 (7.7) | −10.5 (−37.5 to 16.5) | 0.576 |
| Power methodology | |||||
| Hussey and Hughes | 14 (31.1) | 3 (16.7) | 11 (40.7) | 24.1 (−1.2 to 49.4) | 0.087 |
| Other, allowing for time effects* | 2 (4.4) | 0 (0) | 2 (7.4) | 7.4 (−2.5 to 17.3) | 0.509 |
| Other, not allowing for time effects | 14 (31.1) | 10 (55.6) | 4 (14.8) | −40.7 (−67.3 to −14.2) | 0.004 |
| Not stated | 15 (33.3) | 5 (27.8) | 10 (37.0) | 9.3 (−18.3 to 36.8) | 0.519 |
| Power methodology for additional features | |||||
| Transition periods | 0 (0) | 0 (0) | 0 (0) | ||
| Interactions (eg, lag effects) | 0 (0) | 0 (0) | 0 (0) | ||
| Extended correlations* | 2 (4.4) | 2 (11.1) | 0 (0) | −11.1 (−25.6 to 3.4) | 0.155 |
| Varying cluster size* | 3 (6.7) | 1 (5.6) | 2 (7.4) | 1.9 (−12.6 to 16.3) | 1.000 |
| Variation in outcomes across clusters‡ | |||||
| Reported using ICC | 20/33 (60.6) | 8/11 (72.7) | 12/22 (54.6) | −18.2 (−51.7 to 15.4) | 0.314 |
| Reported using CV* | 10/33 (30.3) | 2/11 (18.2) | 8/22 (36.4) | 18.2 (−12.2 to 48.6) | 0.430 |
| Reported using DE* | 1/33 (3.0) | 1/11 (9.1) | 0/22 (0) | −9.1 (−26.1 to 7.9) | 0.333 |
| Reported using between cluster variation* | 2/33 (6.1) | 0/11 (0) | 2/22 (9.1) | 9.1 (−2.9 to 21.1) | 0.542 |
Values are numbers (percentages) unless stated otherwise by year of publication.
p Value is for the comparison of 1987–1990 publications and 2013–2014 publications using a χ2 test for proportions or (*) using Fisher's exact test.
†Among those with a cohort design.
‡As a percentage of studies for which some measure of variation was reported.
CV, coefficient of variation; DE, design effect; ICC, intracluster correlation; SW-CRT, stepped-wedge cluster randomised trial.