| Literature DB >> 28797179 |
Jennifer A Thompson1,2, Katherine Fielding1, James Hargreaves3, Andrew Copas2.
Abstract
Background/Aims We sought to optimise the design of stepped wedge trials with an equal allocation of clusters to sequences and explored sample size comparisons with alternative trial designs. Methods We developed a new expression for the design effect for a stepped wedge trial, assuming that observations are equally correlated within clusters and an equal number of observations in each period between sequences switching to the intervention. We minimised the design effect with respect to (1) the fraction of observations before the first and after the final sequence switches (the periods with all clusters in the control or intervention condition, respectively) and (2) the number of sequences. We compared the design effect of this optimised stepped wedge trial to the design effects of a parallel cluster-randomised trial, a cluster-randomised trial with baseline observations, and a hybrid trial design (a mixture of cluster-randomised trial and stepped wedge trial) with the same total cluster size for all designs. Results We found that a stepped wedge trial with an equal allocation to sequences is optimised by obtaining all observations after the first sequence switches and before the final sequence switches to the intervention; this means that the first sequence remains in the control condition and the last sequence remains in the intervention condition for the duration of the trial. With this design, the optimal number of sequences is [Formula: see text], where [Formula: see text] is the cluster-mean correlation, [Formula: see text] is the intracluster correlation coefficient, and m is the total cluster size. The optimal number of sequences is small when the intracluster correlation coefficient and cluster size are small and large when the intracluster correlation coefficient or cluster size is large. A cluster-randomised trial remains more efficient than the optimised stepped wedge trial when the intracluster correlation coefficient or cluster size is small. A cluster-randomised trial with baseline observations always requires a larger sample size than the optimised stepped wedge trial. The hybrid design can always give an equally or more efficient design, but will be at most 5% more efficient. We provide a strategy for selecting a design if the optimal number of sequences is unfeasible. For a non-optimal number of sequences, the sample size may be reduced by allowing a proportion of observations before the first or after the final sequence has switched. Conclusion The standard stepped wedge trial is inefficient. To reduce sample sizes when a hybrid design is unfeasible, stepped wedge trial designs should have no observations before the first sequence switches or after the final sequence switches.Entities:
Keywords: Stepped wedge trial; cluster randomised trial; design effect; hybrid trial; power; sample size; study design
Mesh:
Year: 2017 PMID: 28797179 PMCID: PMC5718336 DOI: 10.1177/1740774517723921
Source DB: PubMed Journal: Clin Trials ISSN: 1740-7745 Impact factor: 2.486
Figure 1.Diagrammatic illustrations of trial designs. Each has the same number of clusters and the same total cluster size. (a)–(d) Stepped wedge cluster-randomised trials (SWTs) with four sequences varying the amount of data before and after rollout. (a) Standard design: the same number of observations before and after rollout and between sequences switching, (b) number of observations before and after rollout is half the number between sequences switching, (c) optimised design: no observations before or after rollout, (d) no observations before rollout, 50% after rollout. (e)–(g) Other designs: (e) parallel cluster-randomised trial: CRT, (f) parallel cluster-randomised trial with baseline observations, (g) hybrid design with 50% CRT, 50% SWT with four sequences and the number of observations before and after rollout equal to half the number between sequences switching.
Figure 2.Optimal number of sequences by the cluster-mean correlation. The number of sequences tends to infinity as the cluster-mean correlation tends to 1.
Figure 3.Graph of the sample size of the SWT with no observations outside rollout, relative to the optimised hybrid against the cluster-mean correlation. Darkest and dotted line = 2 sequences, lightest and solid line = 20 sequences. The optimal SWT is the lowest line at any given cluster-mean correlation.
Illustrative example of the number of clusters required by different designs to achieve 80% power to detect a difference of 0.1 with standard deviation of 1.
| Design | Calculated number of clusters | Final design | ||
|---|---|---|---|---|
| Number of clusters after rounding | Total cluster size[ | Power (%) | ||
| Optimised SWT | ||||
| 8 sequences, no observations outside rollout | 86.1 | 88 | 84 | 81 |
| Other SWT designs | ||||
| 88 sequences, no observations outside rollout | 87.7 | 88 | 87 | 81 |
| 8 sequences, 22% outside rollout (standard SWT) | 94.0 | 96 | 81 | 80 |
| 3 sequences, no observations outside rollout | 96.9 | 99 | 84 | 81 |
| 3 sequences, optimal outside rollout (14%) | 94.2 | 96 | 84 | 81 |
| Other designs | ||||
| CRT | 161.5 | 162 | 84 | 80 |
| CRT with optimal proportion of observations at baseline (36%) | 111.6 | 112 | 84 | 80 |
| Hybrid: 78% 17-sequence SWTs (optimal[ | 84.8 | 86: 68 SWTs, 18 CRTs | 85 | 81 |
CRT: parallel cluster-randomised trial; SWT: stepped wedge trial.
Total cluster size = 84, intracluster correlation coefficient (ICC) = 0.04, 5% significance level and 80% power.
Difference in calculated number of clusters and final number of clusters is due to rounding up and the requirement for an equal number of clusters per sequence.
For power calculations, the total cluster size had to be varied for some of these designs to allow an equal number of observations in each period of the trial.
The optimal number of sequences was 68, which gave a calculated number of clusters or 84.7. For 17 sequences, the calculated number is higher, but the final number of clusters required was the same as for 68 sequences and allowed a total cluster size similar to the other designs being considered.