| Literature DB >> 31910861 |
Shaun Treweek1, Simon Bevan2, Peter Bower3, Matthias Briel4, Marion Campbell5, Jacquie Christie6, Clive Collett7, Seonaidh Cotton5, Declan Devane8, Adel El Feky5, Sandra Galvin8, Heidi Gardner5, Katie Gillies5, Kerenza Hood9, Jan Jansen10, Roberta Littleford11, Adwoa Parker12, Craig Ramsay5, Lynne Restrup13, Frank Sullivan14, David Torgerson12, Liz Tremain2, Erik von Elm15, Matthew Westmore2, Hywel Williams16, Paula R Williamson17, Mike Clarke18.
Abstract
The evidence base available to trialists to support trial process decisions-e.g. how best to recruit and retain participants, how to collect data or how to share the results with participants-is thin. One way to fill gaps in evidence is to run Studies Within A Trial, or SWATs. These are self-contained research studies embedded within a host trial that aim to evaluate or explore alternative ways of delivering or organising a particular trial process.SWATs are increasingly being supported by funders and considered by trialists, especially in the UK and Ireland. At some point, increasing SWAT evidence will lead funders and trialists to ask: given the current body of evidence for a SWAT, do we need a further evaluation in another host trial? A framework for answering such a question is needed to avoid SWATs themselves contributing to research waste.This paper presents criteria on when enough evidence is available for SWATs that use randomised allocation to compare different interventions.Entities:
Mesh:
Year: 2020 PMID: 31910861 PMCID: PMC6945587 DOI: 10.1186/s13063-019-3980-5
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Should we do a further evaluation of the intervention in a SWAT?
| The five proposed criteria for deciding whether the intervention needs another evaluation in a SWAT. The more criteria that are met, the more likely we are to conclude that further evaluation in a SWAT is appropriate. | |
| 1. | |
| 2. | |
| 3. | |
| • P – is the population in the host trial so different from those already included that the current evidence does not provide sufficient certainty? | |
| • I – are the health interventions in the host trial so different from those already included that the current evidence does not provide sufficient certainty? | |
| • C – is the comparator in the host trial so different from those already included that the current evidence does not provide sufficient certainty? | |
| • O – is the SWAT outcome(s) so different to those used in the existing evaluations that that the current evidence does not provide sufficient certainty? | |
| • T – in the time since the existing evaluations were done, have regulatory, technological or societal changes made those evaluations less relevant? | |
| 4. | |
| 5. |
Notes
a A GRADE assessment of ‘high’ means that we are confident that the true effect lies close to the estimate of effect coming from the cumulative meta-analysis [24]. In Cochrane’s deliberations as to when to close a Cochrane Review (https://www.cochranelibrary.com/cdsr/doi/10.1002/14651858.ED000107/full), the collaboration chose not to require ‘high’ GRADE certainty in the evidence because it was felt that this may not always be achievable. Although we recognise the pragmatic nature of this, we recommend ‘high’ in our criteria because SWATs are usually simple studies for which it should be possible to generate high certainty evidence. We will, however, keep this criterion under review to consider whether it needs relaxing.
b This is a judgement that depends on the behaviour of the effect estimates and on whether the confidence intervals include the threshold for an important benefit (or disadvantage). For example, if there is drift in the effect estimates of a meta-analyses but the confidence intervals around the estimates are consistently above what you think is an important benefit (or below a relevant disadvantage) then the cumulative meta-analysis can be judged to have converged despite movement in the effect estimates. For more on GRADE see http://www.gradeworkinggroup.org.
c This is a judgement that depends on the behaviour of the effect estimates and on whether the confidence intervals include the threshold for an important benefit (or disadvantage). For example, if there is drift in the effect estimates of a meta-analyses but the confidence intervals around the estimates are consistently above what you think is an important benefit (or below a relevant disadvantage) then the cumulative meta-analysis can be judged to have converged despite movement in the effect estimates. For more on GRADE see http://www.gradeworkinggroup.org.
d This is to provide reassurance about the applicability of the result to different types of trials. Care is needed to avoid a default position of insisting on an evaluation in every conceivable context. In other words, is there any reason to believe that the intervention would not work in your context given the contexts already studied? It is possible that evidence from SWATs will eventually splinter off to focus specifically on certain contexts but, for now, we suggest pooling evaluations of the same intervention because there are so few SWAT evaluations of any intervention and this pooling will provide a basic foundation on which to build.
e Where there may be no conceivable benefit or disadvantage for participants, they should be considered as balanced.
f A benefit might be that the host trial recruits faster, or its data quality is improved. Examples of disadvantages might be that there are added costs to the host trial, or that a new task is introduced into the workload of trial managers.
The cumulative effect estimates for the two telephone reminders compared to no reminder studies included in the updated Cochrane recruitment interventions review [1]
| Total number of participants | Intervention (n recruited/N invited) | Control (n recruited/N invited) | Baseline (control) recruitment rate | Effect estimate (95% CI) | |
|---|---|---|---|---|---|
Comparator was no call. Calls were made by research team. People were being recruited to a return to work trial for people on sick leave for > 7 weeks). | 498 | 31/256 | 11/242 | 4.5% | 8% (3%–12%) |
Comparator was no call. Calls were made by research nurses. People were being recruited to a colorectal cancer screening trial). | 952 | 59/480 | 35/472 | 7.4% | 5% (1%–9%) |
| 1450 | 90/736 | 46/714 | 6.0% (mean) | 6% (3%–9%) |
The GRADE rating of the certainty in the evidence is high
1. Both trials are scored as low risk of bias on the Cochrane Risk of bias tool
2. The results are consistent
3. The outcome was direct
4. The results are not imprecise; the confidence intervals are not too large and wholly on the side of benefit
5. There are too few trials for an assessment of publication bias and we have assumed that there is none
NOTE: the evidence for this intervention comes entirely from trials with low (< 10%) underlying recruitment. When applied to trials with higher recruitment we would downgrade the GRADE assessment because of Indirectness to moderate
Fig. 1Summary of the cumulative evidence for the effect of telephone reminders on trial recruitment. The dotted lines represent decision thresholds of 0%, 5%, 10% and 15% that trialists can consider when deciding whether to use the intervention in their own trial
The cumulative effect estimates for the three monetary incentives compared to no incentive studies included in the Cochrane retention interventions review [2]
| Total number of participants | Intervention (n recruited/N invited) | Control (n recruited/N invited) | Baseline (control) recruitment rate | Effect estimate (95% CI) | |
|---|---|---|---|---|---|
(Sending $10 or $2 with invitations to return DNA sample (in mouthwash). Comparator was no money. People responding were a subgroup of a smoking cessation trial population). | 300 | 77/200 | 34/100 | 34% | 5% (−7% to 16%) |
(Sending £5 voucher with invitations to return trial follow-up questionnaire. Comparator was no money. People responding were taking part in a trial to improve neonatal outcomes). | 722 | 156/369 | 108/353 | 31% | 12% (5%–19%) |
(Sending £5 voucher with invitations to return trial follow-up questionnaire. Comparator was no money. People responding were taking part in a trial to improve neck injury outcomes). | 2144 | 560/1070 | 493/1074 | 46% | 6% (2%–11%) |
| 3166 | 793/1639 | 635/1527 | 37% (mean) | 8% (4%–11%) |
The GRADE rating of the certainty in the evidence is moderate
1. Only one of the three trials is scored as low risk of bias on the Cochrane Risk of bias tool; one was uncertain, the other high risk of bias. We considered this a serious limitation and downgraded 1 level
2. The results have some inconsistency in confidence intervals but not the direction of effect and on balance we decided not to downgrade
3. The outcome was direct
4. The results showed signs of imprecision but just for the smallest trial; the confidence intervals of the two larger trials are not too large and wholly on the side of benefit. We did not downgrade
5. There are too few trials for an assessment of publication bias and we have assumed that there is none
Fig. 2Summary of the cumulative evidence for the effect of monetary incentives on trial retention. The dotted lines represent decision thresholds of 0%, 5%, 10% and 15% that trialists can consider when deciding whether to use the intervention in their own trial