| Literature DB >> 28912825 |
Abstract
BACKGROUND: The preferred method to evaluate public health interventions delivered at the level of whole communities is the cluster randomised trial (CRT). The practical limitations of CRTs and the need for alternative methods continue to be debated. There is no consensus on how to classify study designs to evaluate interventions, and how different design features are related to the strength of evidence. ANALYSIS: This article proposes that most study designs for the evaluation of cluster-level interventions fall into four broad categories: the CRT, the non-randomised cluster trial (NCT), the controlled before-and-after study (CBA), and the before-and-after study without control (BA). A CRT needs to fulfil two basic criteria: (1) the intervention is allocated at random; (2) there are sufficient clusters to allow a statistical between-arm comparison. In a NCT, statistical comparison is made across trial arms as in a CRT, but treatment allocation is not random. The defining feature of a CBA is that intervention and control arms are not compared directly, usually because there are insufficient clusters in each arm to allow a statistical comparison. Rather, baseline and follow-up measures of the outcome of interest are compared in the intervention arm, and separately in the control arm. A BA is a CBA without a control group.Entities:
Keywords: Cluster randomisation; Public health interventions; Study design
Year: 2017 PMID: 28912825 PMCID: PMC5590121 DOI: 10.1186/s12982-017-0063-5
Source DB: PubMed Journal: Emerg Themes Epidemiol ISSN: 1742-7622
Defining characteristics of cluster randomised trials (CRT), non-randomised cluster trials (NCT), controlled before-and-after study (CBA), and before-and-after trials (BA)
| CRT | NCT | CBA | BA | |
|---|---|---|---|---|
| Defining features | Randomisation of adequate number of clusters to allow statistical between-arm comparison | Non-random allocation of adequate number of clusters to allow statistical between arm comparison | Random or non-random allocation of small number of clusters - too few to allow statistical between-arm comparison | Before and after assessment of outcomes in the absence of a control group |
| Appropriate setting | Adequate resources, randomisation feasible | Adequate resources, randomisation is politically or logistically not possible | Resource limited evaluations or where number of clusters is naturally constrained (e.g. a district-level intervention in a province that only has 4 districts) | Mass media campaign where no unexposed group can be found |
| Detectable effect size given adequate sample size | Small | Small to moderate (depending on baseline comparability and temporal stability of outcome in control arm) | Moderate to large (depending on baseline comparability and temporal stability of outcome in control arm) | Large |
| Number of clusters | At least 4–6 clusters per arm, higher if effect size is small | At least 4–6 clusters per arm, higher if effect size is small | At least 2 clusters per arm, unless outcome is assessed repeatedly before and after | Study power is determined by number of participants and number of pre/post measures |
| Baseline measure of outcome of interest | Not required, but may increase study power and allow adjusting for imbalances | Required | Required | Required |
| Outcome assessment at multiple time-points | Not required, but may increase study power | Usually not required, but may increase study power | Desirable, required if there is only one cluster per arm | Required |
| Statistical analysis | Direct comparison between intervention and control | Direct comparison between intervention and control, by adjusting for baseline measure of outcome, by calculating change scores or by calculating the difference-in- difference | Comparison before versus after; control arm only serves to demonstrate absence of trends. | Comparison before versus after. Analysis of slope and intercept of trends in the outcome measure (if multiple pre/post measures are available) |
| Special types | Stepped wedge design | Controlled interrupted time series analysis | Interrupted time series analysis |
Fig. 1Trend interpretation in non-randomised cluster trails (NCT) and before-and-after trails with control group (CBA): a good balance, no trend in control arm; b good balance, strong trend in control arm; c poor balance, no trend in control arm; d poor balance, strong trend in control arm; e poor balance, erratic trends; f poor balance, opposing trends
Study design and the potential level of evidence
| Study type | Designs with the potential to provide strong evidence | Designs with the potential to provide reasonable evidence | Designs likely to provide weak or misleading evidence |
|---|---|---|---|
| CRT | CRT that is blinded to participants and investigators OR uses an objective outcome measure | CRT that is not blinded to the participants AND uses a subjective outcome measure AND study participation is not obvious to participants (see text) | CRT that is not blinded to the participants AND uses a subjective outcome measure AND study participation is obvious to participants (see text) |
| NCT | NCT with very good balance of the outcome of interest at baseline across arms | NCT with imbalance of the outcome of interest at baseline across arms AND no major trend in the control arm | NCT with imbalance of the outcome of interest at baseline across arms AND a major trend in the control arm |
| CBA | CBA with reasonable balance of the outcome of interest at baseline across arms AND no major trend in the control arm | CBA with imbalance of the outcome of interest at baseline across arms OR major trend in the control arm | |
| BA | BA with multiple outcome measures before and after the intervention | BA with a single outcome measure before and after the intervention |