| Literature DB >> 35962318 |
Jennifer A Thompson1, Clemence Leyrat2, Katherine L Fielding3, Richard J Hayes3.
Abstract
BACKGROUND: Cluster randomised trials (CRTs) are often designed with a small number of clusters, but it is not clear which analysis methods are optimal when the outcome is binary. This simulation study aimed to determine (i) whether cluster-level analysis (CL), generalised linear mixed models (GLMM), and generalised estimating equations with sandwich variance (GEE) approaches maintain acceptable type-one error including the impact of non-normality of cluster effects and low prevalence, and if so (ii) which methods have the greatest power. We simulated CRTs with 8-30 clusters, altering the cluster-size, outcome prevalence, intracluster correlation coefficient, and cluster effect distribution. We analysed each dataset with weighted and unweighted CL; GLMM with adaptive quadrature and restricted pseudolikelihood; GEE with Kauermann-and-Carroll and Fay-and-Graubard sandwich variance using independent and exchangeable working correlation matrices. P-values were from a t-distribution with degrees of freedom (DoF) as clusters minus cluster-level parameters; GLMM pseudolikelihood also used Satterthwaite and Kenward-Roger DoF.Entities:
Keywords: Cluster level analysis; Cluster randomised trial; Cluster-level analysis; Comparison of methods; Generalised estimating equations; Generalised linear mixed model; Small number of clusters
Mesh:
Year: 2022 PMID: 35962318 PMCID: PMC9375419 DOI: 10.1186/s12874-022-01699-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Summary of simulation study scenarios
| Parameter | Number of scenarios | Values |
|---|---|---|
| 4 | 8, 12, 20, 30 | |
| 3 | 10, 50, 1000 | |
| 3 | ||
| 2 | 10%, 30% | |
| 2 | No effect, or odds ratio between 1.12 and 11.49 selected for each scenario to achieve 80% power | |
| 4 | 0.001, 0.01, 0.05, 0.1 | |
| 3 | Normal: Gamma Uniform Distributions are defined to give the specified between cluster variability set by the ICC |
Fig. 1Performance measures of cluster-level analysis methods by number of clusters (rows), cluster size and outcome prevalence (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error. Each dot represents a scenario summarised over the 1000 repetitions. All 864 scenarios are shown for each measure
Fig. 2Performance measures of GLMM methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 3Performance measures of GEE methods by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 4Comparison of bias and type-one error of unweighted cluster-level analysis, GLMM with REPL and DFCP, and GEE with FG standard errors and DFCP by number of clusters (rows), and mean cluster size (colour). Measures shown (columns): Standardised intervention effect estimate bias, standard error bias, type-one error
Fig. 5Power comparison of unweighted cluster-level analysis (CL.UNW), GLMM with REPL and DFCP (REPL), and GEE with FG standard errors and DFCP (FG.I) (columns) by number of clusters (rows), ICC (y axis), and variability of cluster size (colour)
Summary of simulation study results and recommendations on their use
| Use equal weighting of clusters and clusters minus cluster-level parameters degrees of freedom | Use restricted pseudo-likelihood and clusters minus cluster-level parameters degrees of freedom | Use Fay and Graubard standard errors, clusters minus cluster-level parameters degrees of freedom, and an independent working correlation matrix | |
Cluster size > 10 Or Common outcome (prevalence > 10%) | All scenarios | Cluster size ≤ 50 Or CV cluster size ≤ 0.5 | |
Common cluster size Or High ICC (ICC > 0.05) | Varying cluster size Or 20 + clusters | Low ICC ≤ 0.01 |
a Unbiased effects with controlled or conservative type-one error
b The method/s with greatest or similar to greatest power in a scenario
Fig. 6Motivating example results analysed by all methods considered in the simulation study. Left panel shows odds ratios and confidence intervals, right panel shows p values. Rows are analysis methods