| Literature DB >> 31138319 |
Jody D Ciolino1, Alicia Diebold2, Jessica K Jensen2, Gerald W Rouleau3, Kimberly K Koloms4, Darius Tandon2.
Abstract
BACKGROUND: In cluster-randomized controlled trials (C-RCTs), covariate-constrained randomization (CCR) methods efficiently control imbalance in multiple baseline cluster-level variables, but the choice of imbalance metric to define the subset of "adequately balanced" possible allocation schemes for C-RCTs involving more than two arms and continuous variables is unclear. In an ongoing three-armed C-RCT, we chose the min(three Kruskal-Wallis [KW] test P values) > 0.30 as our metric. We use simulation studies to explore the performance of this and other metrics of baseline variable imbalance in CCR.Entities:
Keywords: Cluster randomization; Cluster-randomized controlled trial; Continuous covariate; Covariate-constrained randomization; Imbalance
Mesh:
Year: 2019 PMID: 31138319 PMCID: PMC6537428 DOI: 10.1186/s13063-019-3324-5
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Fig. 1Distribution of site-level randomization variables by arm for trial NCT02979444. We planned for a total of 42 randomized sites (6:18:18), but owing to dropout we have 38 active sites. Sample size and power considerations accounted for dropout that we observed. Medians (interquartile ranges) are displayed in each arm for each variable above
Threshold summary statistics by simulated scenario and imbalance criterion (1:3:3 scenarios)
| Imbalance criterion | Resampled | Hypothetical | ||
|---|---|---|---|---|
| N | % | N | % | |
| Min(KW | 60,508 | 60.51 | 62,341 | 62.34 |
| Inadequate ( | ||||
| Adequate ( | 39,492 | 39.49 | 37,659 | 37.66 |
| min(ANOVA | 61,691 | 61.69 | 61,114 | 61.11 |
| Inadequate ( | ||||
| Adequate ( | 38,309 | 38.31 | 38,886 | 38.89 |
| MANOVA | 29,997 | 30.00 | 29,824 | 29.82 |
| Inadequate ( | ||||
| Adequate ( | 70,003 | 70.00 | 70,176 | 70.18 |
| Min( | 90,223 | 90.22 | 87,872 | 87.87 |
| Inadequate ( | ||||
| Adequate ( | 9777 | 9.78 | 12,128 | 12.13 |
| Min(WRS | 88,029 | 88.03 | 87,114 | 87.11 |
| Inadequate ( | ||||
| Adequate ( | 11,971 | 11.97 | 12,886 | 12.89 |
Abbreviations: ANOVA analysis of variance, KW Kruskal–Wallis, MANOVA multivariate analysis of variance, WRS Wilcoxon rank-sum
Fig. 2Maximum pairwise imbalance observed for scenarios meeting adequacy threshold (P > 0.30) in simulated trials by criterion. Each panel represents the distribution of the max(mean difference) for simulated scenarios meeting the criterion for “adequate” overall variable balance across arms. All simulated schemes meet criteria for adequate under simple random allocation (panel a), but the remaining panels illustrate only those allocation schemes meeting the P > 0.30 criterion for each metric. The mean difference depicted is on the standard deviation unit scale. Those meeting this criterion would ideally have a small max(mean difference), and we deem a max(mean difference) > 1.0 (vertical line) unacceptable since previously a value of 0.8 would be deemed “large” [16]
Sensitivity and specificity of detecting 1.0 standard deviation max(mean differences) across arms (1:3:3 allocation)
| Imbalance criterion | Resampled | Hypothetical | ||||||
|---|---|---|---|---|---|---|---|---|
| max(mean diff) < 1.0 | max(mean diff) > 1.0 | max(mean diff) < 1.0 | max(mean diff) > 1.0 | |||||
| N | % | N | % | N | % | N | % | |
| KW | 44,340 | 54.15 | 16,168 | 89.27 | 43,800 | 53.80 | 18,541 | 99.73 |
| Inadequate ( | ||||||||
| Adequate ( | 37,549 | 45.85 | 1943 | 10.73 | 37,609 | 46.20 | 50 | 0.27 |
| ANOVA | 46,555 | 56.85 | 15,136 | 83.57 | 45,758 | 56.21 | 15,356 | 82.60 |
| Inadequate ( | ||||||||
| Adequate ( | 35,334 | 43.15 | 2975 | 16.43 | 35,651 | 43.79 | 3235 | 17.40 |
| MANOVA | 18,933 | 23.12 | 11,064 | 61.09 | 18,630 | 22.88 | 11,194 | 60.21 |
| Inadequate ( | ||||||||
| Adequate ( | 62,956 | 76.88 | 7047 | 38.91 | 62,779 | 77.12 | 7397 | 39.79 |
| 72,112 | 88.06 | 18,111 | 100.00 | 69,281 | 85.10 | 18,591 | 100.00 | |
| Inadequate ( | ||||||||
| Adequate ( | 9777 | 11.94 | 0 | 0 | 12,128 | 14.90 | 0 | 0 |
| WRS | 70,287 | 85.83 | 17,742 | 97.96 | 68,523 | 84.17 | 18,591 | 100.00 |
| Inadequate ( | ||||||||
| Adequate ( | 11,602 | 14.17 | 369 | 2.04 | 12,886 | 15.83 | 0 | 0 |
Abbreviations: ANOVA analysis of variance, KW Kruskal–Wallis, MANOVA multivariate analysis of variance, WRS Wilcoxon rank-sum
Fig. 3Pairwise scatterplots exploring associations between Kruskal–Wallis (KW) P value with other measures. The panels here present a selection of pairwise plots to illustrate the relationships between the imbalance metric we used in our randomization algorithm, the KW test P value, and additional candidate imbalance metrics explored. Each plot includes 5000 observations from the resampled scenarios using 1:3:3 allocation as in our present study. In each plot, there is often a non-linear relationship. For example, the first plot illustrating min(KW P value) in comparison with the multivariate analysis of variance (MANOVA) demonstrates a somewhat noisy relationship between the two whereby the min(KW P value) tends to be lower than the overall MANOVA P value, but the two are related. The comparison of the min(KW P value) versus the min(Wilcoxon rank-sum [WRS] P value) shows a more pronounced relationship and a clearer, non-linear pattern. All metrics of imbalance as determined are highly related; broadly, the more global tests (e.g., MANOVA) tended to be less conservative (i.e., have larger P value) than the corresponding more specific tests based on more than one comparison (e.g., WRS). The line y = x has been added for reference