| Literature DB >> 31208463 |
Hubert Wong1, Yongdong Ouyang2, Mohammad Ehsanul Karim2.
Abstract
Health researchers are familiar with the concept of trial power, a number that prior to the start of a trial is intended to describe the probability that the results of the trial will correctly conclude that the intervention has an effect. Trial power, as calculated using standard software, is an expected power that arises from averaging hypothetical trial results over all possible treatment allocations that could be generated by the randomization algorithm. However, in the trial that ultimately is conducted, only one treatment allocation will occur, and the corresponding attained power (conditional on the allocation that occurred) is not guaranteed to be equal to the expected power and may be substantially lower. We provide examples illustrating this issue, discuss some circumstances when this issue is a concern, define and advocate the examination of the pre-randomization power distribution for evaluating the risk of obtaining unacceptably low attained power, and suggest the use of randomization restrictions to reduce this risk. In trials that randomize only a modest number of units, we recommend that trial designers evaluate the risk of getting low attained power and, if warranted, modify the randomization algorithm to reduce this risk.Entities:
Keywords: Attained power; Power; Power distribution; Restricted randomization; Stepped-wedge design
Year: 2019 PMID: 31208463 PMCID: PMC6580524 DOI: 10.1186/s13063-019-3471-8
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Fig. 1Risk of the attained power falling below threshold values (77%, 75%, or 73%) due to sample size imbalance as the number of units randomized (total sample size) increases in a parallel, two-arm, individually randomized trial using unrestricted randomization. Results were obtained under the condition that the power is 80% when the sample sizes are equal in the two arms. The risk of low attained power can be substantial with small sample sizes but decreases rapidly as the sample size increases. (The lack of smoothness is a consequence of discretization on the probabilities of obtaining allocations with different sample sizes.)
The risk that the attained power will fall below selected threshold values
| Threshold | Risk that the attained power falls below the threshold value | ||
|---|---|---|---|
| Example 1 (unrestricted)a | Example 4 (unrestricted)b | Example 4 (restricted)c | |
| 72% | 0.6% | 0.0% | 0.0% |
| 73% | 1.7% | 0.2% | 0.0% |
| 74% | 1.7% | 1.0% | 0.0% |
| 75% | 3.8% | 2.7% | 0.0% |
| 76% | 3.8% | 7.3% | 0.0% |
| 77% | 8.1% | 16.6% | 0.0% |
| 78% | 15.4% | 33.5% | 0.0% |
| 79% | 26.8% | 57.9% | 0.0% |
| 80% | 87.5% | 81.0% | 5.4% |
| 81% | 100.0% | 97.8% | 68.0% |
| 82% | 100.0% | 100.0% | 99.0% |
| 83% | 100.0% | 100.0% | 100.0% |
aResults are for unrestricted randomization in Example 1.
bResults are for unrestricted randomization in Example 4.
cResults are for a randomization algorithm that allows only allocations with the four largest clusters transitioning at the first or last steps (two clusters at each of these steps) in Example 4.
Fig. 2Density plot (histogram) of the power distribution. Upper panel: Individually randomized trial from Example 1 with unrestricted randomization. Middle panel: Stepped-wedge trial from Example 4 with unrestricted randomization. Lower panel: Stepped-wedge trial from Example 4 with restricted randomization (four largest clusters transitioning to intervention at the first or last step, two clusters at each of these steps).