| Literature DB >> 30582068 |
Lei Clifton1, Jacqueline Birks1, David A Clifton2.
Abstract
We discuss different methods of sample size calculation for two independent means, aiming to provide insight into the calculation of sample size at the design stage of a parallel two-arm randomised controlled trial (RCT). We compare different methods for sample size calculation, using published results from a previous RCT. We use variances and correlation coefficients to compare sample sizes using different methods, including 1. The choice of the primary outcome measure: post-intervention score vs. change from baseline score. 2. The choice of statistical methods: t-test without using correlation coefficients vs. analysis of covariance (ANCOVA). We show that the required sample size will depend on whether the outcome measure is the post-intervention score, or the change from baseline score, with or without baseline score included as a covariate. We show that certain assumptions have to be met when using simplified sample size equations, and discuss their implications in sample size calculation when planning an RCT. We strongly recommend publishing the crucial result "mean change (SE, standard error)" in a study paper, because it allows (i) the calculation of the variance of the change score in each arm, and (ii) to pool the variances from both arms. It also enables us to calculate the correlation coefficient in each arm. This subsequently allows us to calculate sample size using change score as the outcome measure. We use simulation to demonstrate how sample sizes by different methods are influenced by the strength of the correlation.Entities:
Keywords: Arm; Baseline; Change score; Correlation; Covariate; Independent; Means; Outcome measure; Post-intervention; RCT; Sample size; Standard deviation; Standard error; Variance
Year: 2018 PMID: 30582068 PMCID: PMC6297128 DOI: 10.1016/j.conctc.2018.100309
Source DB: PubMed Journal: Contemp Clin Trials Commun ISSN: 2451-8654
SF-36 energy score at baseline and 6-month post-intervention, reproduced using results from the MOSAIC trial.
| Energy | Control arm (N = 168) | CPAP arm (N = 171) |
|---|---|---|
| Baseline mean score (SD) | 49.7 (23.7) | 49.8 (22.4) |
| 6-month mean score (SD) | 53.9 (22.5) | 60.6 (20.9) |
| Mean change (SE) | +4.2 (1.4) | +10.8 (1.3) |
| Adjusted treatment effect (95% CI) | +6.6 (+3.1 to +10.1) | |
| p value | p < 0.0001 |
CPAP, continuous positive airway pressure; SF-36, 36-item Short-Form health survey; SD, standard deviation; SE, standard error; CI, confidence interval; N, number of participants.
Simulated sample sizes at different values of . “ by ANCOVA” produced by option 1 (plotted in Fig. 1) are the same as those produced by option 2 (plotted in Fig. 2). “ by t-test on post score” remains at a constant value of 170 throughout. In contrast, “ by t-test on change score” by option 1 and 2 are different, and are plotted in Figs. 1 and 2, respectively.
| Correlation | 0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1.0 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 170 | 169 | 164 | 155 | 143 | 128 | 109 | 87 | 62 | 33 | 0 | |
| 170 | 170 | 170 | 170 | 170 | 170 | 170 | 170 | 170 | 170 | 170 | |
| 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | 112 | |
| 363 | 326 | 290 | 254 | 218 | 182 | 146 | 110 | 73 | 37 | 0 |
Summary of sample variances.
| Energy score | Control arm (N = 168) | CPAP arm (N = 171) | Pooled |
|---|---|---|---|
| Variance of baseline score, | |||
| Variance of post score, | |||
| Variance of change score, | |||
| Correlation between baseline and post scores | 0.6925 | 0.6937 | – |
Comparing sample sizes using different outcome measures and statistical methods.
| Outcome | ||
|---|---|---|
| ANCOVA | ||
| 87 (85) | 170 (171) | |
| ( | – | 112 (113) |
, number of patients in each arm. calculated by equation are shown together with produced by PASS software: by equation ( by PASS).
Fig. 1Comparing values of sample size produced using different methods at different values of , using the same parameter values as are shown in Table 3. The values of remain fixed for all values of , resulting in a constant value of via a t-test for outcome (), shown by the short-dashed line. Fig. 1 is intended to be compared with Fig. 2, where the values of are allowed to vary according to the values of .
Fig. 2Similar to Fig. 1 above, except that the values of are allowed to vary according to the values of . Note that the range of the y-axis here is different from that in Fig. 1.