| Literature DB >> 35439947 |
Maria Stark1, Mailin Hesse2, Werner Brannath3, Antonia Zapf4.
Abstract
BACKGROUND: The sample size calculation in a confirmatory diagnostic accuracy study is performed for co-primary endpoints because sensitivity and specificity are considered simultaneously. The initial sample size calculation in an unpaired and paired diagnostic study is based on assumptions about, among others, the prevalence of the disease and, in the paired design, the proportion of discordant test results between the experimental and the comparator test. The choice of the power for the individual endpoints impacts the sample size and overall power. Uncertain assumptions about the nuisance parameters can additionally affect the sample size.Entities:
Keywords: Adaptive design; Co-primary endpoints; Paired design; Sensitivity; Specificity; Unpaired design
Mesh:
Year: 2022 PMID: 35439947 PMCID: PMC9019976 DOI: 10.1186/s12874-022-01564-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Fig. 1Study designs of a confirmatory diagnostic accuracy trial
Assumptions of the paired diagnostic accuracy trial for the comparison of the experimental Positron Emission Tomography (PET) combined with the computed tomography (CT) against the comparator test PET
| General input parameters: | |||
|---|---|---|---|
| Prevalence: | Comparator test (CT) | Experimental test (PET/CT) | Proportion of discordant test results |
| Diseased population | SeC = 0.81 | SeE = 0.90 |
|
| Non-diseased population | SpC = 0.66 | SpE = 0.80 |
|
Fig. 2Empirical power and sample size of the conventional and optimal sample size calculation. Simulations are based on the example study given in Table 1 with a varying prevalence π. The figure considering the sample size contains an enlarged image section so that the differences between both approaches are highlighted
Fig. 3Procedure of the blinded adaptive design in an unpaired and paired diagnostic trial
Results in a paired diagnostic study
|
| |||
|
| |||
|
| ||
|
|
| |
|
|
| |
|
| |||
| |||
|
|
|
|
|
|
| |
Comparison of the blinded adaptive design procedure with McCray et al. [11]
| McCray et al. (2017) | Our approach | ||
|---|---|---|---|
| General information |
|
| SeE − SeC and SpE − SpC |
|
|
|
| |
|
| Conventional approach
Power per endpoint: 0.8 | Optimal approach
Overall power: 0.8 | |
|
|
|
| |
Initial sample size calculation |
| TPPRmax and TNNRmax correspond to | |
|
| TPPRmax = SeC = 0.81 TNNRmax = SpC = 0.66 |
| |
|
| 186 | 133 | |
Sample size re-estimation |
|
|
|
|
| 242 | 200 | |
Simulated scenarios in the unpaired and paired study design testing for superiority in both endpoints. The proportion of discordant test results is only relevant in the paired design
| 10,000 simulation runs per scenario | ||
|---|---|---|
| Nominal significance level α per endpoint | 0.05 (two-sided) | |
| Nominal overall target power | 0.8 | |
|
|
| |
| Sensitivity comparator test SeC | 0.8 | 0.6, 0.7 |
| Specificity comparator test SpC | 0.7 | 0.6, 0.8 |
| True prevalence | 0.2 | 0.4, 0.6, 0.8 |
| Assumed prevalence | ||
True discordant results diseased population | 0.11 (0.15, if: SeE − SeC = 0.15) | 0.18, 0.26 |
Assumed discordant results diseased population | 0.18 | |
True discordant results non-diseased population | 0.14 (0.15, if: SpE − SpC = 0.15) | 0.24, 0.38 |
Assumed discordant results non-diseased population | 0.24 | |
| Sensitivity experimental test SeE |
| |
| Specificity experimental test SpE |
| |
| Sensitivity experimental test SeE | SeC + 0.1 | SeC + 0.05 SeC + 0.15 |
| Specificity experimental test SpE | SpC + 0.1 | SpC + 0.05 SpC + 0.15 |
Description of the data generation mechanism of the unpaired and paired design in the simulation study (Bin: binomial distribution, MVBin: multivariate binomial distribution, k: number of trials, p: success probability, ρ: dependence between both tests, N: total sample size, n: diseased individuals in experimental group, n: diseased individuals in comparator group)
| Unpaired design | Paired design | |
|---|---|---|
Diseased individuals (nD) according to reference standard |
|
|
True Positive Results (TP) |
|
|
True Negative Results (TN) |
|
|
Fig. 4Global type I error, overall power and sample sizes of the fixed and adaptive paired design. Simulations are based on the initial scenario and a variation of the true proportion of discordant test results in the diseased population (). In Fig. A, black dotted lines mark the interval of Monte Carlo error due to simulations. In Fig. B, the target power equals 0.8
Fig. 5Overall power of the fixed and adaptive design in an unpaired and paired diagnostic study. Simulations are based on the initial scenario and a variation of the true prevalence (πtrue). The target power equals 0.8