| Literature DB >> 35199380 |
Martina E McMenamin1,2, Jessica K Barrett1, Anna Berglind3, James M S Wason1,4.
Abstract
Mixed outcome endpoints that combine multiple continuous and discrete components are often employed as primary outcome measures in clinical trials. These may be in the form of co-primary endpoints, which conclude effectiveness overall if an effect occurs in all of the components, or multiple primary endpoints, which require an effect in at least one of the components. Alternatively, they may be combined to form composite endpoints, which reduce the outcomes to a one-dimensional endpoint. There are many advantages to joint modeling the individual outcomes, however in order to do this in practice we require techniques for sample size estimation. In this article we show how the latent variable model can be used to estimate the joint endpoints and propose hypotheses, power calculations and sample size estimation methods for each. We illustrate the techniques using a numerical example based on a four-dimensional endpoint and find that the sample size required for the co-primary endpoint is larger than that required for the individual endpoint with the smallest effect size. Conversely, the sample size required in the multiple primary case is similar to that needed for the outcome with the largest effect size. We show that the empirical power is achieved for each endpoint and that the FWER can be sufficiently controlled using a Bonferroni correction if the correlations between endpoints are less than 0.5. Otherwise, less conservative adjustments may be needed. We further illustrate empirically the efficiency gains that may be achieved in the composite endpoint setting.Entities:
Keywords: latent variable modeling; mixed outcome endpoints; sample size estimation
Mesh:
Year: 2022 PMID: 35199380 PMCID: PMC7612654 DOI: 10.1002/sim.9356
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.497
Figure 1Power function for individual SLEDAI (continuous), PGA (continuous), BILAG (ordinal), and Taper (binary) outcomes and the power functions with when they are treated as co-primary, multiple primary, and composite endpoints using data from the MUSE trial
Sample sizes n = n = n for the co-primary and multiple primary endpoints for overall power 1 − β ≈ 0.80, α = 0.025, k = 2, K = 4 using the MUSE trial data
| SLEDAI | PGA | BILAG | Taper | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| ( |
| ( |
|
|
|
|
|
|
|
| 0.88 | 18 | 0.38 | 0.35 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 403 | 46 | 365 | 39 | 273 | 99 |
| 0.88 | 19 | 0.38 | 0.35 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 419 | 46 | 386 | 39 | 273 | 99 |
| 0.88 | 20 | 0.38 | 0.35 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 435 | 46 | 406 | 39 | 273 | 99 |
| 0.88 | 18 | 0.38 | 0.45 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 403 | 55 | 365 | 49 | 273 | 99 |
| 0.88 | 18 | 0.38 | 0.55 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 403 | 63 | 365 | 60 | 273 | 99 |
| 0.88 | 18 | 0.38 | 0.65 | (0.97,0.95) | 0.24 | (0.54,0.38) | 0.40 | 403 | 70 | 365 | 71 | 273 | 99 |
Note: SS1, SS2, SS3, SS4 are sample sizes required per group for the individual endpoints for a power of at least 1 − β = 0.80.
Figure 2Overall power 1 − β to detect the treatment effects assumed from the MUSE trial for the systemic lupus erythematosus co-primary, multiple primary, and composite endpoints for different sample sizes per group n = n = n and differing correlations between outcomes, where Low = 0.3, Medium = 0.5, and High = 0.8
Sample sizes and empirical power (%) for n = n = n for the co-primary, multiple primary, and composite endpoints for overall power 1 − β ≈ 0.80, α = 0.025, k = 2, K = 4 with observed and latent effect sizes and correlation ρ equal to 0.3, 0.5, 0.8 where correlations are assumed to be equal between all endpoints
|
|
|
|
|
| Co-primary | Multiple primary | Composite |
|---|---|---|---|---|---|---|---|
| 0.12 | 0.12 | 0.12 | 0.12 | 0.0 | 1766 (80.1) | 591 (80.0) | 1031 (80.1) |
| 0.3 | 1692 (80.0) | 744 (80.1) | 883 (80.0) | ||||
| 0.5 | 1617 (80.0) | 867 (80.1) | 687 (80.0) | ||||
| 0.8 | 1439 (80.0) | 1117 (80.0) | 589 (79.9) | ||||
| 0.35 | 0.35 | 0.15 | 0.15 | 0.0 | 917 (79.9) | 105 (80.1) | 201 (79.9) |
| 0.3 | 894 (80.0) | 122 (79.9) | 156 (80.0) | ||||
| 0.5 | 870 (80.0) | 134 (80.0) | 115 (80.1) | ||||
| 0.8 | 815 (80.1) | 153 (80.0) | 92 (80.0) | ||||
| 0.12 | 0.35 | 0.55 | 0.10 | 0.0 | 1772 (80.2) | 61 (80.3) | 81 (80.0) |
| 0.3 | 1736 (80.2) | 67 (80.5) | 72 (80.1) | ||||
| 0.5 | 1700 (80.0) | 70 (79.9) | 67 (80.2) | ||||
| 0.8 | 1625 (80.2) | 74 (80.6) | 58 (80.1) |
Figure 3Family-wise error rate (FWER) of the multiple primary endpoints shown both unadjusted and adjusted using the Bonferroni correction. FWERs are shown for K = (2, 3, 4) outcomes and correlations are constrained to be equal between all outcomes