| Literature DB >> 34870495 |
Marta Bofill Roig1,2, Guadalupe Gómez Melis1.
Abstract
We propose a class of two-sample statistics for testing the equality of proportions and the equality of survival functions. We build our proposal on a weighted combination of a score test for the difference in proportions and a weighted Kaplan-Meier statistic-based test for the difference of survival functions. The proposed statistics are fully non-parametric and do not rely on the proportional hazards assumption for the survival outcome. We present the asymptotic distribution of these statistics, propose a variance estimator, and show their asymptotic properties under fixed and local alternatives. We discuss different choices of weights including those that control the relative relevance of each outcome and emphasize the type of difference to be detected in the survival outcome. We evaluate the performance of these statistics with small sample sizes through a simulation study and illustrate their use with a randomized phase III cancer vaccine trial. We have implemented the proposed statistics in the R package SurvBin, available on GitHub (https://github.com/MartaBofillRoig/SurvBin).Entities:
Keywords: Clinical trials; mixed outcomes; multiple endpoints; non-proportional hazards; survival analysis; weighted Mean survival test
Mesh:
Year: 2021 PMID: 34870495 PMCID: PMC8829729 DOI: 10.1177/09622802211048030
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Illustration of two different follow-up configurations, the red and blue arrows represent the time-frame for binary and time-to-event outcomes, respectively. The red line goes from the start of the study (at time-point ) until the binary outcome is evaluated at time . The blue (dashed) line goes from when the time-to-event information begins to be collected ( ) to the end of the study ( ).
Figure 2.Kaplan–Meier curves for overall survival for ipilimumab-plus-gp100 and gp100-alone groups (arms 1 and 0, respectively).
Comparison of statistics (at years) using different weights: -statistics with different combinations of ; and weighted Kaplan–Meier (WKM) statistics and weighted log-rank (WLR) statistics with different .
| Statistics | Weights | Standardized test |
|---|---|---|
|
| 4.11 | |
|
| 3.93 | |
|
| 2.94 | |
| WKM statistics |
| 3.70 |
| WKM statistics |
| 3.69 |
| WLR statistics |
| 3.13 |
| WLR statistics |
| 2.75 |
Scenarios used in the simulation study.
| Parameter | Value | Parameter | Value |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
| HR |
|
|
| ( |
|
|
Median empirical size and median empirical power from and replications, respectively. The empirical size and powers are calculated using: the -statistics (in (2)) according to the pooled, unpooled, bootstrap variance estimators (labeled as Pooled, Unpooled, and Boots.); and the Bonferroni procedure (Bonf.). Under the null hypothesis there is no effect on any of the endpoints ( HR ). Under the alternative hypothesis there is effect on both endpoints (Case 1: HR ) and the effect on the survival endpoint satisfies the proportional hazards assumptions ( ).
| Empirical size | Empirical powers (Case 1) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Pooled | Unpooled | Boots. | Bonf. | Pooled | Unpooled | Boots. | Bonf. | ||
|
|
| 0.053 | 0.053 | 0.050 | 0.050 | 0.84 | 0.85 | 0.83 | 0.80 |
|
| 0.056 | 0.055 | 0.050 | 0.050 | 0.85 | 0.85 | 0.83 | 0.79 | |
|
| 0.001 | 0.051 | 0.051 | 0.052 | 0.051 | 0.87 | 0.88 | 0.87 | 0.82 |
| 0.510 | 0.054 | 0.055 | 0.049 | 0.050 | 0.84 | 0.83 | 0.81 | 0.79 | |
| 0.910 | 0.056 | 0.056 | 0.048 | 0.049 | 0.82 | 0.82 | 0.79 | 0.78 | |
|
| 0.1 | 0.053 | 0.054 | 0.050 | 0.051 | 0.89 | 0.89 | 0.88 | 0.84 |
| 0.3 | 0.053 | 0.055 | 0.050 | 0.049 | 0.78 | 0.78 | 0.76 | 0.73 | |
|
| 0.5 | 0.053 | 0.054 | 0.051 | 0.050 | 0.87 | 0.87 | 0.86 | 0.82 |
| 1 | 0.053 | 0.054 | 0.050 | 0.050 | 0.84 | 0.85 | 0.83 | 0.80 | |
| 2 | 0.054 | 0.055 | 0.048 | 0.050 | 0.82 | 0.81 | 0.79 | 0.78 | |
|
| (0,0,1) | 0.054 | 0.053 | 0.051 | 0.050 | 0.84 | 0.85 | 0.83 | 0.80 |
| (0,1,1) | 0.054 | 0.055 | 0.051 | 0.050 | 0.84 | 0.85 | 0.83 | 0.80 | |
| (1,0,1) | 0.053 | 0.055 | 0.050 | 0.050 | 0.83 | 0.82 | 0.80 | 0.79 | |
| (1,1,1) | 0.054 | 0.053 | 0.050 | 0.050 | 0.86 | 0.86 | 0.84 | 0.81 | |
Figure 3.Boxplot of empirical powers based on scenarios in Table 2. The empirical powers are calculated using: the -statistics (in (2)) according to the pooled, unpooled, bootstrap variance estimators; the Bonferroni procedure; and the individual statistics (3) and (4). The individual statistics for the binary and survival endpoints are labeled, respectively, as BE and SE. The color indicates which combination of weights ( ) were used: red for ( ); blue for ( ); and green for ( ).