| Literature DB >> 35862473 |
Moritz Fabian Danzer1, Jannik Feld1, Andreas Faldum1, Rene Schmidt1.
Abstract
The one-sample log-rank test is the method of choice for single-arm Phase II trials with time-to-event endpoint. It allows to compare the survival of patients to a reference survival curve that typically represents the expected survival under standard of care. The one-sample log-rank test, however, assumes that the reference survival curve is known. This ignores that the reference curve is commonly estimated from historic data and thus prone to sampling error. Ignoring sampling variability of the reference curve results in type I error rate inflation. We study this inflation in type I error rate analytically and by simulation. Moreover we derive the actual distribution of the one-sample log-rank test statistic, when the sampling variability of the reference curve is taken into account. In particular, we provide a consistent estimate of the factor by which the true variance of the one-sample log-rank statistic is underestimated when reference curve sampling variability is ignored. Our results are further substantiated by a case study using a real world data example in which we demonstrate how to estimate the error rate inflation in the planning stage of a trial.Entities:
Mesh:
Year: 2022 PMID: 35862473 PMCID: PMC9302761 DOI: 10.1371/journal.pone.0271094
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Empirical type I error rates under consideration of sampling variability.
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
|
using | ||||||||||
| 25 | 0.143 | 0.689 | 0.100 | 0.804 | 0.077 | 0.884 | 0.065 | 0.935 | 0.058 | 0.963 |
| 50 | 0.155 | 0.696 | 0.107 | 0.810 | 0.080 | 0.889 | 0.066 | 0.938 | 0.059 | 0.966 |
| 100 | 0.161 | 0.701 | 0.108 | 0.813 | 0.079 | 0.892 | 0.065 | 0.941 | 0.057 | 0.968 |
| 200 | 0.164 | 0.703 | 0.108 | 0.815 | 0.079 | 0.893 | 0.064 | 0.942 | 0.057 | 0.969 |
|
using | ||||||||||
| 25 | 0.167 | 0.689 | 0.117 | 0.804 | 0.086 | 0.884 | 0.071 | 0.935 | 0.063 | 0.963 |
| 50 | 0.169 | 0.696 | 0.114 | 0.810 | 0.084 | 0.889 | 0.070 | 0.938 | 0.061 | 0.966 |
| 100 | 0.167 | 0.701 | 0.112 | 0.813 | 0.082 | 0.892 | 0.067 | 0.941 | 0.059 | 0.968 |
| 200 | 0.166 | 0.703 | 0.110 | 0.815 | 0.080 | 0.893 | 0.065 | 0.942 | 0.058 | 0.969 |
(i) Empirical two–sided type I error rates α of test procedure (8) when used for testing H0 : Λ = Λ, and (ii) median factors as in (11) by which the true standard deviation of the one–sample log–rank statistic is underestimated when ignoring the reference curve sampling variability for different parameter constellations of practical relevance. Survival times were Weibull distributed with shape parameter κ = 1 and 1–year survival rate S1 = 0.5 in the historic control group A and the new treatment group B. Theoretical two–sided significance level: 5%. Underlying sample size of group B is n with allocation ratio π = n/n between new and historic groups.
Empirical one-sided type I error rates under consideration of sampling variability.
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
|
using | ||||||||||
| 25 | 0.081 | 0.062 | 0.066 | 0.034 | 0.054 | 0.023 | 0.048 | 0.017 | 0.043 | 0.015 |
| 50 | 0.087 | 0.069 | 0.065 | 0.042 | 0.052 | 0.029 | 0.044 | 0.022 | 0.039 | 0.019 |
| 100 | 0.087 | 0.074 | 0.063 | 0.046 | 0.047 | 0.032 | 0.040 | 0.025 | 0.035 | 0.022 |
| 200 | 0.086 | 0.077 | 0.060 | 0.048 | 0.045 | 0.034 | 0.037 | 0.027 | 0.033 | 0.024 |
|
using | ||||||||||
| 25 | 0.050 | 0.117 | 0.038 | 0.079 | 0.028 | 0.058 | 0.023 | 0.048 | 0.020 | 0.043 |
| 50 | 0.062 | 0.106 | 0.043 | 0.071 | 0.032 | 0.052 | 0.026 | 0.044 | 0.022 | 0.039 |
| 100 | 0.069 | 0.099 | 0.047 | 0.065 | 0.033 | 0.049 | 0.027 | 0.040 | 0.023 | 0.036 |
| 200 | 0.073 | 0.094 | 0.048 | 0.062 | 0.035 | 0.045 | 0.028 | 0.037 | 0.025 | 0.033 |
(i) Empirical one-sided type I error rates α1 and α2 of test procedures (9) when used for testing H0,sup and H0,inf, respectively, for different parameter constellations of practical relevance. Survival times were Weibull distributed with shape parameter κ = 1 and 1–year survival rate S1 = 0.5 in the historic control group A and the new treatment group B. Theoretical one–sided significance level: 2.5%. Underlying sample size of group B is n with allocation ratio π = n/n between new and historic groups.
Fig 1Type I error rate approximation.
Type I error rate approximation given by as a function of the allocation ratio π for different durations f ∈ {1, 2, 3} of the follow-up period in the new trial. Calculations were done for exponentially distributed survival times with a 1 year survival rate of 50%. Accrual a for the historic control and new treatment groups was set to 5 years, follow-up f of the historic trial was set to 3 years. To satisfy the conditions of Theorem 1 (see S1 Appendix), we choose smax = a + f−10−8.
Apriori estimated type I error rates under consideration of sampling variability.
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| 25 | 0.156 | 0.724 | 0.107 | 0.823 | 0.079 | 0.896 | 0.064 | 0.943 | 0.057 | 0.970 |
| 50 | 0.161 | 0.715 | 0.108 | 0.820 | 0.079 | 0.895 | 0.065 | 0.943 | 0.057 | 0.970 |
| 100 | 0.163 | 0.711 | 0.109 | 0.818 | 0.079 | 0.895 | 0.065 | 0.943 | 0.057 | 0.970 |
| 200 | 0.165 | 0.709 | 0.109 | 0.817 | 0.080 | 0.895 | 0.065 | 0.943 | 0.057 | 0.970 |
(i) Median a priori estimates of type I error rate αpre (see Eq (15)) of test procedure (8) when used for testing H0 : Λ = Λ, and (ii) median a priori estimates of underestimation of the standard deviation Rpre (see Eq (14)) of the one–sample log–rank statistic when ignoring the reference curve sampling variability for different parameter constellations of practical relevance. Survival times were Weibull distributed with shape parameter κ = 1 and 1–year survival rate S1 = 0.5 in the historic control group A. Underlying sample size of n = n/π with allocation ratio π.
Fig 2Distribution of survival and censoring variable.
Distribution of overall survival and censoring in the cohort treated with DPCA of the Mayo Clinical trial in primary biliary cirrhosis. Left: Cumulative hazards according to the Nelson-Aalen estimator. Right: Survival distributions according to the Kaplan-Meier estimator
Fig 3Type I error inflation.
Actual type I error levels of the classical one-sample log-rank test when sampling variability of the reference curve is ignored. Left: Variation of the allocation ratio with fixed accrual duration a and four different durations of the follow-up period f. Right: Variation of the length of the follow-up period f for a fixed allocation ratio π and three different durations of the accrual period a.