| Literature DB >> 32375680 |
Ash Bullement1, Anna Willis2, Amerah Amin3, Michael Schlichting4, Anthony James Hatswell1,5, Murtuza Bharmal6.
Abstract
BACKGROUND: Due to limited duration of follow up in clinical trials of cancer treatments, estimates of lifetime survival benefits are typically derived using statistical extrapolation methods. To justify the method used, a range of approaches have been proposed including statistical goodness-of-fit tests and comparing estimates against a previous data cut (i.e. interim data collected). In this study, we extend these approaches by presenting a range of extrapolations fitted to four pre-planned data cuts from the JAVELIN Merkel 200 (JM200) trial. By comparing different estimates of survival and goodness-of-fit as JM200 data mature, we undertook an iterative process of fitting and re-fitting survival models to retrospectively identify early indications of likely long-term survival.Entities:
Keywords: Cancer; Extrapolation; Immune-oncology; Immunotherapy; Survival
Mesh:
Year: 2020 PMID: 32375680 PMCID: PMC7204248 DOI: 10.1186/s12874-020-00997-x
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Data cuts from Part A of the JAVELIN Merkel 200 clinical trial
| Label | Database lock | Minimum patient follow-up | Source(s) |
|---|---|---|---|
| 12mo | September 3, 2016 | 12 months | Kaufman et al., (2018) [ |
| 18mo | March 24, 2017 | 18 months | D’Angelo et al., (2018) [ |
| 24mo | September 26, 2017 | 24 months | Nghiem et al., (2018) [ |
| 36mo | September 14, 2018 | 36 months | D’Angelo et al., (2020) [ |
Measures of statistical goodness-of-fit
| Acronym | Full name | Formula |
|---|---|---|
| −2 | −2 × | −2 |
Key:k Number of model parameters; L Maximized likelihood function; log Natural logarithm; n Number of data points (sample size)
Fig. 1Overall survival data from Part A of the JAVELIN Merkel 200 clinical trial. Key: mFU, minimum follow-up; mo, month(s)
Fig. 2Smoothed hazard plots from Part A of the JAVELIN Merkel 200 clinical trial. Note: Owing to the sample size of JAVELIN Merkel 200 Part A (n = 88 patients), the max.time argument required by the muhaz function was set to the minimum follow-up time for each data cut. Consequently, the smoothed hazard estimate for each data cut is presented within this figure for a limited time period
Statistical goodness-of-fit scores of fitted models
| Statistic | Model | 12mo mFU | 18mo mFU | 24mo mFU | 36mo mFU |
|---|---|---|---|---|---|
| Log-logistic | 375.3 | 429.4 | 453.5 | 483.0 | |
| Log-normal | 373.7 | 427.5 | 451.3 | 480.5 | |
| Gen Gamma | 373.3 | 426.5 | 448.7 | 475.0 | |
| 1-knot Hazard | 373.3 | 426.5 | 448.0 | 471.9 | |
| 1-knot Odds | 373.3 | 426.3 | 447.8 | 472.5 | |
| 1-knot Normal | 373.4 | 426.6 | 448.8 | 474.2 | |
| 2-knot Hazard | 373.4 | 426.5 | 447.4 | 470.8 | |
| 2-knot Odds | 373.3 | 426.3 | 447.3 | 471.3 | |
| 2-knot Normal | 373.2 | 426.2 (2) | 447.2 | 471.1 | |
| 3-knot Hazard | 372.7 (3) | 426.4 | 446.6 (2) | 469.6 (1) | |
| 3-knot Odds | 372.5 (2) | 426.3 (3) | 446.7 (3) | 469.9 (3) | |
| 3-knot Normal | 372.2 (1) | 426.1 (1) | 446.6 (1) | 469.8 (2) | |
| Log-logistic | 379.3 | 433.4 | 457.5 | 487.0 | |
| Log-normal | 377.7 (1) | 431.5 (1) | 455.3 | 484.5 | |
| Gen Gamma | 379.3 | 432.5 | 454.7 (3) | 481.0 | |
| 1-knot Hazard | 379.3 (3) | 432.5 (3) | 454.0 (2) | 477.9 (1) | |
| 1-knot Odds | 379.3 (2) | 432.3 (2) | 453.8 (1) | 478.5 (2) | |
| 1-knot Normal | 379.4 | 432.6 | 454.8 | 480.2 | |
| 2-knot Hazard | 381.4 | 434.5 | 455.4 | 478.8 (3) | |
| 2-knot Odds | 381.3 | 434.3 | 455.3 | 479.3 | |
| 2-knot Normal | 381.2 | 434.2 | 455.2 | 479.1 | |
| 3-knot Hazard | 382.7 | 436.4 | 456.6 | 479.6 | |
| 3-knot Odds | 382.5 | 436.3 | 456.7 | 479.9 | |
| 3-knot Normal | 382.2 | 436.1 | 456.6 | 479.8 | |
| Log-logistic | 379.4 (2) | 433.5 | 457.6 | 487.2 | |
| Log-normal | 377.8 (1) | 431.7 (1) | 455.5 | 484.7 | |
| Gen Gamma | 379.6 | 432.8 | 455.0 (3) | 481.3 | |
| 1-knot Hazard | 379.6 | 432.8 (3) | 454.3 (2) | 478.2 (1) | |
| 1-knot Odds | 379.5 (3) | 432.6 (2) | 454.1 (1) | 478.8 (2) | |
| 1-knot Normal | 379.6 | 432.9 | 455.0 | 480.5 | |
| 2-knot Hazard | 381.9 | 434.9 | 455.9 | 479.3 (3) | |
| 2-knot Odds | 381.8 | 434.8 | 455.8 | 479.7 | |
| 2-knot Normal | 381.7 | 434.7 | 455.7 | 479.6 | |
| 3-knot Hazard | 383.4 | 437.2 | 457.3 | 480.3 | |
| 3-knot Odds | 383.2 | 437.0 | 457.4 | 480.6 | |
| 3-knot Normal | 382.9 | 436.8 | 457.3 | 480.6 | |
| Log-logistic | 381.3 (2) | 435.4 (3) | 459.5 | 489.0 | |
| Log-normal | 379.7 (1) | 433.5 (1) | 457.3 (3) | 486.5 | |
| Gen Gamma | 382.3 | 435.5 | 457.7 | 484.0 | |
| 1-knot Hazard | 382.3 | 435.5 | 457.0 (2) | 480.9 (1) | |
| 1-knot Odds | 382.3 (3) | 435.3 (2) | 456.8 (1) | 481.5 (2) | |
| 1-knot Normal | 382.4 | 435.6 | 457.8 | 483.2 | |
| 2-knot Hazard | 385.4 | 438.5 | 459.4 | 482.8 (3) | |
| 2-knot Odds | 385.3 | 438.3 | 459.3 | 483.2 | |
| 2-knot Normal | 385.2 | 438.2 | 459.2 | 483.1 | |
| 3-knot Hazard | 387.7 | 441.4 | 461.6 | 484.6 | |
| 3-knot Odds | 387.4 | 441.3 | 461.7 | 484.9 | |
| 3-knot Normal | 387.2 | 441.1 | 461.6 | 484.8 | |
| Log-logistic | 384.2 (2) | 438.4 (2) | 462.4 | 492.0 | |
| Log-normal | 382.7 (1) | 436.5 (1) | 460.3 (1) | 489.5 | |
| Gen Gamma | 386.8 | 440.0 | 462.1 | 488.4 | |
| 1-knot Hazard | 386.7 | 440.0 | 461.4 (3) | 485.3 (1) | |
| 1-knot Odds | 386.7 (3) | 439.7 (3) | 461.2 (2) | 485.9 (2) | |
| 1-knot Normal | 386.8 | 440.0 | 462.2 | 487.7 (3) | |
| 2-knot Hazard | 391.3 | 444.4 | 465.3 | 488.7 | |
| 2-knot Odds | 391.2 | 444.2 | 465.2 | 489.2 | |
| 2-knot Normal | 391.1 | 444.1 | 465.1 | 489.0 | |
| 3-knot Hazard | 395.1 | 448.8 | 469.0 | 492.0 | |
| 3-knot Odds | 394.8 | 448.7 | 469.1 | 492.3 | |
| 3-knot Normal | 394.6 | 448.5 | 469.0 | 492.2 |
Key:AIC Akaike information criterion; AICc Akaike information criterion (corrected); BIC Bayesian information criterion; Hannan–Quinn information criterion; L Maximized likelihood function; log Natural logarithm; mFU Minimum follow up; mo Month(s)
Note: For each of the scores presented above, a lower value indicates a better statistical goodness-of-fit. The “best” fitting model (i.e. the model with the lowest score) is denoted with “(1)” after the score, and is shaded in dark grey. Models with ranks 2 and 3 are formatted similarly
Fig. 3Fitted models from Part A of the JAVELIN Merkel 200 clinical trial. Notes: A, 12-month data cut; B, 18-month data cut; C, 24-month data cut; D, 36-month data cut. Key: k, knot(s); KM, Kaplan-Meier; n, normal; o, odds.
Prediction accuracy key findings
| Data cut | Model | Criteria for model selection | Prediction accuracy (months) | |||
|---|---|---|---|---|---|---|
| PE | RMST | |||||
| Fitted (earlier) data cut | Latest (36-mo) data cut | Fitted (earlier) data cut | Latest (36-mo) data cut | |||
| 12-mo | KM estimates | |||||
| 12-mo | Log-normal | Best AIC, BIC | −0.3 | −10.2 | −0.2 | −0.9 |
| 12-mo | Log-logistic | Lowest RMST, PE | −0.7 | −11.5 | −0.2 | −1.2 |
| 12-mo | 1-knot Odds | Highest RMST, PE | −0.8 | −6.6 | −0.3 | −0.5 |
| 18-mo | KM estimates | |||||
| 18-mo | Log-normal | Best AIC, BIC | + 1.1 | −8.7 | + 0.0 | −0.5 |
| 18-mo | Log-logistic | Lowest RMST, PE | + 0.0 | −9.8 | − 0.0 | − 0.8 |
| 18-mo | 1-knot Odds | Highest RMST | + 1.2 | −5.1 | − 0.2 | − 0.4 |
| 18-mo | 3-knot Odds | Highest PE | + 1.1 | −4.8 | −0.2 | − 0.4 |
| 24-mo | KM estimates | |||||
| 24-mo | 1-knot Odds | Best AIC | + 1.2 | −2.0 | + 0.0 | + 0.0 |
| 24-mo | Log-normal | Best BIC | −0.1 | −6.0 | + 0.4 | + 0.1 |
| 24-mo | Log-logistic | Lowest RMST, PE | −1.6 | −7.4 | + 0.2 | −0.3 |
| 24-mo | Gen Gamma | Highest RMST | + 1.2 | −2.7 | + 0.2 | + 0.1 |
| 24-mo | 3-knot Hazard | Highest PE | + 0.9 | −0.4 | −0.1 | − 0.1 |
| 36-mo | KM estimates | |||||
| 36-mo | 1-knot Hazard | Best AIC, BIC, highest PE | + 1.4 | + 1.4 | + 0.2 | + 0.2 |
| 36-mo | 3-knot Hazard | Lowest RMST | + 0.9 | + 0.9 | −0.1 | −0.1 |
| 36-mo | Log-normal | Highest RMST | −2.3 | − 2.3 | + 0.9 | + 0.9 |
| 36-mo | Log-logistic | Lowest PE | −4.1 | −4.1 | + 0.4 | + 0.4 |
Key:AIC Akaike’s information criterion; BIC Bayesian information criterion; Gen Generalized; mo Month(s); PE Point estimate; RMST Restricted mean survival time
Note: Negative values indicate that the model underestimates survival, whereas positive values indicate that the model overestimates survival. ‘Fitted’ refers to the data cut from which the models were fitted (i.e. the data cut stated within the left-hand column), and so a comparison is made between a model fitted to a given data cut and the Kaplan-Meier curve for this same data cut. ‘Latest’ refers to the 36-month data cut, and so a comparison is made between a model fitted to the specified data cut (which may be earlier) and the Kaplan-Meier curve for the 36-month data cut. Models were included in this table if one or more of the following criteria were met: (1) the model provided the ‘best’ AIC or BIC score, (2) the model provided either the ‘highest’ or ‘lowest’ estimate of RMST at 36 months, or (3) the model provided either the ‘highest’ or ‘lowest’ PE of survival at 36 months. Where RMST estimates were tied (to the nearest 0.1 month), the model with the lowest AIC or BIC was included here. Full prediction accuracy results are provided within the supplementary material