Literature DB >> 36171546

Validating ORR and PFS as surrogate endpoints in phase II and III clinical trials for NSCLC patients: difference exists in the strength of surrogacy in various trial settings.

Tiantian Hua1,2, Yuan Gao2, Ruyang Zhang2, Yongyue Wei2, Feng Chen3,4.   

Abstract

OBJECTIVE: This study aims to systematically validate the performance of surrogate endpoints in phase II and III clinical trials for NSCLC patients under various trial settings.
METHODS: A literature search retrieved all registered phase II and III trials of NSCLC patients in which OS, with at least one of ORR and PFS, were reported. Associations between surrogate and true endpoints were assessed on two levels. On the arm level, three pairs of correlations, i.e., ORR vs. median OS, ORR vs. median PFS, and median PFS vs. median OS, were analysed using Spearman's rho. On the trial level, similarly, three pairs of correlations, i.e., ΔORR vs. HR of OS, ΔORR vs. HR of PFS, and HR of PFS vs. HR of OS, were analysed using Spearman's rho and weighted linear regression model respectively. Finally, sensitivity analyses were performed to explore surrogacy under various trial settings.
RESULTS: At arm level, three pairs of correlations are all high (Spearman's rho = 0.700, 0.831, 0.755, respectively). At trial level, there is a low correlation between ΔORR and HR of OS, a high correlation between ΔORR and HR of PFS and a moderate correlation between HR of PFS and HR of OS (Spearman's rho = 0.462, 0.764, 0.584, respectively). In the sensitivity analysis, we find correlations between surrogate and true endpoints vary by different trial settings. It is noteworthy that the strength of surrogacy of these intermediate endpoints in targeted therapy is greater than that in immunotherapy.
CONCLUSION: According to the arm-level and trial level-analysis, we suggest that in phase II and III trials of targeted therapy and immunotherapy for NSCLC patients: 1) ORR lacks validity for the surrogacy of OS, excluding in first-line therapy, and 2) ORR may be an appropriate surrogate endpoint for PFS, and 3) PFS may be considered a modest surrogacy for OS, with better performance in first-line therapy trials. Moreover, to provide more convincing evidence of surrogacy of the surrogate endpoints, patient-level analyses are in desperate need.
© 2022. The Author(s).

Entities:  

Keywords:  Immunotherapy; Meta-analysis; Non-small cell lung cancer; Surrogate endpoint; Targeted therapy

Mesh:

Substances:

Year:  2022        PMID: 36171546      PMCID: PMC9520950          DOI: 10.1186/s12885-022-10046-z

Source DB:  PubMed          Journal:  BMC Cancer        ISSN: 1471-2407            Impact factor:   4.638


Background

Most recently, targeted therapy and immunotherapy have sprung up in cancer therapy [1]. The development of targeted therapy has advanced the therapeutic strategy from conventional chemo-based and radiation-based therapy to genetic alteration-guided targeted therapy [2, 3]. Meanwhile, the advent of immunotherapy leads to greater availability of effective subsequent treatments and extended survival in previously treated advanced non-small cell lung cancer (NSCLC), of which a good example is the success of clinical trials for PD-1/PD-L1 inhibitor in tumour treatment [4, 5]. In oncology trials, intermediate/surrogate endpoints are often used as primary endpoints instead of true endpoints such as the overall survival (OS) over the past decade [6]. Surrogate endpoints are not intrinsically beneficial to patients but are designed to be easier, faster and cheaper to measure than clinically meaningful outcomes because they can reduce the sample size, shorten the duration and save the cost of trials [7, 8]. Although there have been many studies trying to determine the surrogacy of intermediate endpoints, the results didn’t reach a consensus [9-15]. Besides, evaluating the surrogacy of the intermediate endpoints in these new domains, such as immunotherapy and targeted therapy, raises a to-be-solved challenge. We, therefore, performed a broad-based structured review and meta-analysis of registered phase II and III NSCLC trials with immunotherapy and/or targeted therapy. The objective of this research was to find the appropriacy of surrogate endpoints in NSCLC studies under various trial settings (e.g. line of therapy, trial phase, blinding and therapy type), to give suggestions on which surrogate endpoints should be used under certain trial conditions.

Methods

Search strategy

We conducted this analysis in compliance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Statement [16]. Three databases (PubMed, EMBASE, Cochrane Library) were searched with time ranged from January 2000 to May 2021. Search terms included cancer terms AND therapy terms AND terms for PFS (Progression Free Survival, PFS), ORR (Objective Response Rate, ORR) and/or OS (Overall Survival, OS) AND terms for endpoint and/or surrogate. Search results were limited to the English language and in-human studies. The detailed Cochrane Library search strategy is provided in Additional file 1. The PRISMA flow diagram is shown in Fig. 1.
Fig. 1

PRISMA flow diagram for study inclusion

PRISMA flow diagram for study inclusion

Inclusion and exclusion criteria

Inclusion was restricted to registered clinical trials reporting results of OS, with either PFS or ORR, or both. The included studies should report the NCT number. Protocols and ongoing trials without results were excluded.

Study selection and data extraction

Titles and abstracts of articles retrieved by the search were examined by two independent reviewers, followed by a discussion to ensure consistency in the selection decisions. Full texts were examined by one reviewer and a subset was checked by a second reviewer, with any discrepancies resolved through discussion. Data were extracted by one reviewer and checked by a second reviewer. For eligible studies, the following characteristics were extracted: authors, publication year, number of patients included, type of lung cancer, stage of lung cancer, therapy type, NCT number, surrogate and final endpoints analysed, line of therapy, phase of the trial, results including ORR, median PFS and median OS of each arm, and ΔORR, HR of PFS and HR of OS between arms. The summary of included articles is shown in Additional file 2.

Data synthesis and analysis

According to the definition and validation criteria for surrogate endpoints [17, 18], two conditions are necessary to measure whether an intermediate endpoint can be an acceptable surrogacy for the true clinical endpoint. First, for the individual level, there must be a strong association between the surrogate endpoint and the true endpoint. Second, for the trial level, there must be a strong association between the treatment effect of the surrogate endpoint and that of the true endpoint [19]. However, the individual level data are usually not available, so we use arm level data as an alternative, which can also be seen in many other studies [20-23]. To describe the correlation strength between surrogate and true endpoints in this study, we divided values of the correlation coefficient into five levels, as shown in Table 1.
Table 1

Rule for interpreting the strength of a correlation coefficient

Size of Correlation (absolute value)Interpretation
[0.90,1.00]Very high positive (negative) correlation
[0.70,0.90)High positive (negative) correlation
[0.50,0.70)Moderate positive (negative) correlation
[0.30,0.50)Low positive (negative) correlation
[0.00,0.30)Negligible correlation
Rule for interpreting the strength of a correlation coefficient The Spearman rank correlation coefficient, rs, was used to measure the arm-level associations, which means correlation coefficients between ORR and median OS, between ORR and median PFS, and between median PFS and median OS were calculated from all arms of all included studies. To assess trial-level associations, correlations between OR of ORR (Odds Ratio of ORR) and HR of OS (Hazard Ratio of OS), between OR of ORR and HR of PFS, and between HR of PFS (Hazard Ratio of OS) and HR of OS were needed. In light of the situation that few studies reported OR of ORR, we replaced the OR of ORR by subtracting the treatment’s ORR from the control’s ORR, which was defined as ΔORR. Similarly, the Spearman rank correlation coefficient rs was used to evaluate the associations. Correlation indicates the extent to which those two treatment effects move together, while regression allows to predict the long-term treatment effect (true endpoint) based on the short-term treatment effect (surrogate endpoint). Hence, to give an idea of how the later can help with predicting the former, we further implemented the linear regression models. Simple linear regression models, followed by multiple linear regression models (both weighted by sample size as in previous endpoint validation studies [24-27]), which were expected to control for specific trial setting factors including line of therapy, trial phase and masking, were fitted. Also, we used the surrogate threshold effect (STE) to evaluate the surrogacy of surrogate endpoints from a clinical point of view [28]. STE defines the minimum short-term (i.e. surrogate endpoint) treatment effect required to guarantee a non-zero long-term (i.e. true endpoint) treatment effect. For example, take a regression of HR of OS as y on HR of PFS as x, then the x-value of the intersection of the line y = 1 (which means zero long-term treatment effect) and the upper boundary of the 95% prediction interval stripe of the regression line is STE. It should be noted that various trial settings, such as line of therapy, trial phase, masking and type of therapy might introduce bias into the quantitative relationship between surrogate and true endpoints, if any of these factors remained from the stepwise selection in the previous multiple regression. Thus, sensitivity analyses were performed to explore whether the associations showed homogeneity regardless of different trial settings. All analyses were conducted using R version 4.1.0 (R Foundation for Statistical Computing, Vienna, Austria).

Results

After the initial search, 1158 publications were identified for screening and 959 potentially relevant unduplicated publications were reviewed in detail for eligibility. Ultimately, a total of 136 eligible trials with a total of 350 arms (including arms of reported subgroup analysis, with details in Additional file 2) and 46,028 NSCLC patients were included in the study. The study selection process and reasons for exclusion at each stage are detailed in the PRISMA flow diagram Fig. 1. Most studies focused on advanced NSCLC. Among all 136 trials, 69 (50.7%) of them were phase II studies, 2 (1.5%) of them were phase II/III studies, and 65 (47.8%) of them were phase III studies. Concerning masking, 38 trials were double-blinded, 1 trial was single-blinded, and 97 trials were open-labelled. Other study design characteristics needed for sensitivity analysis are summarised in Additional file 2. In all 350 arms, 288 (82.3%) arms reported ORR, 331 (94.6%) arms reported median PFS, and 311 (88.9%) arms reported median OS. Meanwhile, in all 136 trials, 20 (14.7%) trials reported 25 ORs of ORR, 103 (75.7%) trials reported 147 HRs of PFS, and 101 (74.3%) trials reported 145 HRs of OS. The low percentage of trials reporting OR of ORR led to our decision that ΔORR was constructed to replace OR of ORR.

Arm-level associations between surrogate and true endpoints

Of all 350 arms from 136 eligible trials, 254 arms had both ORR and median OS reported, 271 arms had both ORR and median PFS reported, and 306 arms had both median PFS and median OS reported. The median PFS averaged across all arms was 4.5 months (range from 1.0 to 24.5 months), and the median OS was 11.5 months (range from 2.8 to 51.3 months). Figure 2 gives scatter plots of ORR vs. median OS, ORR vs. median PFS, and median PFS vs. median OS from all arms, with bubble radius representing the corresponding study sample size, and bubble colour representing the therapy type. The Spearman rank correlation coefficient suggested a high positive association between ORR and median OS (Spearman’s rho = 0.700, Table 2), a high positive association between ORR and median PFS (Spearman’s rho = 0.831, Table 2), and a high positive association between median PFS and median OS (Spearman’s rho = 0.755, Table 2).
Fig. 2

Arm-level scatter plot of surrogate and true endpoints. (a) Arm-level scatter plot of ORR and median OS. The Spearman’s rho coefficient is 0.700; (b) Arm-level scatter plot of ORR and median PFS. The Spearman’s rho coefficient is 0.831; (c) Arm-level scatter plot of median PFS and median OS. The Spearman’s rho coefficient is 0.755. Each arm is represented by a bubble whose radius is proportional to the study sample size. The bubble colour represents the therapy type of the arm

Table 2

Arm-level association of all included arms

Surrogate endpointTrue endpointNo. of armsrs95%CI
ORRMedian OS2540.7000.631,0.758
ORRMedian PFS2710.8310.790,0.864
Median PFSMedian OS3060.7550.702,0.799
Arm-level scatter plot of surrogate and true endpoints. (a) Arm-level scatter plot of ORR and median OS. The Spearman’s rho coefficient is 0.700; (b) Arm-level scatter plot of ORR and median PFS. The Spearman’s rho coefficient is 0.831; (c) Arm-level scatter plot of median PFS and median OS. The Spearman’s rho coefficient is 0.755. Each arm is represented by a bubble whose radius is proportional to the study sample size. The bubble colour represents the therapy type of the arm Arm-level association of all included arms

Trial-level associations between surrogate and true endpoints

Of all 136 eligible trials, we extracted 112 pairs of ΔORR and HR of OS, 114 pairs of ΔORR and HR of PFS, and 138 pairs of HR of PFS and HR of OS (because some trials have more than one treatment arm). The ΔORR ranged from − 53.5 to 27.0% in all trials, with HR of PFS and HR of OS ranging from 0.16 to 2.00 and 0.22 to 1.78 respectively. The Spearman correlation between the treatment effect of surrogate and true endpoints was first calculated, as shown in Table 3, which gave a low positive correlation of the pair ΔORR vs. HR of OS (Spearman’s rho = 0.462, Table 3), a high positive correlation of the pair ΔORR vs. HR of PFS (Spearman’s rho = 0.764, Table 3), and a moderate positive correlation of the pair HR of PFS vs. HR of OS (Spearman’s rho = 0.584, Table 3).
Table 3

Trial-level association of all included studies (Spearman’s rho)

Treatment effect of surrogate endpointTreatment effect of true endpointNo. of trialsrs95% CI
ΔORRHR of OS1120.4620.303, 0.597
ΔORRHR of PFS1140.7640.675, 0.831
HR of PFSHR of OS1380.5840.462, 0.684
Trial-level association of all included studies (Spearman’s rho) Simple linear regression results are listed in Table 4. An increasement of 1 unit in ΔORR would lead to an increasement of 0.011 unit in log (HR of OS) (R2 = 0.261, Table 4) and 0.023 unit in log (HR of PFS) (R2 = 0.578, Table 4), while an increasement of 1 unit in log (HR of PFS) resulted in an increasement of 0.403 unit in log (HR of OS) (R2 = 0.360, Table 4).
Table 4

Trial-level associations of all included studies (Simple linear regression)

Treatment effect of surrogate endpointTreatment effect of true endpointNo. of trialsSlope estimate95% CIR2Adjusted R2STE
ΔORRHR of OS1120.0110.007, 0.0140.2610.254——b
ΔORRHR of PFSa1140.0230.019, 0.0270.5780.574−26.30%
HR of PFSaHR of OSa1380.4030.313, 0.4940.3600.355——b

Simple linear regression model form

log (HR of OS) = −0.062 + 0.011 * ΔORR;

log (HR of PFS) = − 0.103 + 0.023 * ΔORR;

log (HR of OS) = − 0.053 + 0.403 * log (HR of PFS)

aNatural log hazard ratio in the analysis

bThe STE will be given only when the upper limit of rs (in Table 3) is more than 0.7

Trial-level associations of all included studies (Simple linear regression) Simple linear regression model form log (HR of OS) = −0.062 + 0.011 * ΔORR; log (HR of PFS) = − 0.103 + 0.023 * ΔORR; log (HR of OS) = − 0.053 + 0.403 * log (HR of PFS) aNatural log hazard ratio in the analysis bThe STE will be given only when the upper limit of rs (in Table 3) is more than 0.7 The trial-level correlation between ΔORR and HR of PFS from the clinical point of view is reflected by a surrogate threshold effect (STE) equal to − 26.30%. In other words, if ORR of the control group minus ORR of the treatment group is smaller than − 26.30%, the HR of PFS will be both statistically and clinically significant, which suggests a reduction in the risk of progression in the experiment group. Multiple linear regression results are listed in Table 5. Based on the least AIC criteria, trial setting factors were selected stepwise. Trial phase should be paid attention to when using ΔORR as a surrogate endpoint for HR of OS (log (HR of OS) = − 0.003 + 0.012 * ΔORR − 0.092 * phase) and using HR of PFS for HR of OS (log (HR of OS) = 0.009 + 0.418 * log (HR of PFS) - 0.072 * phase). Whether the trial is blinded also played a role in the surrogacy of ΔORR for HR of PFS (log (HR of PFS) = − 0.066 + 0.023 * ΔORR – 0.093 * masking).
Table 5

Trial-level association of all included studies (Multiple linear regression, adjusted for line, phase and maskingb)

Treatment effect of surrogate endpointTreatment effect of true endpointNo. of trialsSlope estimate95% CIR2Adjusted R2
ΔORRHR of OSa1030.0120.008, 0.0150.3130.299
ΔORRHR of PFSa1040.0230.119, 0.0270.5710.562
HR of PFSaHR of OSa1250.4180.312, 0.5240.3930.380

Multivariable linear regression model form:

Log (HR of OS) = − 0.003 + 0.012 * ΔORR − 0.092 * phase;

Log (HR of PFS) = − 0.066 + 0.023 * ΔORR – 0.093 * masking;

Log (HR of OS) = 0.009 + 0.418 * log (HR of PFS) - 0.072 * phase

aNatural log hazard ratio in the analysis

bCode for line, phase and masking in the regression:

Line: First-line = 0, ≥2nd-line = 1

Phase: Phase II = 0, Phase III = 1

Masking: Open-label = 0, Double-blind = 1

Trial-level association of all included studies (Multiple linear regression, adjusted for line, phase and maskingb) Multivariable linear regression model form: Log (HR of OS) = − 0.003 + 0.012 * ΔORR − 0.092 * phase; Log (HR of PFS) = − 0.066 + 0.023 * ΔORR – 0.093 * masking; Log (HR of OS) = 0.009 + 0.418 * log (HR of PFS) - 0.072 * phase aNatural log hazard ratio in the analysis bCode for line, phase and masking in the regression: Line: First-line = 0, ≥2nd-line = 1 Phase: Phase II = 0, Phase III = 1 Masking: Open-label = 0, Double-blind = 1

Additional analysis

Sensitivity analyses were performed to evaluate potential heterogeneity in the correlations caused by different trial settings such as line of therapy, phase of trial, masking and therapy type. The results are shown in Tables 6, 7, and 8, Figs. 3, 4, and 5. The scatter plots and fitted weighted linear regression lines of ΔORR vs. HR of OS, ΔORR vs. HR of PFS, and HR of PFS vs. HR of OS in different trial settings are shown in Figs. 3, 4, and 5.
Table 6

Summary of arm-level sensitivity analysis (Spearman correlation)

Sensitivity AnalysisNo. of armsrs95%CI
ORR ~ median OS
Line of therapy
  First-line880.6360.492, 0.746
   ≥ 2nd-line1460.6110.498, 0.703
Phase of trial
  Phase II1410.7030.609, 0.762
  Phase III1110.6600.541, 0.754
Masking
  Double-blind660.6290.456, 0.756
  Open-label1860.7100.631, 0.775
Therapy*
  Targeted therapy-based1230.7490.660, 0.818
  Immunotherapy -based540.4490.207, 0.640
  Chemotherapy-based650.7060.558, 0.810
Overall2540.7000.631,0.758
ORR ~ median PFS
Line of therapy
  First-line930.6990.578, 0.790
   ≥ 2nd-line1530.7350.653, 0.801
Phase of trial
  Phase II1530.8250.768, 0.870
  Phase III1170.7900.711, 0.850
Masking
  Double-blind760.8880.828, 0.927
  Open-label1930.7960.738, 0.842
Therapy*
  Targeted therapy-based1320.8230.759, 0.871
  Immunotherapy -based580.7930.673, 0.873
  Chemotherapy-based680.7830.669, 0.861
Overall2710.8310.790,0.864
median PFS ~ median OS
Line of therapy
  First-line1090.8320.763, 0.882
   ≥ 2nd-line1770.5990.495, 0.686
Phase of trial
  Phase II1430.7980.730, 0.851
  Phase III1540.7050.616, 0.777
Masking
  Double-blind830.7270.606, 0.815
  Open-label2210.7550.692, 0.807
Therapy*
  Targeted therapy-based1320.7560.672, 0.821
  Immunotherapy -based760.6560.506, 0.768
  Chemotherapy-based820.7860.687, 0.857
Overall3060.7550.702,0.799

Targeted therapy-based: including targeted therapy with or without placebo/normal care

Immunotherapy: including immunotherapy with or without placebo/normal care

Chemotherapy-based: including conventional chemotherapy with or without placebo/normal care

Table 7

Summary of trial-level sensitivity analysis (Spearman correlation)

Sensitivity AnalysisNo. of trialsrs95%CI
ΔORR ~ HR of OS
Line of therapy
  First-line390.6850.471, 0.822
   ≥ 2nd-line640.3420.103, 0.544
Phase of trial
  Phase II600.3990.163, 0.591
  Phase III520.5110.277, 0.688
Masking
  Double-blind350.259−0.081, 0.545
  Open-label760.5360.354, 0.680
Overall1120.4620.303, 0.597
ΔORR ~ HR of PFS
Line of therapy
  First-line360.7670.587, 0.875
   ≥ 2nd-line680.7870.675, 0.863
Phase of trial
  Phase II600.7920.674, 0.870
  Phase III540.6730.495, 0.797
Masking
  Double-blind360.7030.487, 0.838
  Open-label770.7890.672, 0.852
Overall1140.7640.686, 0.861
HR of PFS ~ HR of OS
Line of therapy
  First-line480.7680.621, 0.863
   ≥ 2nd-line810.5500.377, 0.686
Phase of trial
  Phase II620.4750.258, 0.647
  Phase III720.6500.492, 0.766
Masking
  Double-blind420.6760.469, 0.813
  Open-label950.5700.417, 0.692
Overall1380.5840.462, 0.684
Table 8

Summary of trial-level sensitivity analysis (Linear regression)

Sensitivity AnalysisNo. of trialsSlope estimate95%CIR2Adjusted R2Threshold
ΔORR ~ HR of OSa
Line of therapy
  First-line390.0190.013, 0.0240.5700.558−28.01%
   ≥ 2nd-line640.0080.004, 0.0120.1660.153——b
Phase of trial
  Phase II600.0100.406, 0.0140.2940.282——b
  Phase III520.0120.007, 0.0180.2670.252——b
Masking
  Double-blind350.0160.009, 0.0230.3650.345——b
  Open-label760.0070.004, 0.0110.1760.165——b
Overall1120.0110.007, 0.0140.2610.254——b
ΔORR ~ HR of PFSa
Line of therapy
  First-line360.0230.018, 0.0290.6410.630−23.25%
   ≥ 2nd-line680.0230.018, 0.0290.5280.521−29.95%
Phase of trial
  Phase II600.0230.018, 0.0280.5880.581−35.82%
  Phase III540.0230.017, 0.0280.5680.560−19.28%
Masking
  Double-blind360.0180.012, 0.0250.4680.453−33.76%
  Open-label770.0250.021, 0.0300.6400.636−24.42%
Overall1140.0230.019, 0.0270.5780.574−26.30%
HR of PFSa ~ HR of OSa
Line of therapy
  First-line480.5550.419, 0.6910.5800.5710.49
   ≥ 2nd-line810.3330.217, 0.4490.2860.277——b
Phase of trial
  Phase II620.3740.255, 0.4920.3900.380——b
  Phase III720.4270.302, 0.5530.3900.3820.41
Masking
  Double-blind420.7930.610, 0.9770.6420.6330.48
  Open-label950.2760.178, 0.3740.2460.238——b
Overall1380.4030.313, 0.4940.3600.355——b

aNatural log hazard ratio in the analysis

bOnly the upper limit of rs is more than 0.7, the STE will be given

Fig. 3

Plots of ΔORR vs. HR of OS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size

Fig. 4

Plots of ΔORR vs. HR of PFS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size

Fig. 5

Plots of HR of PFS vs. HR of OS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size

Summary of arm-level sensitivity analysis (Spearman correlation) Targeted therapy-based: including targeted therapy with or without placebo/normal care Immunotherapy: including immunotherapy with or without placebo/normal care Chemotherapy-based: including conventional chemotherapy with or without placebo/normal care Summary of trial-level sensitivity analysis (Spearman correlation) Summary of trial-level sensitivity analysis (Linear regression) aNatural log hazard ratio in the analysis bOnly the upper limit of rs is more than 0.7, the STE will be given Plots of ΔORR vs. HR of OS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size Plots of ΔORR vs. HR of PFS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size Plots of HR of PFS vs. HR of OS by study sample size regression line. Each trial is represented by a bubble whose radius is proportional to the trial sample size From the arm-level sensitivity analysis (Fig. 6), we can find that 1) ORR is more relevant with median OS in first-line, phase II, open-label, targeted-therapy based trials; 2) ORR is more relevant with median PFS in ≥2nd-line, phase II, double-blind, targeted-therapy based trials. 3) PFS is more relevant with median OS in first-line, phase II, open-label, chemotherapy-based trials. Correlations in all subgroups are statistically significant. It is worth mentioning that, in targeted therapy-based trials, the association strengths between ORR and median OS, between ORR and median PFS, and between median PFS and median OS are all at the high positive level. On the contrary, the association strengths in immunotherapy-based trials are much weaker, except for the ORR vs. median PFS pair. We may conclude that at the arm level, surrogate endpoints show the best appropriation in targeted therapy-based trials among all three therapy types.
Fig. 6

Forest plot of arm-level sensitivity analysis. The box size is based on the precision of Spearman’s rho

Forest plot of arm-level sensitivity analysis. The box size is based on the precision of Spearman’s rho From the trial-level sensitivity analysis (Fig. 7), we can find that associations between ΔORR and HR of OS are not satisfactory in most subgroups except in first-line therapy trials. The association between ΔORR and HR of PFS is the strongest among all three pairs, with all rs estimated above 0.65 and upper limit of rs estimated above 0.7. For the pair of HR of PFS vs. HR of OS, high positive correlation only exists in first-line therapy trials.
Fig. 7

Forest plot of trial-level sensitivity analysis (Spearman correlation). The box size is based on the precision of Spearman’s rho

Forest plot of trial-level sensitivity analysis (Spearman correlation). The box size is based on the precision of Spearman’s rho Table 8 reveals the weighted (based on sample size) linear regression coefficient β of surrogate endpoints to true endpoints in each subgroup. All models using ΔORR to predict HR of OS have R2 < 0.3 except for that in first-line therapy and double-blind trials, which may lead to the result that there are negligible correlations between ΔORR and HR of OS at trial level. In addition, for both ΔORR as a surrogacy of HR of PFS and HR of PFS as a surrogacy of HR of OS, the associations in the subgroup of first-line therapy is much stronger than those in the subgroup of ≥2nd-line therapy, indicating that it may be more reliable and reasonable to use surrogacy endpoints in first-line NSCLC therapy.

Discussion

All analyses on the arm-level, including those in each subgroup, are all statistically significant. The association between ORR and PFS, and that between PFS and OS, are stronger than the association between ORR and OS, which may indicate a pass-on effect of ORR to PFS to OS. We can find that the correlation strength sequence is the same at both the arm level and the trial level, with the pair of PFS vs. OS, the pair of ORR vs. PFS, and the pair of ORR vs. OS listed in a descending order. Meanwhile, the value of correlation strength is smaller at the trial level than that at the arm level. This conclusion agrees with Nie [29], Bira [30] and Ritchie [31]. It is worth noting that, one possible reason for the high correlation at arm level may be that the endpoints are naturally dependent by definition. The dependence structure between endpoints, especially for PFS and OS, should be taken into consideration [32]. To explore this, we calculated the mean (0.381) of all arms’ median PFS/median OS, which is much smaller than the arm-level correlation (0.755). So we conclude that the arm-level correlation are rather driven by a real association between PFS and OS than by the dependence introduced because of the definition.

The surrogacy of ORR for OS

Although ORR has a high positive correlation with OS at arm level, we don’t consider it as an appropriate surrogacy for OS because of its poor performance (low positive correlation, adjusted R2 < 0.3) at trial level, as what Gideon [10] has found in advanced NSCLC from both trial level and patient level. However, the additional analysis indicates that when it is limited to trials with first-line therapy, the trial-level association estimates (Spearman’s rho, regression coefficients and adjusted R2) increases significantly. Hence, ORR may be used as a surrogate endpoint for OS only in first-line therapy trials.

The surrogacy of ORR for PFS

At trial level, ΔORR can explain 58% of the variability of HR of PFS. Combining with its good behaviour at arm level, it implies that ORR may be an appropriate surrogacy for PFS, especially in first-line, open-label trials. The fine surrogacy of ORR for PFS to some extent comes from the definition of those two indices. Notably, it seems that ORR works better as surrogacy for PFS in both immunotherapy and targeted therapy than that in chemotherapy.

The surrogacy of PFS for OS

We can conclude from this analysis that PFS may be at most a modest surrogate endpoint for OS because of its performance at both arm level and trial level, consistent with Buyse’s [15] finding in advanced NSCLC patients and Fiteni‘s [33] finding in locally advanced lung cancer. However, Laporte [14] found that R2 were low in advanced NSCLC at patient-level. We also find that the association between PFS and OS is not the same across various therapy types, with stronger association in targeted therapy than in immunotherapy, and both of which are weaker compared to that in chemotherapy. One possible explanation for this finding is that crossover designs are often implemented in immunotherapy and targeted therapy, as mentioned in many researches [34], because of their outstanding early efficacy and ethical consideration, which would make the effect of treatment smaller. Besides, the correlation between PFS and OS is stronger in first-line therapy trials, which was also found by Foster [35] in extensive-stage small cell lung cancer. There are several limitations in our study that need to be acknowledged. First, meta-analysis can only drop a hint on the association between arm-level and trial-level parameters, while true causation should be established only with an analysis of patient-level data [36]. Second, due to the unavailability of extracting same summary statistics for all trials, such as the OR of ORR on the trial-level, we manually calculated ΔORR using the reported ORR of each arm instead, which may not reflect the true treatment effect of ORR. At last, the idea can be considered in future studies that dividing all trials into a training part and a validating part, to examine more thoroughly the prediction ability of surrogate endpoints.

Conclusion

In conclusion, the findings of this meta-analysis of prospective randomized trials demonstrate that ORR is proved to be lack of validity for the surrogacy of OS, except in first-line therapy trials. Conversely, it may be an appropriate surrogate endpoint for PFS across different trial settings (e.g. line of therapy, phase of trial, masking and therapy type), considering the strong and robust correlations at both arm level and trial level. However, as a traditional surrogate endpoint, PFS may be considered at most a modest surrogacy for OS in the setting of phase II/III clinical trials of targeted therapy and immunotherapy for NSCLC patients (especially in first-line therapy trials). Besides, we must admit that to evaluate the true surrogacy, further evaluation of patient-level data should be carried out. Additional file 1. Cochrane Library Search Strategy (May 2021). Additional file 2. Characteristics of Included 136 studies.
  34 in total

1.  Relationship between response and survival in more than 50,000 patients with advanced non-small cell lung cancer treated with systemic chemotherapy in 143 phase III trials.

Authors:  Katsuyuki Hotta; Yoshiro Fujiwara; Katsuyuki Kiura; Nagio Takigawa; Masahiro Tabata; Hiroshi Ueoka; Mitsune Tanimoto
Journal:  J Thorac Oncol       Date:  2007-05       Impact factor: 15.609

Review 2.  Immunotherapy for lung cancer.

Authors:  Antonius Steven; Scott A Fisher; Bruce W Robinson
Journal:  Respirology       Date:  2016-04-21       Impact factor: 6.424

Review 3.  The Strength of Association Between Surrogate End Points and Survival in Oncology: A Systematic Review of Trial-Level Meta-analyses.

Authors:  Vinay Prasad; Chul Kim; Mauricio Burotto; Andrae Vandross
Journal:  JAMA Intern Med       Date:  2015-08       Impact factor: 21.873

Review 4.  PD-1/PD-L1 immune checkpoint: Potential target for cancer therapy.

Authors:  Fatemeh K Dermani; Pouria Samadi; Golebagh Rahmani; Alisa K Kohlan; Rezvan Najafi
Journal:  J Cell Physiol       Date:  2018-09-07       Impact factor: 6.384

Review 5.  Evaluation of classical clinical endpoints as surrogates for overall survival in patients treated with immune checkpoint blockers: a systematic review and meta-analysis.

Authors:  Howard L Kaufman; Lawrence H Schwartz; William N William; Mario Sznol; Kyle Fahrbach; Yingxin Xu; Eric Masson; Andrea Vergara-Silva
Journal:  J Cancer Res Clin Oncol       Date:  2018-08-21       Impact factor: 4.553

6.  Multitrial Evaluation of Progression-Free Survival as a Surrogate End Point for Overall Survival in First-Line Extensive-Stage Small-Cell Lung Cancer.

Authors:  Nathan R Foster; Lindsay A Renfro; Steven E Schild; Mary W Redman; Xiaofei F Wang; Suzanne E Dahlberg; Keyue Ding; Penelope A Bradbury; Suresh S Ramalingam; David R Gandara; Taro Shibata; Nagahiro Saijo; Everett E Vokes; Alex A Adjei; Sumithra J Mandrekar
Journal:  J Thorac Oncol       Date:  2015-07       Impact factor: 15.609

7.  Pembrolizumab versus Chemotherapy for PD-L1-Positive Non-Small-Cell Lung Cancer.

Authors:  Martin Reck; Delvys Rodríguez-Abreu; Andrew G Robinson; Rina Hui; Tibor Csőszi; Andrea Fülöp; Maya Gottfried; Nir Peled; Ali Tafreshi; Sinead Cuffe; Mary O'Brien; Suman Rao; Katsuyuki Hotta; Melanie A Leiby; Gregory M Lubiniecki; Yue Shentu; Reshma Rangwala; Julie R Brahmer
Journal:  N Engl J Med       Date:  2016-10-08       Impact factor: 91.245

Review 8.  Surrogate endpoints for overall survival in lung cancer trials: a review.

Authors:  Frédéric Fiteni; Virginie Westeel; Franck Bonnetain
Journal:  Expert Rev Anticancer Ther       Date:  2017-04-12       Impact factor: 4.512

9.  Postprogression survival in patients with advanced non-small-cell lung cancer who receive second-line or third-line chemotherapy.

Authors:  Hidetoshi Hayashi; Isamu Okamoto; Masataka Taguri; Satoshi Morita; Kazuhiko Nakagawa
Journal:  Clin Lung Cancer       Date:  2012-10-27       Impact factor: 4.785

10.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.

Authors:  David Moher; Alessandro Liberati; Jennifer Tetzlaff; Douglas G Altman
Journal:  PLoS Med       Date:  2009-07-21       Impact factor: 11.069

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.