Literature DB >> 35652050

Local Investigators Significantly Overestimate Overall Response Rates Compared to Blinded Independent Central Reviews in Uncontrolled Oncology Trials: A Comprehensive Review of the Literature.

Cinzia Dello Russo1,2, Pierluigi Navarra1.   

Abstract

Several drugs gained market authorization based on the demonstration of improved progression-free survival (PFS), adopted as a primary endpoint in Phase 3 clinical trials. In addition, an increasing number of drugs have been granted accelerated approval, and sometimes regular approval, by the main regulatory agencies based on the evaluation of the overall response rate in Phase 1 and 2 clinical trials. However, while the overall survival is an unbiased measure of drug efficacy, these outcomes rely on the assessment of radiological images and patients' categorization using standardized response criteria. The evaluation of these outcomes may be influenced by subjective factors, particularly when the analysis is performed locally. In fact, blinding of treatment is not always possible in modern oncology trials. Therefore, a blinded independent central review is often adopted to overcome the problem of expectation bias associated with local investigator assessments. In this regard, we have recently observed that local investigators tend to overestimate the overall response rate in comparison to central reviewers in Phase 2 clinical trials, whereas we did not find any significant evaluation bias between local investigators and central reviews when considering progression-free survival in both Phase 2 and 3 trials. In the present article, we have tried to understand the reasons behind this discrepancy by reviewing the available evidence in the literature. In addition, a further analysis of Phase 2 and 3 clinical trials that included the evaluation of both endpoints showed that local investigators significantly overestimate overall response rates compared to blinded independent central reviews in uncontrolled oncology trials.
Copyright © 2022 Dello Russo and Navarra.

Entities:  

Keywords:  BICR; ORR; PFS; local investigators; objective response rate; oncology; pharmacotherapy; progression-free survival

Year:  2022        PMID: 35652050      PMCID: PMC9149259          DOI: 10.3389/fphar.2022.858354

Source DB:  PubMed          Journal:  Front Pharmacol        ISSN: 1663-9812            Impact factor:   5.988


Introduction

Cancer is one of the major causes of death across the world, leading to a significant reduction of life expectancy in several countries. The burden of cancer is steadily increasing worldwide (Sung et al., 2021); in 2020, there were 19.29 million new cancer cases, 9.96 million cancer deaths, and a total of 50.55 million people living with cancer within 5 years of diagnosis (Global Cancer Observatory, 2020). The COVID-19 pandemic has negatively impacted on several clinical procedures in oncology, such as reduced access to screening programs, delayed diagnosis, and disruption in treatment schedules, possibly contributing to such increase. The rising incidence of cancer coupled with the high mortality rates highlights the need for new and more effective treatments in oncology. However, the outcome of anticancer treatments significantly depends on the extent of tumor disease at treatment initiation, which further highlights the relevance of preventive strategies as well as diagnostic procedures or other clinical interventions to allow early diagnosis of cancer and timely treatment initiation (Kwong et al., 2021; Okoli et al., 2021). With respect to the development of novel anticancer drugs, there is a continuous effort to define the most accurate methods as well as the relevant endpoints to assess the clinical benefit of new compounds in the context of different tumors (Daniele et al., 2020). In this regard, the overall survival (OS) is an unbiased measure of drug efficacy, and thus it is considered the gold standard endpoint for most types of tumors (Delgado and Guddati, 2021). However, it is not always possible—or convenient—to take the OS as the primary endpoint of drug efficacy in clinical trials (Lebwohl et al., 2009). In fact, several drugs gained market authorization based on the demonstration of improved progression-free survival (PFS), which was adopted as a primary endpoint in Phase 3 clinical trials (Robinson et al., 2014). This is particularly frequent in the context of metastatic disease, although the PFS is not always predictive of beneficial effects in terms of OS (Haslam et al., 2019; Pasalic et al., 2020). Moreover, an increasing number of drugs have been granted accelerated approval, and sometimes regular approval, by the FDA based on the evaluation of the overall response rate (ORR) in Phase 1–2 clinical trials (Chen et al., 2019), although a large meta-analysis showed poor correlation between ORR and OS or PFS (Cooper et al., 2020). In addition, it has become progressively clear, especially with the development of targeted therapies, that not all new drugs cause shrinking of tumors, with disease stabilization observed in several cases. Therefore, it is also often necessary to perform evaluations of time-to-event endpoints during the early phases of clinical development (Gravanis et al., 2014). Notably, both the ORR and the PFS rely on the assessment of radiological images and patients’ categorization based on standardized response criteria (Delgado and Guddati, 2021). The most frequently adopted criteria are the response evaluation criteria in solid tumors (RECIST) (Therasse et al., 2000; Eisenhauer et al., 2009), which have been adequately modified over time to better assess responses in specific clinical settings, for a specific tumor, or in relation to specific drugs, that is, immune check point inhibitors (Aykan and Özatlı, 2020). A major drawback is that the evaluation of these endpoints may be influenced by subjective factors, such as differences in tumor measurement, differences in the selection of target lesions, failure to diagnose new lesions, and differences in the analysis of non-target lesions (Amit et al., 2010; Iannessi et al., 2021). This may be particularly relevant when the evaluations are performed locally, considering that blinding of treatment is not always possible in modern oncology trials. Therefore, a blinded independent central review (BICR) is often adopted to overcome the problem of expectation bias associated with local investigator (LI) assessments (Dodd et al., 2008; Amit et al., 2010). When BICR is implemented, all the radiological images and selected clinical data acquired in the clinical trials are reviewed by independent radiologists who are blinded to treatment assignment and to any kind of clinical data that can influence the independent review process (Ford et al., 2009). In this regard, we have recently observed that LIs tend to overestimate the ORR in comparison to BICR in Phase 2 clinical trials, whereas we did not find any significant evaluation bias between LIs and BICR when considering PFS in both Phase 2 and 3 trials (Dello Russo et al., 2021; 2020). In the present article, we have tried to understand the reasons behind this discrepancy by reviewing the available evidence in the literature. In addition, we performed a further analysis of Phase 2 and 3 clinical trials that included the evaluation of both endpoints, ORR and PFS.

Local Investigator Evaluation Versus Blinded Independent Central Review in the Assessment of Relevant Primary Outcomes

Overall Response Rate

The ORR measures the response to a pharmacological treatment through the evaluation of changes in the overall tumor burden in comparison to baseline values (Aykan and Özatlı, 2020). In solid tumors, this is calculated using RECIST criteria (Therasse et al., 2000), updated in 2009 as RECIST v1.1 (Eisenhauer et al., 2009). These criteria allow to classify patients into different groups considering the effects of the investigational drug on the tumor lesions. In brief, patients are categorized as having a complete response (CR), partial response (PR), stable disease (SD), or progressive disease (PD). The ORR is then calculated as the percentage of patients with CR and PR over the total number of eligible patients, that is, all subjects included in the trial (Eisenhauer et al., 2009). Response evaluation is usually based on arbitrary cutoff values at a specific time point, usually at 4 or 8 weeks after treatment initiation (Eisenhauer et al., 2009; Aykan and Özatlı, 2020). In the first analysis of 18 Phase 3 clinical trials, a significant variability was reported between the LI assessments, and the BICR results in the evaluation of the tumor response rate (RR) (Tang et al., 2010). Despite using a general abbreviation, in the majority of the analyzed trials, the RR is indeed equal to the ORR, as per the abovementioned definition. However, in two trials, CR was observed only in one enrolled patient, which leaves the comparison among LI and BICR to the evaluation of the rate of PRs (Escudier et al., 2007; Motzer et al., 2007). In 14 out of 18 trials, the RR values measured by LIs were consistently greater than those by BICR for both the experimental and the control arms of the studies. For the remaining four trials, in one study, the evaluation of the RR was higher in the experimental group by LIs than that by BICR and lower for the control arm; in two trials, the RR was reported to be inferior by LIs in comparison to BICR in the experimental group and superior in the Control arm; and in one trial, the RR was consistently lower in both arms of the study by the LI in comparison to BICR. The analysis was performed by comparing the absolute values reported for the RR. Therefore, the RR reported by LIs was greater in 31 out of 36 evaluations, with an estimated mean difference of +4.57% (95% confidence interval, CI, 2.95–6.19%), thus suggesting that overall the LIs were more “optimistic” than the independent reviewers in the evaluation of tumor RR. These results are consistent with a more recent analysis carried out by our research group on data from 20 Phase 2 clinical trials having the ORR as primary or secondary endpoint, assessed by both LIs and through BICR (Dello Russo et al., 2021). In some trials, more than one treatment group was included for a total of 33 ORR values. In most cases (26/33, 78.8%), the LI assessment was more “optimistic” in the evaluation of ORR, with an average of + 17.5% positive responses than the central independent reviewer. Most trials had an open-label design, and 12 out of 20 trials (60%) had a single group assignment. When parallel groups were included, these referred, for example, to different doses of the same drugs (NCT02094573, brigatinib 90 mg, and brigatinib 90–180 mg) (Kim et al., 2017) or to the evaluation of the same drug in different cohorts of patients (NCT02336451, ceritinib in different cohorts of patients with brain metastases, no evidence of leptomeningeal carcinomatosis, and exposed to different previous therapies) (Chow et al., 2019). Therefore, most trials lacked a comparator arm. Our data are however consistent with previous evidence from the evaluation of 28 Phase 3 clinical trials, six of which were carried out according to a blinded design (Zhang et al., 2017). In this study, with respect to the analysis of ORR on both the experimental and control arms of the studies, the authors found a lower response frequency in the central review compared with LI assessment. The discrepancy was larger in the open-label trials, for a positive primary outcome, and central assessed primary endpoint. However, this phenomenon impacted both the experimental and the comparator arm; therefore, it did not cause any significant evaluation bias between the two reviews. The ratio of odd ratios (ORs) was 1.02, with a 95% CI of 0.97–1.07 (p = 0.42), thereby including the value of 1 and suggesting a high degree of concordance between the two assessments. Interestingly, 16 trials reported data on the disease control rate (DCR) with similar results to those observed for the ORR (Zhang et al., 2017). Consistently, in a subsequent large analysis of 76 Phase 3 clinical trials, including over 45,000 patients, the authors found eight trials with discrepancy in the evaluation of ORR among LIs and BICR. In seven of these trials, ORR was included as a secondary endpoint, further indicating that the ORR is rarely used as primary endpoint in Phase 3 clinical trials. Interestingly, in five out of eight trials, the BICR appeared to be more positive than the LI assessments (please refer to Table 2 in the original paper, Zhang et al. (2018)). The ORR was included as a primary endpoint in a clinical trial investigating the potential beneficial effect of peptide vaccine therapy as add-on treatment to aldesleukin (IL-2) in patients with locally advanced or metastatic melanoma (NCT00019682, Schwartzentruber et al., 2011). Interestingly, in this study, the BICR was more optimistic in the evaluation of the ORR in comparison to LI, although both evaluations found the addition of the vaccine more effective than the treatment with aldesleukin alone. Interestingly, the vaccine significantly increased the PFS and OS (although the study was not powered to assess these outcomes), indicating in this case a correlation between ORR and the other outcomes. However, the pooled analysis carried out on 29 trials that included the evaluation of ORR showed no significant differences among the two reviews (Zhang et al., 2018). Taken together, these data, therefore, suggest that there is an overall trend to overestimate the ORR by LIs. However, in controlled studies no significant differences can be found in the analysis of the beneficial effects of the investigational drug versus the comparator treatments among the two reviews, thus highlighting the reliability of the LIs in the evaluation of this endpoint in Phase 3 clinical trials.

Progression-Free Survival

The PFS is defined as the time from randomization or treatment initiation until first evidence of objective tumor progression or death from any cause (Lebwohl et al., 2009; Amit et al., 2011; Delgado and Guddati, 2021). It is a time-to-event endpoint based on the assessment of disease progression at different time points defined in each clinical trial. It depends on tumor growth. This evaluation usually requires shorter time and a smaller population than OS, and it is not affected by subsequent therapies. Thus, it is often used as a primary endpoint in Phase 3 clinical trials, especially in the context of advanced/metastatic disease (Lebwohl et al., 2009). However, the data obtained may be subject to measurement error and bias since progression is defined based on standard criteria and is not objective as the time of death (Amit et al., 2010). That is why BICR is often recommended to minimize possible investigator bias, except when the trial is truly double-blinded (Amit et al., 2010). However, also in these cases, the occurrence of adverse drug effects can be valuable information for the investigator to reveal the treatment arm in which the patient is possibly allocated (Dodd et al., 2008). In an initial analysis of concordance among LIs and BICR, data from 7 Phase 3 trials showed that there were not significant differences in the assessment of PFS and treatment efficacy among the two reviews (Dodd et al., 2008). However, in a subsequent study, a certain degree of variability between the LI assessments and the BICR results with respect to the evaluation of PFS was found in eight Phase 3 clinical trials (Tang et al., 2010). This analysis focused mainly on differences in the median times, showing that the estimated mean difference in PFS was ≥0.19 (95% CI, −0.68 to +0.29) months. The concordance of treatment effect, as measured by the hazard ratio (HR) for LIs and BICR, could be analyzed only in four trials, two of which reported differences among the LIs and BICR (Tang et al., 2010). In contrast, a large meta-analysis, promoted by several pharmaceutical companies, namely, GlaxoSmithKline, AstraZeneca, Eli Lilly, and Hoffman LaRoche, and carried out on 27 Phase 3 trials with PFS, showed strong correlation (r = 0.947) between the LI assessments and BICR. The estimated ratio of HRs between LIs and BICR was 1.03 (95% CI, 0.98–1.08), implying approximately 3% difference between the two evaluations (Amit et al., 2011). However, in a subsequent analysis of 76 Phase 3 clinical trials, 10 trials were identified with inconsistencies between the two reviews with respect to the assessment of PFS (Zhang et al., 2018). These studies mostly included PFS as the primary endpoint. In seven out of 10 trials, the LIs reported lower HR values in comparison to BICR, which means a larger beneficial effect of the investigational drug estimated locally. Consistent with data on the ORR discussed above, the pooled analysis carried out on 72 trials that included the evaluation of PFS showed no significant differences among the two reviews (please refer to Table 3 in the original paper, Zhang et al. (2018)). In a recent analysis performed by our group on 28 Phase 3 clinical trials, we compared the HRs for PFS between LIs and BICR calculating a discrepancy index through the ratio of the HRs among the two reviews, and we did not find any significant evaluation bias among the two assessments (Dello Russo et al., 2020). The estimated mean discrepancy index was 0.98 (95% CI, 0.927–1.032). We also performed subsequent analyses, dividing the trials by year, by tumor type, by drug type, and by study design. Interestingly, the discrepancy index did not include the value 1 only in the subgroup of trials started in the period 2003–2006, with the blinded assessment showing increased benefit than the LI assessment. We can hypothesize that the improvement in the diagnostic skills in clinical trials and the continuous refinement of diagnostic criteria per different tumors has progressively contributed to the reduction of evaluation bias. Consistently, we found a relatively higher variability (large CI) in trials testing biological agents, which was mainly due to the high discrepancy recorded in trial KEYNOTE-61 testing the effects of the immune check point inhibitor pembrolizumab (Shitara et al., 2018). In this regard, it is now well established that standard RECIST criteria cannot be applied to the assessment of immunotherapy efficacy in solid tumors and needed to be modified for these drugs, so-called immune-related response evaluation criteria in solid tumors (iRECIST) (Seymour et al., 2017). No other relevant differences were found in the subgroup analysis based on tumor type (trials in onco-hematology and solid tumors) and drug type (trials testing small-molecules and those testing biologicals). We also found a trend toward more optimistic assessments by the independent review in double-blinded trials. Taken together, these data suggest that the two approaches, local and central review, with respect to PFS, lead to substantially similar results. These results further support the notion that the BICR is not necessary for all clinical trials, but it can be considered based on specific cases or carried out as an audit on a subset of enrolled patients (Amit et al., 2010).

Additional Evidence on the Comparison Between Local Investigators and Independent Reviewers

In our previous analysis, we found that the LIs tend to overestimate the ORR in comparison to BICR, whereas high concordance was observed for the evaluation of PFS. This result was unexpected, considering that both endpoints rely on the same criteria to assess the radiological images as well as to establish response to therapy and/or disease progression. In an attempt to clarify the reasons behind the differences between LIs and BICR in relation to the evaluation of these two outcomes, we accessed the clinicaltrials.gov database and the clinicaltrialsregister.eu (EudraCT) database on 30 October 2021 and searched for clinical trials that included both the ORR and the PFS as endpoints and evaluations by both LIs and blinded assessors. We have already analyzed a subgroup of Phase 2 clinical trials that included both the ORR and the PFS, showing no differences between local and central reviewers with respect to PFS (Dello Russo et al., 2021). We aimed to update and extend this initial analysis by including also the Phase 3 clinical trials.

Research Strategy

The research strategy was based on the methodological approach previously described (Dello Russo et al., 2020). In brief, in the clinicaltrials.gov database, we searched for clinical trials selected by the search string ((investigator-assessed) OR and (investigator-assessment) AND (Cancer) and posted in the database within the data-lock date of 30 October 2021. We found a total of 1389 interventional clinical trials, of which 301 Phase 2 studies with results and 171 Phase 3 studies with results. Among these, we then selected for the analysis clinical trials that included both the ORR and the PFS evaluated both locally and by BICR. According to these inclusion criteria, we selected 17 Phase 2 and 18 Phase 3 clinical trials. However, two Phase 2 clinical trials were subsequently excluded, namely, the NCT01258608 trial, because the result section only included ORR evaluated by BICR (64.4 months, 95% CI, 48.80–78.10) and the NCT02108964 clinical trial for which no results were posted in the database at the data-lock date. A further search was carried out in the EudraCT database using the same search string as above, retrieving additional 614 clinical trials, 203 of with results including 116 Phase 2 and 88 Phase 3 clinical trials. Among these, we then selected for the analysis studies including both ORR and PFS evaluated both locally and by BICR. A total of nine Phase 2 and eight Phase 3 clinical trials met the inclusion criteria. Among the newly found Phase 2 clinical trials, only four did not overlap with the search in the clinicaltrials.gov database, although only two could be included for the analysis (Table 1, trials number 16 and 17). In this regard, the EudraCT 2013-000311-25 (NCT01915589, with no results) and the EudraCT 2013-000241-39 evaluating the efficacy of refametinib in the hepatocellular carcinoma were excluded since it was not possible to calculate a discrepancy index for the ORR. In addition, the Phase 2 clinical trial number 16 (Table 1), EudraCT 2013-005486-39, was linked to two different trials in the US database, namely, the NCT02108652 and NCT02951767 clinical trials, including results from two different cohorts of patients. Both trials are reported in Table 1 as number 16. For the newly found Phase 3 clinical trials, only three did not overlap with those found in the clinicaltrials.gov database. The total number of Phase 3 trials considered for the analysis is therefore 21.
TABLE 1

Main characteristics of Phase 2 clinical trials selected for the analysis.

Trial registration number (First posted date)Intervention model/maskingTumor typeTreatment arm(s)Number of patientsPrimary endpointCycle lengthTiming of scansTime frame (primary endpoint)DI ORRDI PFSReference
1NCT00875667Parallel assignment/none (open label)Relapsed or refractory mantle cell lymphomaE: lenalidomideE: 170PFS per BICR and LI a E: 28 daysEvery 2 cycles for 6 months; every 90 days thereafter until disease progressionMedian follow-up of 15.9 months (IQR 731.7) for all patientsa E: 1.15E: 0.99 Trněný et al. (2016)
(3 Apr 2009)AC: investigator’s choice single agentAC: 84AC: variable according to different regimens AC: 2.09AC: 1.04
2NCT01685060 (13 Sep 2012)Single Group Assignment/None (Open Label)Non-small-cell lung cancerCeritinib (LDK378)140ORR per LI28 daysEvery 8 weeksUp to 24 weeks1.140.78 Hida et al. (2017)
EudraCT 2012-003432-24 (12 Apr 2017)
3NCT01685138 (14 Sep 2012)Single group assignment/none (open label)Non-small-cell lung cancerCeritinib (LDK378)124ORR per LI28 daysEvery 8 weeksUp to 5 years1.060.86 Nishio et al. (2020)
4NCT01708174 (16 Oct 2012)Single group assignment/none (open label)MedulloblastomaSonidegib (LDE225) and TMZ16ORR per BICR28 daysNA>3 years1.331 Clinicaltrials.gov identifier NCT01708174, 2012
5NCT01984242 (14 Nov 2013)Parallel assignment/none (open label)Renal cell carcinomaITTITTPFS per BICR6 weeksEvery 12 weeks ± 5 days after cycle 1Up to ∼2.75 yearsITTITT McDermott et al. (2018)
E1: atezolizumab + bevacizumabE1: 101E(1–2): one infusion every 3 weeksE1: 1.09E1: 0.95
EudraCT 2013-003167-58 (29 Oct 2017)E2: atezolizumabE2: 103AC: Day 1–28 of each cycleE2: 0.92E2: 0.90
AC: sunitinibAC: 101AC: 1.14AC: 0.93
6NCT02031458 (9 Jan 2014)Single group assignment/none (open label)Non-small-cell lung cancerAtezolizumabCohort 1: 139ORR per BICR3 weeksEvery 6 weeks for 12 months; every 9 weeks thereafterUp to 16 monthsCohort 1: 1.15Cohort 1: 1.29 Peters et al. (2017)
Cohort 1: first lineCohort 2: 267Cohort 2: 1.09Cohort 2: 1.00
EudraCT 2013-003330-32 (1 July 2016)Cohort 2: second lineCohort 3: 253Cohort 3: 1.05Cohort 3: 1.07
Cohort 3: third line and beyond
7NCT02040870 (20 Jan 2014)Single group assignment/none (open label)Non-small-cell lung cancerCeritinib (LDK378)103Others28 daysEvery 8 weeks40 months1.301.89 Wu et al. (2020)
8NCT02336451 (13 Jan 2015)Parallel assignment/ none (open label)ALK-positive Non-small-cell lung cancerCeritinib c (LDK378)Arm 1: 42ORR per LI28 daysAt week 8; every 8 weeks thereafter43 monthsArm 1: 1.50Arm 1: 1.44 Chow et al. (2019)
Arm 2: 40Arm 2: 2.00Arm 2: 1.02
EudraCT 2014-000578-20 (22 Feb 2020)Arm 3: 12Arm 3: 1.50 b Arm 3: NA b
Arm 4: 44Arm 4: 0.96Arm 4: 1.03
Arm 5: 18Arm 5: 1.50Arm 5: 1.44
9NCT00117637 (8 July 2005)Parallel assignment/none (open label)Renal cell carcinomaE: first sorafenib (BAY43-9006) 400 mg then 600 mgE: 97PFS per BICRNAEvery 8 weeks∼15 monthsE: 4.19E: 0.98 Escudier et al. (2009)
AC: first interferon then sorafenib 400 mgAC: 92AC: 1.75AC: 1.25
10NCT01943864 (September 17, 2013)Single group assignment/none (open label)Advanced or metastatic biliary tract cancers in JapaneseTrametinib20OthersNAEvery 8 weeksUp to week 37NA b (ORR per LI: 0)1 b Ikeda et al. (2018)
11NCT00679211 (16 May 2008)Single group assignment/none (Open Label)Metastatic breast cancerTrastuzumab emtansine (Kadcyla)110ORR per BICR3 weeksEvery other cycle by LI and retrospectively by double-reader BICR as needed∼9 months10.797 Krop et al. (2012)
12NCT02674061 (4 Feb 2016)Parallel assignment/none (open label)Advanced recurrent ovarian cancerPembrolizumab (MK-3475)Cohort A: 285ORR per BICR (in all cohorts A and B and subgroup cohorts PDl-L1+)3 weeksEvery 9 weeks for the first 54 weeks; every 12 weeks thereafterUp to ∼43 monthsCohort A: 0.864Cohort A: 1 Matulonis et al. (2019)
Cohort A (0–2 prior lines of treatment)Cohort B: 91Cohort B: 0.889Cohort B: 1
Cohort B (3–5 prior lines of treatment)
13NCT03525678 (16 May 2018)Parallel assignment/none (open label)Multiple myelomaBelantamab mafodotin (GSK2857916)Arm 1: 97ORR per BICR3 weeksEvery 3 weeks (radiography for skeletal lytic lesions, but it is not specified in the timing)Up to 48 weeksArm 1: 0.968Arm 1: 0.759 Lonial et al. (2020)
EudraCT 2017-004810-25 (26 Apr 2020)Arm 1 (2.5 mg/kg frozen liquid, FL)Arm 2: 99Arm 2: 0.912Arm 2: 0.776
Arm 2 (3.4 mg/kg FL)Arm 3: 25Arm 3: 1.083 b Arm 3: NA b
Arm 3 (3.4 mg/kg lyophilized powder, LP)
14NCT02576990 (15 Oct 2015)Single group assignment/none (open label)Mediastinal large B-cell lymphoma (rrMLBCL)Pembrolizumab (MK-3475)Arm 1: 53ORR per BICR3 weeksAt week 12 and then every 12 weeksUp to ∼27 monthsArm 1: 0.916Arm 1: 0.782 Armand et al., 2019; Armand et al., 2020
Richter syndrome (rrRS)Arm 1 (rrMLBCL)Arm 2: 23Arm 2: 0.331Arm 2: 1.125
Arm 2 (rrRS)
15NCT01660451 (8 Aug 2012)Parallel assignment/none (open label)Non-Hodgkin’s lymphoma (NHL)Copanlisib (BAY80-6946)Part A (indolent): 33ORR per BICR28 daysEvery two cycles during year 1; every three cycles during year 2; every six cycles during year 3Up to 16 weeks of treatmentPart A (indolent): 1.072Part A (indolent): 0.952 Dreyling et al. (2017)
Part A (indolent NHL/CLL)Part A (aggressive): 51Part A (aggressive): 1.154Part A (aggressive): 1 Panayiotidis et al. (2021)
Part A (aggressive NHL)Part B: 142Part B: 0.869Part B: 0.971
Part B (indolent B-cell NHL)
16EudraCT 2013-005486-39 (28 July 2016) NCT02108652 (9 Apr 2014)Single group assignment/none (open label)Urothelial bladder cancerAtezolizumab Cohort 2: second-line or beyond treatmentCohort 2: 310ORR per LI and BICR21 daysEvery 9 weeks for the first 12 months; every 12 weeks thereafterUp to maximum length of follow-up of 24.5 months1.041 Perez-Gracia et al. (2018)
NCT02951767 (1 Nov 2016)Cohort 1: drug naïveCohort 1: 1191.111.55 Balar et al. (2017)
17EudraCT 2013-002737-38 (28 Apr 2021) NCT02183870 (no results) (8 Jul 2014)Single group assignment/Lung cancerCrizotinibPer protocol: 30ORR per LIContinuous daily dosingEvery 6 weeksNAPer protocol: 0.959Per protocol: 0.97 Michels et al. (2019)
none (open label)Adenocarcinoma NSCLC (ROS1 +)ITT: 34ITT: 0.959 b ITT: NA b

Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. IQR, interquartile range; ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review; LI, local investigator; rrMLBCL, relapsed or refractory primary mediastinal large B-cell lymphoma; rrRS, relapsed or refractory richter syndrome; DI, discrepancy index; NHL, non-Hodgkin’s lymphoma; CLL, chronic lymphocytic leukemia; mRECIST, modified response evaluation criteria in solid tumors; PFI/TFI, platinum-free interval/treatment-free interval; PrALKi, previous ALK inhibitors’ treatment; PrBRad, previous brain radiations; RECIST, response evaluation criteria in solid tumors.

The Primary outcome for this trial was PFS per BICR (up to data cutoff date, 7 March 2014) and LI (at final analysis, up to study discontinuation of 09 October 2018, (median follow up of 285 weeks). The data included in our analysis refer to the primary analysis cutoff date for both LI and BICR ( ). The same DIs are calculated by comparing the values obtained by LI at final analysis versus BICR primary analysis (Dello Russo et al., 2021).

Not included in the present analysis since the specific arm/group was missing the DI for one of the outcomes (either ORR or PFS).

Treatment arms were the followings: Arm 1 (PrALKi = Yes; PrBRad=Yes); Arm 2 (PrALKi=Yes; PrBRad=No); Arm 3 (PrALKi=No; PrBRad=Yes); Arm 4 (PrALKi=No; PrBRad=No); Arm 5 (leptomeningeal carcinomatosis). In Arm 1–4, no evidence of leptomeningeal carcinomatosis.

Main characteristics of Phase 2 clinical trials selected for the analysis. Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. IQR, interquartile range; ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review; LI, local investigator; rrMLBCL, relapsed or refractory primary mediastinal large B-cell lymphoma; rrRS, relapsed or refractory richter syndrome; DI, discrepancy index; NHL, non-Hodgkin’s lymphoma; CLL, chronic lymphocytic leukemia; mRECIST, modified response evaluation criteria in solid tumors; PFI/TFI, platinum-free interval/treatment-free interval; PrALKi, previous ALK inhibitors’ treatment; PrBRad, previous brain radiations; RECIST, response evaluation criteria in solid tumors. The Primary outcome for this trial was PFS per BICR (up to data cutoff date, 7 March 2014) and LI (at final analysis, up to study discontinuation of 09 October 2018, (median follow up of 285 weeks). The data included in our analysis refer to the primary analysis cutoff date for both LI and BICR ( ). The same DIs are calculated by comparing the values obtained by LI at final analysis versus BICR primary analysis (Dello Russo et al., 2021). Not included in the present analysis since the specific arm/group was missing the DI for one of the outcomes (either ORR or PFS). Treatment arms were the followings: Arm 1 (PrALKi = Yes; PrBRad=Yes); Arm 2 (PrALKi=Yes; PrBRad=No); Arm 3 (PrALKi=No; PrBRad=Yes); Arm 4 (PrALKi=No; PrBRad=No); Arm 5 (leptomeningeal carcinomatosis). In Arm 1–4, no evidence of leptomeningeal carcinomatosis.

Results From Phase 2 Clinical Studies

As summarized in Table 1, a total of 17 clinical Phase 2 clinical trials were selected for the current analysis, including nine studies, highlighted in gray, considered in our previous analysis (Dello Russo et al., 2021). All the trials reported in Table 1 had an open-label design and mostly (10/17, 58.8%) a single group assignment. Among the remaining studies with parallel group assignment, only two included an active comparator group, that is, the NCT00875667 (n. 1) clinical trial (Trněný et al., 2016), studying the efficacy of lenalidomide versus chemotherapy as per investigator’s choice in patients with mantle cell lymphoma after previous treatment failure, and the NCT01984242 (n. 5) clinical trial (McDermott et al., 2018), testing the efficacy of atezolizumab alone or in combination with bevacizumab versus sunitinib in patients affected by advanced renal carcinoma. All the other trials with parallel group assignment tested the same treatment in different subgroups or different schedule of administration of the same drug or drug combination. Therefore, all these studies substantially lacked a comparator arm. Interestingly, both the NCT00875667 and the NCT01984242 clinical trials, with the active comparator arm, included the PFS per BICR as primary endpoint. The latter was also found as the primary outcome in the NCT00117637 (n. 9) clinical trial (Escudier et al., 2009), testing the efficacy of sorafenib in combination with interferon-α (IFN-α) in two different schedules of administration, that is, Cohort 1 (Sorafenib First) and Cohort 2 (IFN-α First). With respect to the primary endpoint, the majority of the selected trials (7/17, 41%) included the evaluation of ORR per BICR, whereas only four (23.5%) trials reported the ORR per LI as the primary outcome. Interestingly, all these four trials were set to assess the efficacy of ALK1 inhibitors (three trials focused on ceritinib and one on crizotinib) in the context of non-small-cell lung cancer. In these trials, tumor assessment was performed every 8 weeks with a median time to first drug response of approximately of 2.0 months by BICR versus 2.3 months per LIs, which indicates a rapid pattern of tumor response to these inhibitors. To evaluate the concordance between the two evaluations in this set of Phase 2 clinical trials, a discrepancy index (DI) was calculated for both the ORR (expressed as % of patients with CR and PR over the total number of the enrolled patients) and the PFS (considering the median PFS time for each treatment group) (Table 1). The DI was used in our previous analyses in order to assess differences between the two evaluations (Dello Russo et al., 2020; 2021). It is calculated as the ratio between the LI evaluation over the corresponding independently assessed endpoint, with a DI >1 indicating that the investigator was “more optimistic” and a DI <1 indicating the opposite, that the “blinded reviewer was more optimistic.” Among all these studies, the average DI for the ORR was 1.24 (95% CI, 1.005–1.478, n = 31) and the average DI for PFS was 1.05 (95% CI, 0.958–1.141). These data suggest that the LIs tend to overestimate the ORR, with a +24% of discrepancy which is little over what observed in our previous analysis (average + 17.5% of positive responses, Dello Russo et al., 2021). In line with previous data, we found a substantial agreement between the two evaluations for the PFS in these Phase 2 clinical trials. Interestingly, in the abovementioned four clinical trials on ALK inhibitors having the ORR per LI as the primary outcome, we found an average DI of 1.19 for both the ORR and the PFS which suggest a +19% overestimation of positive effects by LIs in comparison to BICR on both endpoints. In Table 2, we summarized the results for eight different trials data on the time to first treatment response (TTFR), the time to progression (TTP) and the median PFS. From the comparison of these different endpoints, it emerges that the average time to observe a tumor response to treatment is relatively shorter that the time to reach progression. This is an expected finding, which implies that a reduced number of scans are indeed necessary to observe the response to treatment, therefore explaining in part the higher variability observed in the assessment of ORR locally and centrally. With time and the evaluation of multiple scans, the variability among the two evaluations tends to be reduced. Moreover, we can also hypothesize that it is more difficult to detect a response to treatment particularly in term of distinction between PR and stable disease in comparison to disease progression, which may further contribute to the higher discrepancy observed for the ORR among the two evaluations.
TABLE 2

Comparison between time to first response (TTFR), time to disease progression (TTP), and median PFS time in Phase 2 clinical trials.

Trial registration numberTreatment arm(s)Timing of scansNumber of patients (TTFR)TTFR per LI (months, 95% CI)TTFR per BICR (months, 95% CI)Number of patients (PFS)Time to progression (TTP) per LI (months, 95% CI)Time to progression (TTP) per BICR (months, 95% CI)Median PFS per LI (months, 95% CI)Median PFS per BICR (months, 95% CI)
1NCT00875667E: lenalidomideEvery two cycles for 6 months; every 90 days thereafter until disease progression a E: 170E: 5.5 (3.9 – 5.9)E: 4.3 (3.9 – 11.5)E: 170E: 9.1 (5.8 − 14.1)E: 9.1 (5.6 – 12.2)E: 8.6 (5.6 – 12.1)E: 8.7 (5.5 – 12.1)
AC: investigator’s choice single agentAC: 84AC: 9.2 (5.9 – NR)AC: NRAC: 84AC: 5.7 (3.7 − 8.5)AC: 5.7 (3.7 – 6.9)AC: 5.4 (3.6 – 7.7)AC: 5.2 (3.7 – 6.9)
2NCT01685060 - EudraCT 2012-003432-24Ceritinib (LDK378)Every 8 weeksLI: 573.0 (SD, 3.54)2.2 (SD, 1.44)140NANA5.8 (5.4–7.6)7.4 (5.6–10.9)
BICR: 50
3NCT01685138Ceritinib (LDK378)Every 8 weeksLI: 842.5 (SD, 2.66)2.2 (SD, 1.22)124NANA16.6 (11.0–23.2)19.4 (10.9–29.3)
BICR: 79
7NCT02040870Ceritinib (LDK378)Every 8 weeksLI: 431.90 (1.6–12.9)1.80 (1.6–3.7)103NANA7.2 (4.1–7.5)3.8 (3.6–5.6)
BICR: 33
8NCT02336451Ceritinib (LDK378)At week 8; every 8 weeks thereafter LI Arm 1: 1.87 (1.7–9.3)Arm 1: 2.00 (1.7 – 12.9) LI NANAArm 1: 7.2 (3.3–10.9)Arm 1: 5.0 (3.3–9.1)
Arm 1: 15Arm 2: 2.00 (1.7–9.3)Arm 2: 1.76 (1.6–1.9)Arm 1: 32Arm 2: 5.6 (3.6–9.2)Arm 2: 5.5 (3.6–7.3)
Arm 2: 12Arm 3: 1.82 (1.2–30.1)Arm 3: 1.82 (1.7–26.5)Arm 2: 35Arm 3: NR (1.0 - NR)Arm 3: 15.5 (1.0 - NR)
Arm 3: 6Arm 4: 1.81 (1.3–3.7)Arm 4: 1.81 (1.3–22.0)Arm 3: 6Arm 4: 7.9 (5.5–9.4)Arm 4: 7.7 (5.5–9.7)
Arm 4: 26Arm 5: 1.91 (1.8–3.6)Arm 5: 1.86 (1.8–1.9)Arm 4: 33Arm 5: 5.2 (1.6–7.2)Arm 5: 3.6 (1.6–5.4)
Arm 5: 3Arm 5: 14
BICR BICR
Arm 1: 10Arm 1: 34
Arm 2: 6Arm 2: 36
Arm 3: 4Arm 3: 8
Arm 4: 27Arm 4: 33
Arm 5: 2Arm 5:14
9NCT00117637Sorafenib (BAY43-9006) + interferonEvery 8 weeks LI E: 3.5 (1.6–11.1)E: 1.8 (1.7–3.7)E: 97NANAE: 5.6 (5.4–7.5)E: 5.7 (5.0–7.4)
E: 21
AC: 14AC: 5.4 (1.2–18.3)AC: 5.4 (3.7–11)AC: 92AC: 7.0 (5.4–8.8)AC: 5.6 (3.7–7.4)
BICR
E: 5
AC: 8
10NCT01943864TrametinibEvery 8 weeksLI: 0NA20.1 weeks20NANA10.6 weeks (4.6–12.1)10.6 weeks (4.6–12.7)
BICR: 1
13NCT03525678 (EudraCT 2017-004810-25)Belantamab mafodotin (GSK2857916)Every 3 weeks (radiography for skeletal lytic lesions, but it is not specified in the timing) LI Arm 1: 1.4 (0.8–2.1)Arm 1: 1.4 (0.8–2.1)Arm 1: 97Arm 1: 2.3 (0.8 - NR)Arm 1: 3.0 (0.9 - NR)Arm 1: 2.2 (0.8 - NR)Arm 1: 2.9 (0.9 - NR)
Arm 1: 29Arm 2: 1.5 (0.9–3.0)Arm 2: 1.4 (0.8–2.8)Arm 2: 99Arm 2: 4.2 (1.3 - NR)Arm 2: 5.8 (0.9 - NA)Arm 2: 3.8 (1.1 - NR)Arm 2: 4.9 (0.9 - NR)
Arm 2: 31Arm 3: 0.9 (0.8–1.0)Arm 3: 0.9 (0.8–1.6)Arm 3: 25Arm 3: 4.3 (2.1 - NR)Arm 3: NA (2.2 - NA)Arm 3: 4.3 (2.1 - NR)Arm 3: NA (2.2 - NR)
Arm 3: 13
BICR
Arm 1: 30
Arm 2: 34
Arm 3: 12

Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; NA, not available; NR, not reached; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review.

Data included in the table refer to the analysis at data cutoff date, 7 March 2014 (Trněný et al., 2016).

Comparison between time to first response (TTFR), time to disease progression (TTP), and median PFS time in Phase 2 clinical trials. Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; NA, not available; NR, not reached; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review. Data included in the table refer to the analysis at data cutoff date, 7 March 2014 (Trněný et al., 2016).

Results From Phase 3 Clinical Studies

A total of 21 Phase 3 clinical trials, including both ORR and PFS evaluated locally and centrally were found (Table 3). These included one trial, the NCT00075270 (n. 3) (Di Leo et al., 2008), in which the time to disease progression (TTP) per LI and BRIC was included as primary endpoint and considered in place of the PFS for the present evaluation. The trial NCT01287741 (n. 15) only included the HR for the evaluation of PFS by LI and BICR (Vitolo et al., 2017). Both trials were also included in our previous analysis (Dello Russo et al., 2020), therefore kept in the present analysis. The majority of these studies (17/21, 81%) had an open-label design and (10/21, 47.6%) included the evaluation of PFS per BICR as primary endpoint. In this regard, only six studies out of 21 (28.6%) included the evaluation of PFS per LI as the primary outcome, whereas three studies (14.2%) had both evaluations as the primary endpoint.
TABLE 3

Main characteristics of Phase 3 clinical trials selected for the analysis.

Trial registration number (first posted date)Intervention model/maskingTumor typeTreatment arm(s)Number of patientsPrimary endpointCycle lengthTiming of scansTime frame (primary endpoint)DI ORRDI PFSReference
1NCT00069108 (September 18, 2003)Parallel assignment/none (open label)Colorectal cancerE: XELOXE (ITT): 313PFS per LIE: 3 weeks up to eight cyclesEvery 6 weeks (+ within 2 weeks of study completion, withdrawal or treatment discontinuation)Up to 3 yearsE (ITT): 1.50E (PP): 0.92 Rothenberg et al. (2008)
AC: FOLFOX-4AC (ITT): 314
E (PP): 251AC: 2 weeks up to 12 cyclesAC (ITT): 1.33AC (PP): 1.04
AC (PP): 314
2NCT02370498 (25 Feb 2015)Parallel assignment/none (open label)Gastric adenocarcinomaE: pembrolizumabE (all): 296AC (all: 296PFS (and OS) per BICR in PD-L1+ patientsE: 21 daysEvery 6 weeksUp to 30 monthsE (all): 1.10E (all): 1.07 Shitara et al. (2018)
Gastroesophageal junction adenocarcinomaAC: paclitaxelE (PD-L1+): AC (all): 1.22AC (all): 0.78
196AC: 28 daysE (PD-L1+): 1.09 a E (PD-L1+): 1.07 a
AC (PD-L1+): 199AC (PD-L1+: 1.15 a AC (PD-L1+: 0.76 a
3NCT00075270 (9 Jan 2004)Parallel assignment/Metastatic breast cancerE: lapatinib (+ paclitaxel)E: 291TTP per LI and BICRE: 3 weeksFor efficacy 9 weeks after study entry, at 12-week intervals, and at treatment end. For survival at 12-week intervalsAverage 26 weeksE: 1.31E: 0.86 b Di Leo et al. (2008)
double (participant, investigator)AC: placebo (+ paclitaxel)AC: 288AC: 3 weeksAC: 1.35AC: 0.88 b
4NCT01120184 (10 May 2010)Parallel assignment/Breast cancerAC: trastuzumab + taxanePFSPFS per BICR3 weeks (except paclitaxel every 1 week)Every 9 weeks for 81 weeks, then every 12 weeks thereafter, and/or up to 42 days after last doseUp to 48 monthsAC: 1.02AC: 0.91 Perez et al. (2017)
triple (participant, investigator, and outcomes assessor)E1: trastuzumab emtansine + placeboAC: 365E1: 1.08E1: 1
E2: trastuzumab emtansine + pertuzumabE1: 367E2: 1.05E2:0.97
E2: 363
ORR
AC: 287
E1: 303
E2: 299
5NCT00689936 (4 Jun 2008)Parallel assignment/Multiple myeloma (previously untreated; stem cell transplant ineligible)Arm 1: lenalidomide + low-dose DEX (until disease progression)Arm 1: 535PFS per BICR and LI4 weeksAfter each treatment cycle and everyPFS by BICR: median follow-up time of 17.1 monthsArm 1: 1.07Arm 1: 1.02 Benboubker et al. (2014)
none (open label)Arm 2: lenalidomide + low-dose DEX (18 cycles)Arm 2: 54128 days during the follow-up phasePFS by BICR: median follow-up time of 17.7 monthsArm 2: 1.07Arm 2: 1.01
Arm 3/AC: melphalan + prednisone + thalidomideArm 3/AC: 547Arm 3/AC: 1.08Arm 3/AC: 1.03
6NCT01360554 (25 May 2011)Parallel assignment/Non-small-cell lung cancerE: dacomitinib (PF-00299804) + placebo (erlotinib)All populationPFS per BICR and28 days (continuous oral daily dosing)At the end of cycles 2, 3, and 4, then every other cycleMedian follow-up of 7.1 months, until disease progressionE: 1.13E: 0.73 Ramalingam et al. (2014)
quadruple (participant, care provider, investigator, and outcomes assessor)AC: erlotinib + placebo (PF-00299804)E: 439PFS in KRAS wild-type patientsAC: 1.29AC: 0.76
AC: 439
7NCT01774721 (24 Jan 2013)/EudraCTParallel assignment/Non-small-cell lung cancer with EGFR-activating mutationsE: dacomitinib (PF-00299804)E: 227PFS per BICR28 days (continuous oral daily dosing)At the end of cycles 1–2, then atUp to 48 monthsE: 1.01E: 1.13 Wu et al. (2017)
2012-004977-23 (25 Oct 2018)none (open label)AC: gefitinibAC: 225every other cycleAC: 0.98AC: 1.20
8NCT02604342 (13 Nov 2015)/EudraCTParallel assignment/Non-small-cell lung cancerE: alectinibE: 79PFS per LI3 weeks (alectinib: continuous oral twice daily dosing)Every 6 weeksUp to 33 monthsE: 1.40E: 1.35 Novello et al. (2018)
2015-000634-29none (open label)AC: premetrexed/AC: 40AC: 0.22AC: 0.875
docetaxel
9NCT01245062 (22 Nov 2010)Crossover assignment/MelanomaE: trametinib (GSK1120212)BRAF V600E + w/o brain metastasisPFS in BRAF V600E+ w/o brain metastasis per BICR and LI3 weeks (trametinib: continuous dosing)At weeks 6, 12Average of 20.3 monthsBRAF V600E+ w/o brain metastasisBRAF V600E+ w/o brain metastasis Flaherty et al. (2012)
none (open label)AC: dacarbazine or paclitaxelE: 17821, and 30; then, every 12 weeksE: 1.30E: 0.92
AC: 75AC: 2.33AC: 0.88
10NCT02718417 (24 Mar 2016) EudraCTParallel assignment/Ovarian cancerAC: chemotherapy then observationAC: 335PFS per BICRChemotherapy: 3 weeksAfter three cycles and at completion of chemotherapy; then, every 12 weeks during maintenanceMaximum duration of 27 monthsAC: 0.914 a AC: NA a Monk et al. (2021)
2015-003239-36none (open label)E1: chemotherapy then avelumab in maintenanceE1: 332Avelumab: 2 weeksE1: 0.852E1: 0.821
E2: chemotherapy in combination with avelumab then avelumab in maintenanceE2: 331E2: 0.864E2: 0.890
11NCT00083889 (4 Jun 2004)Parallel assignment/Renal cell carcinomaAC: IFNαAC: 375PFS per BICR and LIAC: 3 weeksAt day 28 of cycles 1 through 4, and every two cycles thereafter until the end of treatmentDuration of treatment phaseAC: 1.50AC: 1.00 Motzer et al. (2007)
none (open label)E: sunitinib (SU011248)E: 375E: 6 weeksE: 1.19E: 0.99
12NCT02421588 (April 20, 2015) EudraCTParallel assignment/Ovarian cancer (platinum resistant)E (Arm A): lurbinectedin (PM01183)E: 221PFS per BICRE: 3 weeksEvery 8 weeksUp to 3 yearsE: 1.09E: 1.10 Gaillard et al. (2021)
2014–005251-39 (17 Oct 2019)none (open label)AC (Arm B): pegylated liposomal doxorubicin or topotecanAC: 221AC: pegylated liposomal doxorubicin (4 weeks)AC: 1.31AC: 1.00
topotecan (3 weeks)
13NCT03052608 (14 Feb 2017)Parallel assignment/Non-small-cell lung cancerE: lorlatinibE: 149PFS per BICR28 daysEvery 8 weeks (±1 week)Up to 33 monthsE: 1.06 a E: NA a Shaw et al. (2020)
none (open label)AC: crizotinibAC: 147AC: 1.07AC: 0.98
14NCT01102426 (13 Apr 2010)Parallel assignment/Relapsed/refractory multiple myelomaE: plitidepsin + dexamethasoneE: 171PFS per BICR4 weeksNAUp to 5 yearsE: 1.31E: 1.12 Spicka et al. (2019)
none (open label)AC: dexamethasoneAC: 84AC: 0.33AC: 0.65
15NCT01287741 c Parallel assignment/Diffuse large B-cell lymphomaE: obinutuzumab + chemotherapyE: 712PFS per LI21 days4–8 weeks (CT) or 6–8 weeks (FDG-PET) after the last treatment or sooner in the case of early discontinuationLE up to approximately 6.5 yearsE: 0.999 a NA a Vitolo et al. (2017)
(1 Feb 2011) EudraCT 2010-024194-39none (open label)AC: rituximab + chemotherapyAC: 706BICR: up to approximately 4 years and 9 monthsAC: 0.992 a
(23 Apr 2017)
16NCT02580058 d Parallel assignment/Ovarian cancerE1: avelumabE1: 188PFS perAvelumab: 2 weeks; doxorubicin: 4 weeksMRI or CT scans every 8 weeksUp to 30 monthsE1: 1.43E1: 1.00 Pujade-Lauraine et al. (2021)
(20 Oct 2015)none (open label)E2: avelumab plus pegylated liposomal doxorubicin (PLD)E2: 188BICR and OSE2: 1.40E2: 1.27
AC: PLD aloneAC: 190AC: 2.26AC: 1.06
17NCT02603432 d parallel assignment/Urothelial cancerE (Arm A): avelumab plus best supportive care (BSC)E (Arm A): 350OS4 weeksEvery 8 weeks for 12 months and then every 12 weeksUp to 41 months at the time of final analysisE (Arm A): 1.27E (Arm A): 1.5 Powles et al. (2020)
(11 Nov 2015)none (open label)Arm B: best supportive care (BSC) aloneArm B: 350Arm B: 2.43Arm B: 1.05
18EudraCTParallel assignment/Untreated advanced renal cell carcinomaAC: sunitinibITT:PFS per LI in PD-L1 selected populationAC: 4 weeks on, 2 weeks offAt week 12, then every 6 weeks up to week 78, and then every 12 weeksUp to approximately 24 monthsITTITT Rini et al. (2019)
2014-004684-20 NCT02420821 (20 Apr 2015)none (open label)E: atezolizumab + bevacizumabAC: 461E: 3 weeksAC: 1.064AC: 1.012
E: 454E: 1.099E: 1.167
19EudraCT 2010-024132-41Parallel assignment/Non-Hodgkin’s lymphomaE: obinutuzumab + chemotherapyFollicular lymphoma populationPFS per LI in the follicular lymphoma population21 or 28 daysAfter three cycles (bendamustine treated) or four cycles (CHOP or CVP) and on the completion of induction therapy; every 2 months for 2 years; then, every 3–6 months, with CT performed every 6–12 months, until progression or withdrawal from the trialUp to ∼4 years and 7 monthsE: 0.957E: 1.15 Hiddemann et al. (2018); Marcus et al. (2017)
(16 Mar 2017)none (open label)AC: rituximab + chemotherapyE: 601AC: 0.970AC: 1.08
NCT01332968 (11 Apr 2011)AC: 601
20EudraCTParallel assignment/Advanced BRAFV600 wild-type melanomaE: cobimetinib + atezolizumabE: 222PFS per BICREvery 8 weeks through 80 weeks; then, every 12 weeks until progression3 weeksFor approximately 16 monthsE: 1.07E: 1.02 Gogas et al. (2021)
2016-004387-18 d (01 May 2020) NCT03273153 (6 Sep 2017)none (open label)AC: pembrolizumabAC: 224AC: 1.16AC: 1.26
21NCT00789373 (First posted: 11 Nov 2008)Parallel assignment/Non-small-cell lung cancerE: pemetrexed (maintenance)E: 316/359PFS per LIEvery other cycle (6 weeks [±1])21 days(Up to 19.3 months)E: 1E: 1.043 Paz-Ares et al. (2012)
quadruple (participant, care provider, investigator, and outcomes assessor)AC: placeboAC: 156/180AC: 1AC: 1.088

Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; PD-L1, programmed cell death ligand 1; PP, per protocol; TTP, time to progression.

Not included in the analysis because the subgroup of the whole population or the specific arm/group was missing the DI for one of the outcomes (either ORR or PFS).

Time to progression was used for comparative analysis.

Trial NCT01287741 is included in the table because HR values for PFS per LI and BICR were available for comparison. DI was calculated and included in a pooled analysis (see text).

For these trials, DI based of the HR values for PFS was not available.

Main characteristics of Phase 3 clinical trials selected for the analysis. Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; PD-L1, programmed cell death ligand 1; PP, per protocol; TTP, time to progression. Not included in the analysis because the subgroup of the whole population or the specific arm/group was missing the DI for one of the outcomes (either ORR or PFS). Time to progression was used for comparative analysis. Trial NCT01287741 is included in the table because HR values for PFS per LI and BICR were available for comparison. DI was calculated and included in a pooled analysis (see text). For these trials, DI based of the HR values for PFS was not available. To assess the concordance between the two evaluations in this set of Phase 3 clinical trials, a discrepancy index (DI) was calculated for both the ORR (expressed as % of patients with CR and PR over the total of the enrolled patients) and the PFS (considering the median PFS time for each treatment group). Among all these studies, the average DI for the ORR was 1.20 (95% CI, 1.075–1.328, n = 42) and for PFS was 1.014 (95% CI, 0.963–1.065, n = 42). These data suggest that the LIs tend to overestimate the ORR, by a factor of +20% in the selected Phase 3 clinical trials, whereas there is a substantial agreement between the two evaluations for the PFS when considering the PFS median time. In addition, for the ORR, we calculated the DI for the odd ratios (ORs), when reported (four trials), and for the PFS, the DI based on the ratio of HRs was calculated per type of evaluation for most of the trials. The DI calculated on the ORs for the ORR was 1.04, 95% CI 0.912–1.168, indicating a potential agreement when it comes to the evaluation of treatment effect among the two evaluations. With respect of the DI calculated on the HRs for PFS, out of 21 trials included in the present analysis, 13 were already considered in our previous study (Dello Russo et al., 2020), whereas six trials brought additional new data. On average, the DI calculated on the HRs from these six trials was lower than that from the previous observations (0.84, 95% CI 0.756–0.923, n = 6). However, by pooling these data with those of previous analysis, the average DI value was only slightly reduced, and 95% CI included the value 1 (0.958, 95% CI 0.91–1.01, n = 38). As shown in Table 4, for three Phase 3 trials, it was possible to evaluate the TTFR together with the PFS. In line with data from Phase 2 clinical trials, the TTFR was shorter than the median PFS, with minimal differences observed in NCT02603432 (n. 17) trial evaluating the efficacy of avelumab in combination with best supportive care as maintenance therapy in patients with locally advanced or metastatic urothelial cancer with stable disease after completion of first-line platinum-containing chemotherapy (Powles et al., 2020). However, in this study only 34 responses out of 350 treated patients were observed in the experimental group in comparison to five responses out of 350 patients in the comparator arm by BICR. The LI reported a higher number of tumor responses in comparison to BICR. The fact that the ORR is often calculated based on a reduced number of responses may also contribute to the higher variability observed for the assessment of this endpoint by LIs and BICR.
TABLE 4

Comparison between time to first response (TTFR), time to disease progression (TTP), and median PFS in Phase 3 clinical trials.

Trial registration numberTreatment arm(s)Timing of scansNumber of patients (TTFR)TTFR per LI (months, 95% CI)TTFR per BICR (months, 95% CI)Number of patientsTime to progression (TTP) per LI (months, 95% CI)Time to progression (TTP) per BICR (months, 95% CI)Median PFS per LI (months, 95% CI)Median PFS per BICR (months, 95% CI)
5NCT00689936Arm 1: lenalidomide + low-dose DEX (until disease progression)After each treatment cycle and every 28 days during the follow-up phaseArm 1: 402Arm 1: 1.8 (0.50–22.2)Arm 1: 1.8 (0.7–22.2)Arm 1: 535NANAArm 1: 26.0 (20.7–29.7)Arm 1: 25.5 (20.7–29.4)
Arm 2: lenalidomide + low-dose DEX (18 cycles)Arm 2: 397Arm 2: 1.8 (0.8–34.8)Arm 2: 1.8 (0.8–17.1)Arm 2: 541Arm 2: 21.0 (19.7–22.4)Arm 2: 20.7 (19.4–22.0)
Arm 3/AC: melphalan + prednisone + thalidomideArm 3/AC: 341Arm 3/AC: 2.8 (1.2–56.3)Arm 3/AC: 2.8 (1.3–49.7)Arm 3/AC: 547Arm 3/AC: 21.9 (19.8–23.9)Arm 3/AC: 21.2 (19.3–23.2)
13NCT03052608E: lorlatinibEvery 8 weeks (±1 week)E: 113NAE: 1.8 (1.7–1.9)E: 149NANAE: NR (NR to NR)E: NR (NR to NR)
AC: crizotinibAC: 85AC: 1.8 (1.7–1.9)AC: 147AC: 9.1 (7.4–10.9)AC: 9.3 (7.6–11.1)
17NCT02603432E (Arm A): avelumab plus best supportive care (BSC)Every 8 weeks for 12 months and then every 12 weeksLIE (Arm A): 2.0 (1.8–22.2)E (Arm A): 2.0 (1.7–16.4)E (Arm A): 350NANAE (Arm A): 5.5 (4.2–7.2)E (Arm A): 3.7 (3.5–5.5)
Arm B: best supportive care (BSC) aloneE (Arm A): 43Arm B: 1.9 (1.1–10.9)Arm B: 2.0 (1.8–7.0)Arm B: 350Arm B: 2.1 (1.9–3.0)Arm B: 2.0 (1.9–2.7)
Arm B: 12
BICR
E (Arm A): 34
Arm B: 5

Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review; NA, not available; NR, not reached.

Comparison between time to first response (TTFR), time to disease progression (TTP), and median PFS in Phase 3 clinical trials. Trials that were included in our initial analysis (Dello Russo et al., 2021) are highlighted in gray. ITT, intention to treat population; ORR, objective response rate; PFS, progression-free survival; BICR, blind independent central review; NA, not available; NR, not reached.

Conclusion

Considering the review of available evidence carried out in this study, along with additional new evidence shown here for the first time, we can attempt to draw some overall conclusions: ✓ The additional analysis presented in this work confirmed that 1) LIs tend to overestimate the ORR in comparison to BICR, whereas 2) no significant differences are observed between LI and BICR concerning the assessment of PFS. ✓ The assessment of ORR is endowed with higher variability than that of PFS. Such larger variability is associated with various factors, including 1) a limited number of measurements, compared to repeated measures with PFS; 2) the time-to-response, which is a variable in ORR assessment, whereas PFS is always measured once a response is established; 3) the time-to-response is in turn influenced by the type of treatment, with small molecules in general inducing faster responses than immunotherapies; 4) some protocols may assess ORR at fixed times, while other may consider the best response to calculate ORR. Thus, a higher variability seems to be associated to a significant expectation bias. ✓ The analysis of ORR assessment in Phase 3 trials showed that, luckily, LIs tend to overestimate ORR compared to BICR both in experimental and control groups. Thus, by analyzing the data as ORs, the overestimations of control ORRs tend to counterbalance those of experimental ORRs, thereby reducing the gap between LI and BICR.
  69 in total

1.  New guidelines to evaluate the response to treatment in solid tumors. European Organization for Research and Treatment of Cancer, National Cancer Institute of the United States, National Cancer Institute of Canada.

Authors:  P Therasse; S G Arbuck; E A Eisenhauer; J Wanders; R S Kaplan; L Rubinstein; J Verweij; M Van Glabbeke; A T van Oosterom; M C Christian; S G Gwyther
Journal:  J Natl Cancer Inst       Date:  2000-02-02       Impact factor: 13.506

2.  A phase II study of trastuzumab emtansine in patients with human epidermal growth factor receptor 2-positive metastatic breast cancer who were previously treated with trastuzumab, lapatinib, an anthracycline, a taxane, and capecitabine.

Authors:  Ian E Krop; Patricia LoRusso; Kathy D Miller; Shanu Modi; Denise Yardley; Gladys Rodriguez; Ellie Guardino; Michael Lu; Maoxia Zheng; Sandhya Girish; Lukas Amler; Eric P Winer; Hope S Rugo
Journal:  J Clin Oncol       Date:  2012-05-29       Impact factor: 44.544

3.  An Overview of Cancer Drugs Approved by the US Food and Drug Administration Based on the Surrogate End Point of Response Rate.

Authors:  Emerson Y Chen; Vikram Raghunathan; Vinay Prasad
Journal:  JAMA Intern Med       Date:  2019-07-01       Impact factor: 21.873

4.  Final Overall Survival and Other Efficacy and Safety Results From ASCEND-3: Phase II Study of Ceritinib in ALKi-Naive Patients With ALK-Rearranged NSCLC.

Authors:  Makoto Nishio; Enriqueta Felip; Sergey Orlov; Keunchil Park; Chong-Jen Yu; Chun-Ming Tsai; Manuel Cobo; Mark McKeage; Wu-Chou Su; Tony Mok; Giorgio V Scagliotti; David R Spigel; Kalyanee Viraswami-Appanna; Zhe Chen; Vanessa Q Passos; Alice T Shaw
Journal:  J Thorac Oncol       Date:  2019-11-25       Impact factor: 15.609

5.  Antitumor activity and safety of pembrolizumab in patients with advanced recurrent ovarian cancer: results from the phase II KEYNOTE-100 study.

Authors:  U A Matulonis; R Shapira-Frommer; A D Santin; A S Lisyanskaya; S Pignata; I Vergote; F Raspagliesi; G S Sonke; M Birrer; D M Provencher; J Sehouli; N Colombo; A González-Martín; A Oaknin; P B Ottevanger; V Rudaitis; K Katchar; H Wu; S Keefe; J Ruman; J A Ledermann
Journal:  Ann Oncol       Date:  2019-07-01       Impact factor: 32.976

Review 6.  Progression-free survival is a suboptimal predictor for overall survival among metastatic solid tumour clinical trials.

Authors:  Dario Pasalic; Gwendolyn J McGinnis; C David Fuller; Aaron J Grossberg; Vivek Verma; Walker Mainwaring; Austin B Miller; Timothy A Lin; Amit Jethanandani; Andres F Espinoza; Markus Diefenhardt; Prajnan Das; Vivek Subbiah; Ishwaria M Subbiah; Reshma Jagsi; Adam S Garden; Emmanouil Fokas; Claus Rödel; Charles R Thomas; Bruce D Minsky; Ethan B Ludmir
Journal:  Eur J Cancer       Date:  2020-07-20       Impact factor: 9.162

7.  Blinded independent central review of progression in cancer clinical trials: results from a meta-analysis.

Authors:  O Amit; F Mannino; A M Stone; W Bushnell; J Denne; J Helterbrand; H U Burger
Journal:  Eur J Cancer       Date:  2011-03-21       Impact factor: 9.162

8.  First-Line Lorlatinib or Crizotinib in Advanced ALK-Positive Lung Cancer.

Authors:  Alice T Shaw; Todd M Bauer; Filippo de Marinis; Enriqueta Felip; Yasushi Goto; Geoffrey Liu; Julien Mazieres; Dong-Wan Kim; Tony Mok; Anna Polli; Holger Thurm; Anna M Calella; Gerson Peltz; Benjamin J Solomon
Journal:  N Engl J Med       Date:  2020-11-19       Impact factor: 91.245

9.  New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1).

Authors:  E A Eisenhauer; P Therasse; J Bogaerts; L H Schwartz; D Sargent; R Ford; J Dancey; S Arbuck; S Gwyther; M Mooney; L Rubinstein; L Shankar; L Dodd; R Kaplan; D Lacombe; J Verweij
Journal:  Eur J Cancer       Date:  2009-01       Impact factor: 9.162

10.  Evaluation bias in objective response rate and disease control rate between blinded independent central review and local assessment: a study-level pooled analysis of phase III randomized control trials in the past seven years.

Authors:  Jianrong Zhang; Yiyin Zhang; Shiyan Tang; Hengrui Liang; Difei Chen; Long Jiang; Qihua He; Yu Huang; Xinyu Wang; Kexin Deng; Shuhan Jiang; Jiaqing Zhou; Jiaxuan Xu; Xuanzuo Chen; Wenhua Liang; Jianxing He
Journal:  Ann Transl Med       Date:  2017-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.