Literature DB >> 33836710

Frequent fragility of randomized controlled trials for HCC treatment.

Abstract

BACKGROUND: The fragility index (FI) of trial results can provide a measure of confidence in the positive effects reported in randomized controlled trials (RCTs). The aim of this study was to calculate the FI of RCTs supporting HCC treatments.
METHODS: A methodological systematic review of RCTs in HCC treatments was conducted. Two-arm studies with randomized and positive results for a time-to-event outcome were eligible for the FI calculation.
RESULTS: A total of 6 trails were included in this analysis. The median FI was 0.5 (IQR 0-10). FI was ≤7 in 4 (66.7%) of 6 trials; in those trials the fragility quotient was ≤1%.
CONCLUSION: Many phase 3 RCTs supporting HCC treatments have a low FI, which challenges the confidence in concluding the superiority of these drugs over control treatments.

Entities: Chemical Disease Gene Species

Keywords: Endpoint; Fragility index; Randomized controlled trials

Year: 2021 PMID： 33836710 PMCID： PMC8034173 DOI： 10.1186/s12885-021-08133-8

Source DB: PubMed Journal: BMC Cancer ISSN： 1471-2407 Impact factor: 4.430

Background

Modern medicine is built on evidence-based clinical practice, with randomized controlled trials (RCTs) forming the foundation of such evidence. Because RCTs play important roles in governing clinical practice, the robustness of their results is critical. The results of clinical trials must be valid, reproducible, and repeatable; however, in the context of clinical research, reproducibility and replicability are generally under-researched topics. Historically, P values have been used to indicate statistical the significance of results in clinical trials [1]. Nevertheless, this approach has some significant limitations and has been heavily criticized for being simplistic, with frequent misapplication and misinterpretation [2]. The fragility index (FI) is a novel tool, which was developed to assess the robustness of statistically significant dichotomous outcomes from RCTs [3]. It is defined as the minimum number of patients receiving experimental treatment whose status would have to change from a non-event to an event to nullify a meaningful result. A higher FI represents a relativiely robust outcome and indicates that the statistical significance of a given outcome hinges on a greater number of events, whereas a lower FI indicates that the statistical significance of a given outcome depends on only a few events, which suggests a more fragile outcome. The recommendation of new drugs or treatments for use in clinical practice, mainly depends on the results of phase 3 clinical trials. Thus, this study was performedto analysis to assess the wider implications of the FI in the findings of HCC treatments in phase 3 clinical trials.

Methods

This study conducted a methodological systematic review of phase 3 RCTs for HCC treatment. The search terms used were (hepatocellular carcinoma OR hepatocarcinoma OR “liver cancer” OR HCC) AND (“phase 3” OR “phase III”). Only articles published in English were searched for using PubMed search engine and Medline database until August 1, 2019. For the FI analysis, only two-arm studies with randomization that reported significant positive results with primary or secondary outcomes were included. Data was obtained on trial design, trial number, and the observed numbers of events for the control and experimental groups in primary or secondary time-to-event outcomes. The FI was calculated from a two-by-two contingency table by the iterative addition of an event to the experimental group, which was determined using a web-based fragility calculator (available at http://www.clincalc.com/Stats/FragilityIndex.aspx). P values were calculated using Fisher’s Exact Test. A sample of FI is presented in Fig. 1.

Fig. 1

Example of fragility index calculation for the phase 3 trial SILIUS reported by Kudo M, et al [4]

Example of fragility index calculation for the phase 3 trial SILIUS reported by Kudo M, et al [4] The fragility quotient (FQ) is a metric, that accounts for the FI in the context of sample size [5]. It is described as the FI divided by the total sample size. The usefulness of the FQ lies in its ability to allocate an objective value to the results of subjective importance, and it may be assigned to an outcome with a given FI in a certain sample size [5]. In other words, the FQ assesses the robustness of the FI.

Results

This study identified 125 records through a series of PubMed searches (Fig. 2). After an initial screening of abstracts and a full-text review of the studies, 6 articles were included in the fragility analysis (Table 1, Fig. 3) [4, 6–10]. The other five RCTs were excluded, as FI can only be calculated in RCTs that allocate 1:1 randomization (Supplementary Table 1). The median sample size for the 6 eligible RCTs was 257 (IQR 220.75–539), and the median FI for the 6 studies was 0.5 (IQR 0–10). The FI ≤ was 7 in 4 (72.73%) of 6 trials [7-10], and those trials had FQ < 1%.

Fig. 2

Flow chart for included studies

Table 1

Fragility index calculated for 6 phase 3 trials with 1:1 randomization for HCC treatment

Author	Study name	Clinical Trial	Experimental Treatment vs. Control	Endpoint	Experimental sample size	Experimental event number	Control sample size	Control event number	P vaule	Fragility index	Fragility quotient
Kudo M et al. [4].	SILIUS	NCT01214343	Sorafenib plus HAIC (hepatic arterial infusion chemotherapy) vs. Sorafenib	Primary outcome: Overall response	102	37	103	18	0.003	7	3.41%
Wang Z et al. [6].	NA	NCT01966133	adjuvant TACE vs. No adjuvant TACE	Primary endpoint: Recurrence-free survival	140	46	140	82	0.01	19	6.79%
Lee JH et al. [7].	NA	NCT00699816	CIK cell agent vs. No CIK cell agent	Primary end point: Recurrence-free survival	114	69	112	59	0.01	0	0%
Llovet JM et al. [8].	SHARP	NCT00105443	Sorafenib vs. Placebo	Primary endpoint: Overall survival	299	44	303	33	0.00583	0	0%
Wei W et al. [9].	NA	NCT02788526	Hepatectomy plus TACE vs. Hepatectomy	Primary endpoint: Disease-free survival	116	83	118	85	0.02	0	0%
Geissler EK et al. [10].	NA	NCT0035586.	Liver transplantation with sirolimus vs. Liver transplantation	Secondary endpoint: Overall survival	252	242	256	234		1	0.20%

Fig. 3

FI and FQ in the included studies

Flow chart for included studies Fragility index calculated for 6 phase 3 trials with 1:1 randomization for HCC treatment FI and FQ in the included studies Five studies in the fragility analysis were for primary outcome results. Three (60%) had primary outcome trials with a FI of 0 (Fisher’s exact test p > 0.05), for which a stratified log-rank test was used to calculate the reported significant P value [7-9], and these three (60%) trials had an FQ < 1% [7-9]. The article with the highest FI fragility index of 19 was published in the Clinical Cancer Research [6]. However, this study was not a multiple center trial. The remaining 1 study was evaluated with inferior outcome results, whereas non significant differences were found in the primary outcome results. The study of the FI was 1, and the FQ was less than 1% [10].

Discussion

To the best of our knowledge, FI investigation for HCC trials has not been performed. The FI has been evaluated in other RCTs, such as emergency medicine [11], giant cell arteritis, Clinical Practice Guidelines [12], and cardiac surgery field [13]. These studies consistently show that many RCTs are fragile, and several researchers have recommended that FI should be adopted in reporting clinical trial outcomes [12, 14], our study showed that most results from the randomized trials were far more fragile. This analysis demonstrated that over 60% of the phase 3 trials supporting HCC treatments had a low FI; however, they are vulnerable to losing their significance with just a small change in the designation of a small number of events, often equating to < 1% of the sample size in an experimental group. As clinical practices or the use of drugs approved by Food and Drug Administration are developed on the results of phase 3 clinical trials, the change in the number of events required for fragility raises concerns about a statistical change in the results. RCTs, particularly phase 3 clinical trials, are likely to remain an important evidence base for clinicians’ practice. Despite this, the statistical methodology used to establish significance in such clinical trials has barely evolved. In principle, the P value is an indication of the compatibility among data from a trial; a smaller P value implies a greater statistical incompatibility of the result with the null hypothesis (an estimation of no difference between the experimental and control group [15]). However, this approach has been greatly criticized for being simplistic, and has frequently been misinterpreted [16]. The log-rank test used in survival data analysis has advantage in that it accounts for events, but it relies on the assumption that the hazard ratio of two treatments remains constant over time. Fisher’s exact test, which is used to calculate the FI, has the disadvantage of not accounting for the time-to-event [17]. Thus, the FI is simplistic in its application and resolves some of these shortcomings. Although the FI and FQ do provide a relative wealth of information when consider alongside other metrics, this study again emphasizes the limitations of the FI itself. First, clinical trials must obtain significant in effects in the treatment group, which means that treatment group got better results compared with control group. These trails could be included to be analyzed by the FI. Many non-inferiority studies cannot be included in this analysis, such as the E7080 trials of lenvatinib for HCC, which produced the same treatment results as sorafenib22. Second, because the FI relies on P value, it is essentially an extension of the most frequent approach to data analysis. Thus, it cannot be applied to an outcome of a continuous variable. Third, although many time-to-event outcomes are usually dichotomous, such as mortality, and survival, etc., the FI does not account for the difference in outcomes over time. Particularly in longer studies with variable follow-up time periods, analyses that account for time (such as a Kaplan–Meier curve, or a Cox proportional hazards model) are more appropriate than a simple binary outcome analysis. Fourth, our study shows a tendency of the inverse correlation between the FI and p-value, which is similar with previous FI studies [18, 19]. This might be the RCT studies included small number patients. Also, The FI was much higher as the samples increasing [20, 21]. Finally, there is no specific cut-off value or lower limit of the FI to classify a study as “either fragile” or “robust”.

Conclusion

The outcomes of many phase 3, RCTs supporting HCC treatments with a low FI challenges the confidence in concluding the superiority of these drugs over control treatments. Additional file 1: Table S1. The exclusion cause and names of the excluded RCTs as FI can only be calculated in RCTs that allocate 1:1 randomization. Supplementary References.

21 in total

1. Scientists rise up against statistical significance.

Authors: Valentin Amrhein; Sander Greenland; Blake McShane
Journal: Nature Date: 2019-03 Impact factor: 49.962

2. The Fragility Index: a P-value in sheep's clothing?

Authors: Rickey E Carter; Paul M McKie; Curtis B Storlie
Journal: Eur Heart J Date: 2017-02-01 Impact factor: 29.983

3. Does Sample Size Matter When Interpreting the Fragility Index?

Authors: Wael Ahmed; Robert A Fowler; Victoria A McCredie
Journal: Crit Care Med Date: 2016-11 Impact factor: 7.598

Review 4. The statistical significance of randomized controlled trial results is frequently fragile: a case for a Fragility Index.

Authors: Michael Walsh; Sadeesh K Srinathan; Daniel F McAuley; Marko Mrkobrada; Oren Levine; Christine Ribic; Amber O Molnar; Neil D Dattani; Andrew Burke; Gordon Guyatt; Lehana Thabane; Stephen D Walter; Janice Pogue; P J Devereaux
Journal: J Clin Epidemiol Date: 2014-02-05 Impact factor: 6.437

5. Adjuvant Transarterial Chemoembolization for HBV-Related Hepatocellular Carcinoma After Resection: A Randomized Controlled Study.

Authors: Zheng Wang; Zhenggang Ren; Yi Chen; Jie Hu; Guohuan Yang; Lei Yu; Xinrong Yang; Ao Huang; Xin Zhang; Shaolai Zhou; Huichuan Sun; Yanhong Wang; Ningling Ge; Xiaoyu Xu; Zhaoyou Tang; Wanyee Lau; Jia Fan; Jiping Wang; Jian Zhou
Journal: Clin Cancer Res Date: 2018-02-02 Impact factor: 12.531

6. Adjuvant immunotherapy with autologous cytokine-induced killer cells for hepatocellular carcinoma.

Authors: Joon Hyeok Lee; Jeong-Hoon Lee; Young-Suk Lim; Jong Eun Yeon; Tae-Jin Song; Su Jong Yu; Geum-Youn Gwak; Kang Mo Kim; Yoon Jun Kim; Jae Won Lee; Jung-Hwan Yoon
Journal: Gastroenterology Date: 2015-03-04 Impact factor: 22.682

7. How Fragile Are Clinical Trial Outcomes That Support the CHEST Clinical Practice Guidelines for VTE?

Authors: Elizabeth Edwards; Cole Wayant; Jonathan Besas; Justin Chronister; Matt Vassar
Journal: Chest Date: 2018-02-02 Impact factor: 9.410

8. The p-Value You Can't Buy.

Authors: Eugene Demidenko
Journal: Am Stat Date: 2016-03-31 Impact factor: 8.710

9. Sirolimus Use in Liver Transplant Recipients With Hepatocellular Carcinoma: A Randomized, Multicenter, Open-Label Phase 3 Trial.

Authors: Edward K Geissler; Andreas A Schnitzbauer; Carl Zülke; Philipp E Lamby; Andrea Proneth; Christophe Duvoux; Patrizia Burra; Karl-Walter Jauch; Markus Rentsch; Tom M Ganten; Jan Schmidt; Utz Settmacher; Michael Heise; Giorgio Rossi; Umberto Cillo; Norman Kneteman; René Adam; Bart van Hoek; Philippe Bachellier; Philippe Wolf; Lionel Rostaing; Wolf O Bechstein; Magnus Rizell; James Powell; Ernest Hidalgo; Jean Gugenheim; Heiner Wolters; Jens Brockmann; André Roy; Ingrid Mutzbauer; Angela Schlitt; Susanne Beckebaum; Christian Graeb; Silvio Nadalin; Umberto Valente; Victor Sánchez Turrión; Neville Jamieson; Tim Scholz; Michele Colledan; Fred Fändrich; Thomas Becker; Gunnar Söderdahl; Olivier Chazouillères; Heikki Mäkisalo; Georges-Philippe Pageaux; Rudolf Steininger; Thomas Soliman; Koert P de Jong; Jacques Pirenne; Raimund Margreiter; Johann Pratschke; Antonio D Pinna; Johann Hauss; Stefan Schreiber; Simone Strasser; Jürgen Klempnauer; Roberto I Troisi; Sherrie Bhoori; Jan Lerut; Itxarone Bilbao; Christian G Klein; Alfred Königsrainer; Darius F Mirza; Gerd Otto; Vincenzo Mazzaferro; Peter Neuhaus; Hans J Schlitt
Journal: Transplantation Date: 2016-01 Impact factor: 4.939

10. Adjuvant transcatheter arterial chemoembolization after curative resection for hepatocellular carcinoma patients with solitary tumor and microvascular invasion: a randomized clinical trial of efficacy and safety.

Authors: Wei Wei; Pei-En Jian; Shao-Hua Li; Zhi-Xing Guo; Yong-Fa Zhang; Yi-Hong Ling; Xiao-Jun Lin; Li Xu; Ming Shi; Lie Zheng; Min-Shan Chen; Rong-Ping Guo
Journal: Cancer Commun (Lond) Date: 2018-10-10