Literature DB >> 34910598

Making Sense of Phase II Trials for Investigational Agents in COVID-19: The Case of Ilomedin in Mechanically Ventilated Patients.

Fernando G Zampieri1, Adit Ginde2.   

Abstract

Entities:  

Mesh:

Year:  2022        PMID: 34910598      PMCID: PMC8887006          DOI: 10.1164/rccm.202111-2593ED

Source DB:  PubMed          Journal:  Am J Respir Crit Care Med        ISSN: 1073-449X            Impact factor:   21.405


× No keyword cloud information.
Feasibility trials are underappreciated, although they are an essential part of building a solid evidence-based practice (1). Such studies are designed to test important aspects of future larger clinical trials, including plausible inclusion rate, logistic procedures (drug supply and data collection), site monitoring, and early safety in the population of interest, among others. By their own nature and comparatively small sample size, feasibility trials can be challenging to interpret for preliminary efficacy results. A “positive” finding from a small trial can be overinflated (“winners curse” [2] or publication bias), whereas a “neutral” result is frequently only the reflex of low power. Reckless interpretation of data leads to euphoria or deception, and the former inevitably leads to the latter. In this issue of the Journal, Johansson and colleagues (pp. 324–329) present the results of a well-conducted feasibility/phase II trial that excels at being exactly what it was designed to be. The authors randomized 80 patients with coronavirus disease (COVID-19) on mechanical ventilation and with high (>4 ng/ml) thrombomodulin to receive intravenous prostacyclin or placebo (3). The trial’s rationale is based on the premise that prostacyclin could attenuate endotheliopathy, mainly through local vasodilatation and platelet adhesion inhibition; therefore, the use of a serum thrombomodulin threshold for inclusion is a clever predictive enrichment strategy used by the authors. The primary endpoint was days alive and free of mechanical ventilation, an endpoint that is patient-centered, may maximize power owing to greater granularity than binary or ordinal endpoints, and is aligned with the expected mechanism of action of the drug. The study was powered to detect a 20% increase in days free of mechanical ventilation (from 10 to 12, assuming an SD of 3), which is a bold assumption. The trial succeeded in demonstrating that the intervention can be feasibly given to mechanically ventilated patients with COVID-19, with apparently no exceedingly major safety issues, although therapy for five patients was discontinued without a clear reason described in the manuscript. Point estimates favored the intervention, with the results for the sequential organ failure assessment score passing through the heralded P < 0.05 threshold, though without adjustment for multiple comparisons. These clinical trials are challenging to conduct, especially during COVID-19, and the authors should be commended for pursuing this biologically driven approach to identify a new treatment for critical COVID-19, an area that remains remarkably lacking in effective treatment options. It should also be noticed, however, that several issues challenge proper conclusions from the results. The report lacks a clear definition and reporting on mechanical ventilation–free days, including how deaths were considered in its calculation and a lack of presentation of duration of mechanical ventilation among survivors (4). There are issues with hypothesis testing with inferential statistics for secondary endpoints (especially mortality) that can be misleading to readers. The authors grossly underestimated the anticipated variability for the primary endpoint—although the observed difference in the primary endpoint was well higher than predicated (11 days, compared with 2 days in the sample size calculation), the trial was still dramatically underpowered for this massive effect size due primarily to the SD being three- to fourfold higher than predicted. Most secondary endpoints are implicitly composite with mortality (days alive without X), but without reporting the treatment effect in survivors, which leads to an overcounting of mortality across most of these endpoints. Also, in critical care studies, how mortality is handled is critically important for many physiological and clinical secondary endpoints that have high potential for survivor biases, in this case sequential organ failure assessment scores. Using survivor average causal effects is one way to mitigate, estimating the effect of the intervention in patients who would have survived both groups (5). Finally, the lack of biomarker endpoints such as markers of endotheliopathy and lung injury represents a missed opportunity to understand the underlying biological effect, which can be critically helpful in these smaller phase IIa trials Clinicians may have difficulties in making sense of results of small clinical trials. Much of the confusion arises from wrong interpretation of P values and an underappreciation of issues that arise from small samples and secondary endpoints. Publication bias, or the higher likelihood of journals to accept “positive” appearing clinical trials instead of null results, further increases the likelihood of readers seeing such studies and creates a greater potential for type 1 error where promising phase II trials are often not confirmed in larger trials. The trial by Johansson and colleagues presents exciting data for primary and secondary endpoints; there is an important numerical difference for days alive and free of mechanical ventilation and mortality (the difference in mortality, 9 of 41 [21.9%] vs. 17 of 39 [43.6%], which may render P values <0.05 depending on the method used for analysis [6]). These results by no means are definitive, as acknowledged by the authors. A word of caution is needed when interpreting “positive” results, such as may be the case for mortality in this trial. At such a low sample size, a favorable result would be associated with an important magnification of the effect size (magnification—Type M—error, shown in Figure 1A, based on Reference 2). If date is confronted with regularizing information under a Bayesian framework that makes the model skeptic to effect sizes (in odds ratio scale) <0.5 or >2.0 (7), the overall probability of benefit remains <0.90 (Figure 1B).
Figure 1.

(A) Design plot assuming standard difference between groups of 0.102. For different values of the possible difference in mortality (x-axis), the magnification effect is presented. Unless the actual difference in mortality is higher than a 4% result, the trial sample size would have magnified the effect size by a factor >1.5. (B) Trial results for hospital mortality under a Bayesian framework, assuming a neutral moderate strength skeptical prior for the intervention. The overall probability of benefit is (left of the dashed line) ∼88%, which warrants further investigation.

(A) Design plot assuming standard difference between groups of 0.102. For different values of the possible difference in mortality (x-axis), the magnification effect is presented. Unless the actual difference in mortality is higher than a 4% result, the trial sample size would have magnified the effect size by a factor >1.5. (B) Trial results for hospital mortality under a Bayesian framework, assuming a neutral moderate strength skeptical prior for the intervention. The overall probability of benefit is (left of the dashed line) ∼88%, which warrants further investigation. The authors should be commended for running this trial in such challenging times, the substance of the biological plausibility, and the willingness to provide further data on an important topic. Methodological issues from the trial, especially on analyses, preclude more conclusive findings. Although neither safety nor efficacy can be clearly concluded from a trial with such a small sample size and interpretation should be approached with appropriate skepticism, the necessity of a larger trial, perhaps on acute respiratory distress syndrome lato sensu, is fully justified by present data.
  7 in total

1.  An estimator for treatment comparisons among survivors in randomized trials.

Authors:  Douglas Hayden; Donna K Pauler; David Schoenfeld
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

2.  The mid p-value in exact tests for Hardy-Weinberg equilibrium.

Authors:  Jan Graffelman; Victor Moreno
Journal:  Stat Appl Genet Mol Biol       Date:  2013-08

3.  Beyond Power Calculations: Assessing Type S (Sign) and Type M (Magnitude) Errors.

Authors:  Andrew Gelman; John Carlin
Journal:  Perspect Psychol Sci       Date:  2014-11

4.  How we design feasibility studies.

Authors:  Deborah J Bowen; Matthew Kreuter; Bonnie Spring; Ludmila Cofta-Woerpel; Laura Linnan; Diane Weiner; Suzanne Bakken; Cecilia Patrick Kaplan; Linda Squiers; Cecilia Fabrizio; Maria Fernandez
Journal:  Am J Prev Med       Date:  2009-05       Impact factor: 5.043

Review 5.  Reappraisal of Ventilator-Free Days in Critical Care Research.

Authors:  Nadir Yehya; Michael O Harhay; Martha A Q Curley; David A Schoenfeld; Ron W Reeder
Journal:  Am J Respir Crit Care Med       Date:  2019-10-01       Impact factor: 21.405

6.  Using Bayesian Methods to Augment the Interpretation of Critical Care Trials. An Overview of Theory and Example Reanalysis of the Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial.

Authors:  Fernando G Zampieri; Jonathan D Casey; Manu Shankar-Hari; Frank E Harrell; Michael O Harhay
Journal:  Am J Respir Crit Care Med       Date:  2021-03-01       Impact factor: 21.405

7.  Prostacyclin in Intubated Patients with COVID-19 and Severe Endotheliopathy: A Multicenter, Randomized Clinical Trial.

Authors:  Pär I Johansson; Peter Søe-Jensen; Morten H Bestle; Niels E Clausen; Klaus T Kristiansen; Theis Lange; Jakob Stensballe; Anders Perner
Journal:  Am J Respir Crit Care Med       Date:  2022-02-01       Impact factor: 21.405

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.