| Literature DB >> 33270526 |
Fernando G Zampieri1,2, Jonathan D Casey3, Manu Shankar-Hari4,5, Frank E Harrell6, Michael O Harhay7,8,9.
Abstract
Most randomized trials are designed and analyzed using frequentist statistical approaches such as null hypothesis testing and P values. Conceptually, P values are cumbersome to understand, as they provide evidence of data incompatibility with a null hypothesis (e.g., no clinical benefit) and not direct evidence of the alternative hypothesis (e.g., clinical benefit). This counterintuitive framework may contribute to the misinterpretation that the absence of evidence is equal to evidence of absence and may cause the discounting of potentially informative data. Bayesian methods provide an alternative, probabilistic interpretation of data. The reanalysis of completed trials using Bayesian methods is becoming increasingly common, particularly for trials with effect estimates that appear clinically significant despite P values above the traditional threshold of 0.05. Statistical inference using Bayesian methods produces a distribution of effect sizes that would be compatible with observed trial data, interpreted in the context of prior assumptions about an intervention (called "priors"). These priors are chosen by investigators to reflect existing beliefs and past empirical evidence regarding the effect of an intervention. By calculating the likelihood of clinical benefit, a Bayesian reanalysis can augment the interpretation of a trial. However, if priors are not defined a priori, there is a legitimate concern that priors could be constructed in a manner that produces biased results. Therefore, some standardization of priors for Bayesian reanalysis of clinical trials may be desirable for the critical care community. In this Critical Care Perspective, we discuss both frequentist and Bayesian approaches to clinical trial analysis, introduce a framework that researchers can use to select priors for a Bayesian reanalysis, and demonstrate how to apply our proposal by conducting a novel Bayesian trial reanalysis.Entities:
Keywords: Bayesian; P value; critical care; randomized trials; statistical significance
Mesh:
Year: 2021 PMID: 33270526 PMCID: PMC7924582 DOI: 10.1164/rccm.202006-2381CP
Source DB: PubMed Journal: Am J Respir Crit Care Med ISSN: 1073-449X Impact factor: 21.405
Recommended Guidance for the Selection and Application of a Minimum Set of Priors and Analyses to Be Used in a Bayesian Reanalysis of a Completed Trial
| Defining Priors | |||
|---|---|---|---|
| Prior Belief | Belief Strength | ||
| Weak | Moderate | Strong | |
| Neutral | “I know almost nothing about the intervention and cannot rule out extreme effect sizes.” | “I have no reason to believe the intervention is good or bad, but I am mostly sure I can rule out large effect sizes.” | “I strongly believe the intervention has no effect or a very small effect.” |
| Bayesian analysis will not provide additional information, as the results will converge with results from frequentist approaches | Consider a normal prior centered at an OR of 1 that allows a 0.95 probability that the OR is between 2 and 0.5; that is, Pr(OR < 0.5) = 0.025 and Pr(OR > 2) = 0.025 | Consider a normal prior centered at an OR of 1 that allows a 0.95 probability that the OR is between 1.5 and 1/1.5; that is, Pr(OR < 0.66) = 0.025 and Pr(OR > 1.5) = 0.025 | |
| Example prior distribution: N(0, 5) | Example prior distribution: N(0, 0.355) | Example prior distribution: N(0, 0.205) | |
| Optimistic | “I believe the intervention is good, but there are few data, and I cannot rule out harm.” | “I believe the intervention is good, but I acknowledge there is a nonnegligible chance it may be harmful.” | “I strongly believe the intervention is good and that there is a very low chance that it is harmful.” |
| Consider a normal prior centered at the log of the expected OR for the intervention with variance set to allow at least 0.30 probability of Pr(OR > 1) | Consider a normal prior centered at the log of the expected OR for the intervention with variance set to allow at least a 0.15 probability of Pr(OR > 1) | Only useful in special cases | |
| Consider a normal prior centered at the log of the expected OR for the intervention with variance set to allow at least 0.05 probability of Pr(OR > 1) | |||
| Pessimistic | “I believe the intervention is harmful, but there are few data, and I cannot rule out eventual benefit.” | “I believe the intervention is harmful, but I acknowledge there is a nonnegligible chance it may be beneficial.” | “I strongly believe the intervention is harmful and that there is a very low chance that it is beneficial.” |
| Consider a normal prior centered at the log of the expected OR for the intervention with the variance set to allow at least 0.30 probability of Pr(OR <1 ) | Consider a normal prior centered at log of the expected OR for the intervention with the variance set to allow at least 0.15 probability of Pr(OR < 1) | Only useful in special cases | |
| Consider a normal prior centered at log of the expected OR for the intervention with the variance set to allow at least 0.05 probability of Pr(OR < 1) | |||
Definition of abbreviations: OR = odds ratio; Pr = probability.
For this example, the primary outcome is mortality, so the proportion of the distribution with an OR less than 1.0 [Pr(OR < 1)] is the probability of benefit. Quotes represent a nontechnical statement on what priors mean for clarity.
N means the prior follows a normal distribution with two parameters (mean and SD). Creating a prior requires the selection of the mean of prior distribution (μ, reflecting the prior belief of the intervention as providing benefit, no effect, or harm) and the SDs (σ, the spread of the possible effect sizes around the mean, which is a reflection of the “strength” of that belief). A description of the prior can be summarized as N(μ, σ), which indicates a normal distribution with mean = μ and SD = σ. The prior is for the log(OR) of the intervention.
See Appendix E3 for details.
Suggestions for Selecting Prior Belief Strengths Given Hypothetical Scenarios and Examples from the Critical Care Literature
| Scenario | Neutral Prior Strength Suggestion | Optimistic Prior Strength Suggestion | Pessimistic Trial Strength Suggestion |
|---|---|---|---|
| Little to no information previously available | Weak | Weak | Weak |
| Example: most trials run in the COVID-19 pandemic | |||
| Conflicting evidence, with some trials showing benefit and others pointing toward harm | Moderate (“skeptical” prior) | Moderate | Moderate |
| Example: EOLIA trial ( | |||
| Evidence pointing toward benefit (for example, positive previous metanalysis). No outliers in previous literature. Usually occurs for trials designed to confirm benefit | Moderate (“skeptical” prior) | Moderate | Weak |
| Example: ART trial ( | |||
| Evidence pointing toward benefit (for example, previous metanalysis). Presence of outliers (one or few studies) pointing toward an opposite direction | Moderate (“skeptical” prior) | Moderate | Moderate |
| Consecrated intervention deemed to be beneficial above reasonable doubt inside the medical community | Moderate (“skeptical” prior) | Strong | Weak |
| Example: assessing the effects of proton-pump inhibitors to avoid gastric bleeding using data from the SUP-ICU trial ( | |||
| Interventions with a very low rationale of exerting a direct effect on a given outcome, but data is available | Strong (“very skeptical”) | Weak | Weak |
| Example: mortality outcome in the PEPTIC trial ( | |||
| Several previous trials reporting neutral results, sometimes reaching futility thresholds on trial sequential analysis | Strong (“very skeptical”) | Weak | Weak |
Definition of abbreviations: ART = Alveolar Recruitment for Acute Respiratory Stress Syndrome; COVID-19 = coronavirus disease; EOLIA = Extracorporeal Membrane Oxygenation to Rescue Lung Injury in Severe Acute Respiratory Distress Syndrome; PEPTIC = Proton Pump Inhibitors versus Histamine-2 Receptor Blockers for Ulcer Prophylaxis Treatment in the ICU; SUP-ICU = Stress Ulcer Prophylaxis in the ICU.
Figure 1.Posterior distribution of the log odds ratio (OR) in ART (Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial) using a “flat” prior. The distribution represents 100,000 draws from the posterior, which approximates to a normal distribution with a mean of 0.24 and an SD of 0.13. The vertical line at 0 represents the point at which the OR is equal to 1 [i.e., log(OR) = 0]. The area to the right (in orange) represents the probability that the intervention is harmful (0.97 probability). The probability of severe harm [Pr(OR > 1.25)] is shown in dark orange and is equal to 0.54. Values <0 mean the intervention is beneficial [Pr(log(OR) < 0); Pr(OR < 1.0)] and are shown in light blue (which equals 0.03). The ROPE is defined as the OR between 1/1.1 and 1.1 (vertically hatched area) and is 0.14. A similar figure with the OR on the x-axis is shown in Figure E5 for comparison. All these findings provide compelling evidence against the experimental treatment even in the context of a flat prior. Pr = probability; ROPE = region of practical equivalence.
Results of a Bayesian Reanalysis of the ART trial (27) Using the Reanalysis Framework Developed in This Manuscript
| Prior: Mean (SD) | OR (95% CrI) | Difference in OR vs. Skeptical Prior (95% CrI) | Difference in OR vs. Optimistic Prior (95% CrI) | Difference in OR vs. Pessimistic Prior (95% CrI) | Probability of Harm; Pr(OR > 1) | Probability of Important Benefit; Pr(OR < 1/1.25) | Probability of Important Harm; | ROPE; | Interpretation |
|---|---|---|---|---|---|---|---|---|---|
| Skeptical: 0 (0.355) | 1.24 (0.98 to 1.55) | — | 0.03 (0.02 to 0.04) | −0.05 (−0.06 to −0.03) | 0.956 | 0.00 | 0.465 | 0.168 | When assuming a moderate strength neutral prior (“skeptical” prior), the probability of harm of the intervention is more than 0.95. There is also an important probability that the intervention is very harmful, following the chosen definition of severe harm (an OR >1.25). |
| Optimistic: −0.41 (0.40) | 1.19 (0.95 to 1.51) | −0.03 (−0.04 to −0.02) | — | −0.08 (−0.10 to −0.06) | 0.936 | 0.00 | 0.348 | 0.255 | Even when assuming a moderate strength optimistic prior, the probability of harm of the intervention is greater than 0.90. The probability of severe harm remains clinically relevant at 0.35, and there is only a probability of 1 in 4 that the intervention is within the defined limits of equivalence. |
| Pessimistic: 0.41 (0.80) | 1.28 (1.01 to 1.62) | 0.05 (0.03 to 0.06) | 0.08 (0.06 to 0.10) | — | 0.971 | 0.00 | 0.563 | 0.127 | When assuming a weak strength pessimistic prior, the probability of harm of the intervention is very high. Not only is the intervention probably harmful under these assumptions, but the probability of severe harm is greater than 0.50. |
Definition of abbreviations: ART = Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial; CrI = credible interval; OR = odds ratio; Pr = probability; ROPE = region of practical equivalence.
Data are shown on the log scale, with negative values meaning OR <1 and positive values meaning OR >1.
Difference obtained by sampling the posterior OR distribution.
Please note that this is a suggestion. “Significant harm” is subjective and should be tailored to the scenario.
Please note that this is a suggestion. “Equivalence” is subjective and should be tailored to the scenario.
Figure 2.Reinterpretation of ART (Alveolar Recruitment for Acute Respiratory Distress Syndrome Trial). Priors were set following the suggested principles outlined in the main manuscript using optimistic, skeptical, and pessimistic priors of moderate strength at (A) N(0, 0.355), (B) N(−0.44, 0.40), and (C) N(0.44, 0.80). Priors are shown in dashed lines. For each selected prior, the black line shows the posterior distribution of the odds ratio (OR). The probability of significant harm [Pr(OR > 1.25)] is filled in red (values in Table 3). The ROPE, defined as an OR between 1/1.1 and 1.1, is filled in blue. ROPE = region of practical equivalence.