| Literature DB >> 28875400 |
Scott K Aberegg1, Andrew M Hersh2, Matthew H Samore3.
Abstract
BACKGROUND: Noninferiority trials are increasingly common, though they have less standardized designs and their interpretation is less familiar to clinicians than superiority trials.Entities:
Mesh:
Year: 2017 PMID: 28875400 PMCID: PMC5756156 DOI: 10.1007/s11606-017-4161-4
Source DB: PubMed Journal: J Gen Intern Med ISSN: 0884-8734 Impact factor: 5.128
Figure 1Simulacrum of the CONSORT diagram for interpreting the results of noninferiority trials. According to CONSORT, noninferiority can be declared whenever the upper bound of the confidence interval of the difference between the two therapies does not include delta, as in scenarios 1–4. Whenever the upper bound of the confidence interval exceeds delta, as in scenarios 5–7, noninferiority cannot be declared, because the plausible values of the parameter include some values greater than delta. When both the upper and lower bounds of the confidence interval exceed delta, the NT is declared inferior to the AC, as in scenario 8. Scenario 1 represents all situations in which the upper bound of the confidence interval is less than zero—that is, any statistically significant result favoring the NT garners a declaration of superiority for the NT. By contrast, in scenarios 4 and 7, where there is a statistically significant difference favoring the AC, the NT is not declared inferior in this schematic, but rather noninferior (scenario 4) or inconclusive (scenario 7). NT, new treatment; AC, active control; Δ, delta (the pre-specified margin of noninferiority).
Figure 2Flow diagram showing the results of our search.
Characteristics of the Trials in Our Cohort
| No. (%) | ||
|---|---|---|
| Journal | NEJM | 64 (39%) |
| Lancet | 63 (39%) | |
| JAMA | 23 (14%) | |
| BMJ | 8 (5%) | |
| Annals | 5 (3%) | |
| Year | June–December 2011 | 12 (7%) |
| 2012 | 25 (15%) | |
| 2013 | 34 (21%) | |
| 2014 | 22 (14%) | |
| 2015 | 43 (26%) | |
| January–October 2016 | 27 (17%) | |
| Top specialties | Infectious diseases | 26% |
| Hematology/oncology | 25% | |
| Cardiology | 17% | |
| Pulmonary/critical care | 15% | |
| Endocrinology | 8% | |
| Primary outcome measure ( | Absolute risk difference | 114 (70%) |
| Mean | 26 (16%) | |
| Hazard ratio | 13 (8%) | |
| Relative risk difference | 8 (5%) | |
| Odds ratio | 2 (1%) | |
| Primary analysis ( | Intention-to-treat | 95 (58%) |
| Modified intention-to-treat | 36 (22%) | |
| Per-protocol | 24 (15%) | |
| As treated | 2 (1%) | |
| Not reported | 6 (4%) | |
| Secondary analysis ( | Intention-to-treat | 14 (9%) |
| Modified intention-to-treat | 7 (4%) | |
| Per-protocol | 79 (50%) | |
| As treated | 10 (6%) | |
| None | 48 (30%) | |
| Outcome is or includes mortality ( | 49 (30%) | |
| Delta for comparisons with proportional outcome measure ( | Mean delta | 0.087 |
| Lowest value | 0.004 | |
| Highest value | 0.25 | |
| Mean delta for outcomes that do not include mortality ( | 0.1 | |
| Lowest value for outcomes that do not include mortality | 0.0057 | |
| Highest value for outcomes that do not include mortality | 0.25 | |
| Mean delta for outcomes that include mortality ( | 0.061 | |
| Lowest value for outcomes that include mortality | 0.004 | |
| Highest value for outcomes that include mortality | 0.19 | |
| Delta justification ( | Not reported | 95 (58%) |
| Vague and non-reproducible | 27 (17%) | |
| Concrete and reproducible | 41 (25%) | |
| Alpha, one-sided equivalent ( | ≤ 0.025 | 105 (66%) |
| 0.05 | 51 (32%) | |
| 0.1 | 4 (2.5%) | |
| Not reported | 3 (2%) | |
| Two-sided hypothesis test ( | 62 (38%) | |
| CONSORT confidence interval categorization ( | 1 (new treatment superior) | 28 (15%) |
| 2 (new treatment noninferior) | 67 (37%) | |
| 3 (new treatment noninferior) | 46 (25%) | |
| 4 (new treatment noninferior, but old treatment statistically better by less than delta) | 3 (2%) | |
| 5 (inconclusive) | 0 (0%) | |
| 6 (inconclusive) | 19 (10%) | |
| 7 (inconclusive, but old treatment statistically better, by less than delta) | 15 (8%) | |
| 8 (new treatment inferior) | 4 (2%) | |
| Advantage of new therapy | Explicitly stated | 114 (70%) |
| Could be inferred | 31 (19%) | |
| Neither stated nor able to be inferred | 18 (11%) |
NEJM, New England Journal of Medicine; Annals, Annals of Internal Medicine
Trials with Statistically Significant Results Disfavoring the New Therapy but Not Considering It Inferior
| First author | Disease | New therapy | Active control | Outcome | Delta | Result (95% CI) |
|---|---|---|---|---|---|---|
| Roberts | Pediatric respiratory failure | High-flow nasal cannula | CPAP | Tx failure | 0.1 | 0.122 (0.06–0.19) |
| Geisler | Chlamydia infection | Azithromycin | Doxycycline | Tx failure | 0.05 | 0.032 (0.004–0.06) |
| Kaul | Coronary disease | Paclitaxel stent | Everolimus stent | Composite, including death | 0.04 | 0.027 (0.01–0.05) |
| Gillespie | Tuberculosis | Moxifloxacin replaces ethambutol | Standard TB Tx | Tx failure | 0.06 | 0.07 (0.03–0.11) |
| Bwakura-Danbarembizi | HIV | Stopping SMX prophylaxis | Continued SMX prophylaxis | Hospitalization or death | 0.03 | 0.06 (0.01–0.11) |
| Stevenson | Rectal cancer | Laparoscopic surgery | Open surgery | Pathological outcomes | 0.08 | 0.07 (0.01–0.13) |
| Salminen | Appendicitis | Antibiotics | Surgery | Tx failure | 0.24 | 0.27 (0.22–0.33) |
| Hooton | Urinary tract infection | Cefpodoxime | Ciprofloxacin | Clinical cure | 0.1 | 0.11 (0.03–0.18) |
| Bachelez | Rheumatoid arthritis | Tofacitinib 5 mg dose | Etanercept | PASI75 score | 0.15 | 0.19 (0.12–0.27) |
| Behringer | Hodgkin’s lymphoma | ABV | ABVD | Tx failure | 0.06 | 0.12 (0.06–0.18) |
| Behringer | Hodgkin’s lymphoma | AVD | ABVD | Tx failure | 0.06 | 0.04 (0.01–0.07) |
| Vaidya | Breast cancer | Targeted XRT | Whole breast XRT | Local recurrence | 0.025 | 0.007 (0.0004–0.014) |
| Buse | Diabetes | Exenatide | Liraglutide | Mean change in glycated hemoglobin | 0.25% | 0.0021 (0.0008–0.0033) |
| Lindson-Hawley | Smoking | Gradual cessation | Abrupt cessation | Smoking cessation | 0.095 | 0.10 (0.03–0.17) |
| Perkins | ACLS training | E-learning | In-person training | Pass rate | 0.05 | 0.06 (0.03–0.09) |
| Mol | Venous thrombosis | 1-year TED hose | 2-year TED hose | Post-thrombotic syndrome rate | 0.1 | 0.07 (0.01–0.13) |
| Gallwitz | Diabetes | Linagliptin | Glimepiride | Mean change in glycated hemoglobin | 0.35% | 0.2 (0.09–0.30) |
| Fishbane | Renal failure | Peginesatide | Epoetin | Mean change in hemoglobin | 1 | 0.15 (0.01–0.30) |
These trials are categorized as scenarios 4 and 7 based on the Figure 1 diagram. Abbreviations: Tx, treatment; CPAP, continuous positive airway pressure; XRT, X-ray therapy; SMX, sulfamethoxazole; ACLS, advanced cardiac life support
*The authors state in the introduction that paclitaxel stents are inferior to everolimus stents in most patients; in an apparent post hoc analysis, they reverse the noninferiority hypothesis and declare everolimus stents superior to paclitaxel stents, a conclusion that they acknowledged was known before the trial was conducted
† Use of a one-sided 95% CI with an upper bound of 0.32 allowed the authors to correctly classify the result as “inconclusive”
‡ One of three CONSORT 4 results where the entire 95% CI lies between 0 and Δ, and the result is considered noninferior
Figure 3Plot of 151 comparisons of absolute risk differences as a function of the log of the total number of patients analyzed in the trial, color coded by the interpretation of the results as recommended by CONSORT. See text for details.
Figure 4Plot of 151 comparisons with a calculable absolute risk difference as in Figure 3, but with statistically significant results in favor of active control (AC), coded as inconclusive or noninferior in Figure 3, denoted by red in Figure 4.
Figure 5The effect of asymmetrical interpretation of noninferiority results. This schematic shows how the conclusions of a noninferiority trial will differ depending upon which agent, NT or AC, is assigned preferential status on the left of the interpretative diagram. The confidence intervals in this diagram are the mirror images of the confidence intervals in Figure 1, but the NT is now on the right, and AC is on the left (favored side) of the diagram. The absolute risk differences between NT and AC are the same as in Figure 1, and the diagram represents the interpretation that would result if the hypothesis were set up in reverse, to test the noninferiority of AC to NT. The top confidence interval shows a statistically significant difference favoring the new treatment, but instead of a conclusion of superiority (designation 1 in Fig. 1), the result is inconclusive because the upper bound of the confidence interval crosses delta. For the second and third confidence intervals, the conclusion of noninferiority does not change. For the fourth confidence interval, the prior designation 4 (NT is noninferior to AC) becomes a designation 1 (AC superior to NT). The designation of the fifth confidence interval does not change, but the sixth, previously designated inconclusive, becomes a noninferior result for AC. The seventh and eighth confidence intervals, previously showing inconclusive and inferior results for NT, are now designated superior results for AC. Note that the experimental results have not changed—only the assignment of one agent to preferential status on the left of the diagram. Among the eight confidence intervals, four conclusions are materially changed when preferential status is changed.