Lukas Aguirre Dávila1, Kristina Weber1, Udo Bavendiek2, Johann Bauersachs2, Janet Wittes3, Salim Yusuf4, Armin Koch1. 1. Institute for Biostatistics, Hannover Medical School, Carl-Neuberg Str. 1, 30625 Hannover, Germany. 2. Department of Cardiology and Angiology, Hannover Medical School, Hannover, Germany. 3. Statistics Collaborative, Inc., Washington, DC, USA. 4. Population Health Institute, Hamilton Health Sciences and McMaster University, Hamilton, Ontario, Canada.
Abstract
AIMS: The Digitalis Investigation Group (DIG) trial, the only large randomized trial of digoxin in heart failure, reported a neutral effect on mortality and a significant reduction in heart failure hospitalizations. Recent observational studies reported increased mortality with digoxin treatment. We present further analyses of the DIG trial displaying the inability to control bias in observational treatment comparisons despite extensive statistical adjustments. METHODS AND RESULTS: Forty-four percent of the 6800 patients in the DIG trial had been treated with digoxin before randomization, and half of them were randomly withdrawn from digoxin treatment. We contrast the main randomization-based result of the DIG trial with the observational non-randomized comparison of patients pre-treated or not pre-treated with digoxin. Mortality [hazard ratio (HR) 1.22, 95% confidence interval (CI) 1.12-1.34; P < 0.001] and heart failure hospitalizations (HR 1.47, 95% CI 1.33-1.61; P < 0.001) were significantly higher in patients pre-treated with digoxin even after adjustment for baseline population differences. The higher risks for both outcomes in those who had previously received digoxin persisted even if they received placebo during the trial (HR 1.24, 95% CI 1.10-1.40; P < 0.001). This sharply contradicts the neutral effect on mortality and the significant reduction in heart failure hospitalizations observed in the randomized comparison. CONCLUSION: Prescription of digoxin is an indicator of disease severity and worse prognosis, which cannot be fully accounted for by covariate adjustments in the DIG trial where patients were well-characterized. It is unlikely that weaker research approaches (observational studies of administrative data or registries) can provide more reliable estimates of the effects of cardiac glycosides.
RCT Entities:
AIMS: The Digitalis Investigation Group (DIG) trial, the only large randomized trial of digoxin in heart failure, reported a neutral effect on mortality and a significant reduction in heart failure hospitalizations. Recent observational studies reported increased mortality with digoxin treatment. We present further analyses of the DIG trial displaying the inability to control bias in observational treatment comparisons despite extensive statistical adjustments. METHODS AND RESULTS: Forty-four percent of the 6800 patients in the DIG trial had been treated with digoxin before randomization, and half of them were randomly withdrawn from digoxin treatment. We contrast the main randomization-based result of the DIG trial with the observational non-randomized comparison of patients pre-treated or not pre-treated with digoxin. Mortality [hazard ratio (HR) 1.22, 95% confidence interval (CI) 1.12-1.34; P < 0.001] and heart failure hospitalizations (HR 1.47, 95% CI 1.33-1.61; P < 0.001) were significantly higher in patients pre-treated with digoxin even after adjustment for baseline population differences. The higher risks for both outcomes in those who had previously received digoxin persisted even if they received placebo during the trial (HR 1.24, 95% CI 1.10-1.40; P < 0.001). This sharply contradicts the neutral effect on mortality and the significant reduction in heart failure hospitalizations observed in the randomized comparison. CONCLUSION: Prescription of digoxin is an indicator of disease severity and worse prognosis, which cannot be fully accounted for by covariate adjustments in the DIG trial where patients were well-characterized. It is unlikely that weaker research approaches (observational studies of administrative data or registries) can provide more reliable estimates of the effects of cardiac glycosides.
Numerous publications have reported conflicting results regarding the benefit of cardiac glycosides. Observational studies, analyses of registries (e.g. RIKS-HIA, SCAF), and secondary analyses of randomized clinical trials done for other purposes (AFFIRM, SPORTIF III and V, RACE II, SOLVD, Val-HeFT, ROCKET-AF, ENGAGE AF-TIMI 48, and ARISTOTLE) have reported increases, decreases, and neutral effects on mortality (sometimes based on analyses of the same data by different investigators). Some of these have been published and discussed in this journal, as well.,, These analyses compare patients treated with those not treated with cardiac glycosides and use statistical models to adjust for differences in characteristics of the respective populations. The Digitalis Investigation Group (DIG) trial, the only large randomized clinical trial evaluating the efficacy of digoxin in patients with heart failure, as evidenced by a recent Cochrane Review reported a neutral effect on mortality and a significant reduction in heart failure hospitalizations.A recent review argued that increases in mortality in patients treated with digoxin in observational data are most likely driven by ‘prescription bias’: clinical deterioration leads the treating physician to prescribe an additional drug and consequently sicker patients are more likely treated than healthier patients. The conclusions of the aforementioned observational studies thus rely on the assumption that adjustment for baseline covariates appropriately addresses these prognostic differences.Our analysis of the DIG trial demonstrates that this assumption does not necessarily hold true: we demonstrate the existence of prescription bias after adjustment for baseline covariates in a situation where the ‘true treatment effect’ in the same data is known to be neutral for mortality and beneficial regarding heart failure hospitalizations.
Methods
The primary aim of this analysis is to assess whether adjustment in an observational analysis can lead to the same treatment effect estimate as the randomization-based analysis of the DIG trial.The design and primary results of the DIG trial (NCT00000476) have been previously published., Interestingly, 44% of the 6800 study patients had been treated with digoxin prior to study start and were randomized to either continue digoxin or receive a placebo treatment instead (randomized withdrawal). Knowing that digoxin had no effect on overall mortality in the randomized trial, we compare the outcome among patients previously treated with digoxin with the outcome among those who were not treated with digoxin before the start of the study. We contrast this observational result with the neutral effect estimated in the randomization-based approach. Obviously, this observational analysis assumes that the randomized treatment added a similar effect to the average outcome in both subgroups. Therefore, we repeat the analysis (i) in patients randomized to digoxin and (ii) in patients randomized to placebo.In both subpopulations (i) and (ii), pre-treated and not pre-treated patients were identically treated during the trial. Therefore, observed differences between patients previously treated and those newly treated with digoxin or placebo cannot be due to digoxin unless a potential withdrawal or add-on effect is present. These comparisons are ‘observational’ in that they are not based on the randomization in the trial.We expected that—if the observational comparison was valid—the effect of digoxin should be the same irrespective of whether estimated from the randomized or the observational comparison after adjustment for population differences. The effect of digoxin is estimated with Cox proportional hazards regression in both approaches. In the observational approach (previous digoxin use vs. no previous digoxin use), the respective models are further adjusted for prognostic differences between patients pre-treated with digoxin and those not pre-treated. The following variables are included in the adjusted observational model (a P-value <0.2 in baseline comparisons was used as a criterion for inclusion into the model, see Supplementary Appendix for the respective P-values): age (years), sex, race, systolic blood pressure (mmHg), diastolic blood pressure (mmHg), body mass index (kg/m2), potassium serum level (mmol/L), ischaemic primary cause of congestive heart failure, duration of congestive heart failure (months), ejection fraction (%), chest X-ray (cardiothoracic ratio) ≤0.55, New York Heart Association (NYHA) classification III or IV, current angina, rales, elevated jugular venous pressure, peripheral oedema, dyspnoea at rest, dyspnoea at exertion, limitation of activity, S3, pulmonary congestion, previous myocardial infarction, history of hypertension, angiotensin-converting enzyme inhibitor use, hydralazine use, diuretics use, recommended digoxin dose (mg/day), and prescribed dose of study medication (mg/day). The model also includes the randomized treatment, as it may in principle alter prognosis. All analyses are performed using the SAS 9.3. Forest plots are created with R 3.3.1 using the forestplot package.
Results
Baseline variables were well balanced between the two randomized treatment groups, but patients previously treated with digoxin systematically differed from patients not previously treated regarding many important baseline variables, especially for heart failure symptoms (e.g. rales, dyspnoea, pulmonary congestion), clinical features of heart failure (e.g. NYHA classification, ejection fraction, duration of heart failure), and use of heart failure medication (diuretics, hydralazine). Figure illustrates the good balance of characteristics between those randomized to receive digoxin vs. placebo. In contrast, patients previously treated with digoxin had more frequently markers of advanced heart failure than those not previously treated with digoxin.Standardized baseline differences (difference between groups/pooled standard deviation) in the randomized comparison (blue circle) and the observational comparison (red square). Baseline characteristics are balanced between randomized treatment groups, but patients previously treated with digoxin had more advanced heart failure than previously untreated patients and standardized differences are no longer close to 0. Data from the DIG trial.Mortality was significantly higher in patients treated with digoxin before randomization. A total of 1207 (40.0%) and 1168 (30.9%) deaths occurred in patients previously treated and those not previously treated with digoxin, respectively [hazard ratio (HR) 1.36, 95% confidence interval (CI) 1.25–1.47; P < 0.001]. The magnitude of the increase in mortality was similar in patients randomized to placebo and those randomized to digoxin [placebo: 611 (40.2%) and 583 (30.9%) deaths in pre-treated and not pre-treated patients, HR 1.38, 95% CI 1.23–1.54; P < 0.001, digoxin: 596 (39.8%) and 585 (30.8%) deaths in pre-treated and not pre-treated patients, HR 1.34, 95% CI 1.20–1.50; P < 0.001] (Figure ).Mortality and hospitalizations for heart failure are increased in patients previously treated with digoxin, irrespective of whether treated with digoxin or placebo in the DIG trial. The randomized treatment effect estimate is included as reference.Adjustment for baseline population differences does not explain the observed increased mortality. As we explored a variety of adjustments, differences in mortality between previously treated and not previously treated patients became smaller but persisted. For example, in all treated patients, adjustment for prognostic differences in baseline covariates reduced the HR from 1.36 to 1.22 (95% CI 1.12–1.34, P < 0.001); this reduction was of the same magnitude in the subpopulations of patients randomized to placebo or to digoxin (Figure ).As reported previously, digoxin in the randomized DIG trial had no overall effect on all-cause mortality; this neutral effect is consistently observed among those in whom digoxin was withdrawn [HR for digoxin vs. placebo in previously treated patients: 0.98, 95% CI (0.87–1.09); P = 0.67] as well as among those starting de novo therapy [HR for digoxin vs. placebo in previously untreated patients: 1.00, 95% CI (0.90–1.13); P = 0.94] (Figure ).The effect of digoxin on mortality and on hospitalizations for heart failure overall and in subgroups of pre-treated and not pre-treated patients.Observational results were similar when time to hospitalization for heart failure was analysed: worse prognosis of patients pre-treated with digoxin led to a significant increase in the risk for heart failure hospitalizations that could not be accounted for with adjustment for population differences (adjusted HR 1.47, 95% CI 1.33–1.61; P < 0.001). Again, findings were similar irrespective of whether patients were treated with digoxin or placebo. This observational result is diametrically opposite to the results of the randomized comparison which indicated a significant reduction of hospitalizations for heart failure with digoxin (HR 0.72, 95% CI 0.66–0.79; P < 0.001) (Figure ).
Discussion
Our analysis provides evidence of prescription bias: We demonstrate that prognostic differences between patients pre-treated and not pre-treated with digoxin were so pronounced that they could not be appropriately addressed with statistical adjustment for baseline covariates. Mortality and risk for heart failure hospitalizations remained increased in those pre-treated with digoxin, even if treated with placebo in the trial. Both findings sharply contrast the results of the randomized comparison which indicated that digoxin had a neutral effect on mortality but substantially decreased heart failure hospitalizations. This implies that important prognostic variables are unmeasured. In fact, the size of prescription bias (the difference in estimate of effects between the randomized and the observational comparisons in heart failure hospitalizations) is much larger than the true effect of treatment. Given that the potential benefits of most treatments are likely to be moderate, biases in observational studies may far exceed these. Bias in this analysis is of the same magnitude as pooled estimates of other observational analyses provided in current meta-analyses (Figure ). The results of this analysis cast doubts on many of the recently presented observational analyses indicating harm from digoxin treatment.Results from the DIG trial in the context of current meta-analyses. Prescription bias in the DIG trial is of the same size as pooled estimates from observational data.In contrast to other analyses, our observational analysis is based on a randomized clinical trial that was specifically designed to assess the effects of digoxin on mortality. It is not plausible to assume that eventually more sophisticated analyses of data recorded for other purposes can produce more reliable conclusions concerning the treatment effect of digoxin. To our best knowledge, in all the aforementioned observational analyses treatment with cardiac glycosides should be interpreted as an indicator of advanced heart failure but increased mortality should not be interpreted as effect of treatment. Along the same lines a recent review argued that bias in observational analyses of treatment is not limited to digoxin in heart failure with reduced ejection fraction, but may occur in other indications, as well.The current ESC heart failure guidelines acknowledge the controversy about potential increases in mortality with digoxin treatment based on observational studies. Recommendations regarding digoxin treatment in heart failure are still based on the DIG trial and the respective additional analyses. This reflects that the DIG trial is still considered best evidence for treatment with cardiac glycosides. The current guideline recommendation is to consider digoxin in patients with heart failure and reduced ejection fraction in sinus rhythm if symptoms persist even after combinations of other treatments.Our results have wider implications beyond the case of digoxin: We present an example that proves the existence of prescription bias, which cannot be accounted for in the statistical analysis. This again challenges the role of observational studies to formally proof (or disprove) efficacy of medical treatments. In our example, not even the direction of the treatment effect was estimated correctly in the observational comparison (). In recent times many publications are purporting to show the effects of a new therapy (usually a drug) in ‘real world’ data analyses, however, caution should be exercised when interpreting such data.The ‘best’ observational estimate of the effect of digoxin on mortality and heart failure hospitalization does not match the randomization-based estimate in the DIG trial.
Limitations
The DIG trial was conducted several years ago and both diagnosis and treatment of heart failure have changed. Well planned clinical trials document those variables that are needed to describe the patients expected to benefit from treatment and this may depend on the drug but also on the current understanding of the disease. In more recent data, information on biomarkers such as N-terminal prohormone of brain natriuretic peptide may allow better adjustment and obviously the treatment effect at the time the DIG-trial has been conducted may be different from what would be observed today. Nonetheless, data from the DIG trial is of high quality and ‘residual bias’ is substantial. We doubt that other data will overcome prescription bias.
Conclusion
Our result, derived in the context of digoxin, emphasizes the critical importance of randomized clinical trials for evaluating treatments.Click here for additional data file.
Authors: Jeffrey B Washam; Susanna R Stevens; Yuliya Lokhnygina; Jonathan L Halperin; Günter Breithardt; Daniel E Singer; Kenneth W Mahaffey; Graeme J Hankey; Scott D Berkowitz; Christopher C Nessel; Keith A A Fox; Robert M Califf; Jonathan P Piccini; Manesh R Patel Journal: Lancet Date: 2015-03-06 Impact factor: 79.321
Authors: Bart A Mulder; Dirk J Van Veldhuisen; Harry J G M Crijns; Jan G P Tijssen; Hans L Hillege; Marco Alings; Michiel Rienstra; Maarten P Van den Berg; Isabelle C Van Gelder Journal: Heart Rhythm Date: 2014-06-09 Impact factor: 6.343
Authors: Piotr Ponikowski; Adriaan A Voors; Stefan D Anker; Héctor Bueno; John G F Cleland; Andrew J S Coats; Volkmar Falk; José Ramón González-Juanatey; Veli-Pekka Harjola; Ewa A Jankowska; Mariell Jessup; Cecilia Linde; Petros Nihoyannopoulos; John T Parissis; Burkert Pieske; Jillian P Riley; Giuseppe M C Rosano; Luis M Ruilope; Frank Ruschitzka; Frans H Rutten; Peter van der Meer Journal: Eur Heart J Date: 2016-05-20 Impact factor: 29.983
Authors: Alon Eisen; Christian T Ruff; Eugene Braunwald; Rose A Hamershock; Basil S Lewis; Christian Hassager; Tze-Fan Chao; Jean Yves Le Heuzey; Michele Mercuri; Howard Rutman; Elliott M Antman; Robert P Giugliano Journal: J Am Heart Assoc Date: 2017-06-30 Impact factor: 5.501
Authors: Oliver J Ziff; Deirdre A Lane; Monica Samra; Michael Griffith; Paulus Kirchhof; Gregory Y H Lip; Richard P Steeds; Jonathan Townend; Dipak Kotecha Journal: BMJ Date: 2015-08-30
Authors: Milton Packer; James L Januzzi; Joao Pedro Ferreira; Stefan D Anker; Javed Butler; Gerasimos Filippatos; Stuart J Pocock; Martina Brueckmann; Waheed Jamal; Daniel Cotton; Tomoko Iwata; Faiez Zannad Journal: Eur J Heart Fail Date: 2021-06-21 Impact factor: 17.349
Authors: Balázs Muk; Máté Vámos; Péter Bógyi; Barna Szabó; Miklós Dékány; Dénes Vágány; Zsuzsanna Majoros; Tünde Borsányi; Gábor Zoltán Duray; Róbert Gábor Kiss; Noémi Nyolczas Journal: Clin Cardiol Date: 2020-11-03 Impact factor: 2.882