Nick Freemantle1, Sophie Jourdan2. 1. Institute of Clinical Trials and Methodology, University College London, London, UK. 2. Health Economics and Value Assessment, Sanofi, Paris, France.
To the Editor:Tibaldi et al. report results of the propensity‐matched observational CONFIRM study1 in previously insulin‐naïve adults with type 2 diabetes, comparing two second‐generation basal insulins. Using patients' electronic medical records from a large US database, they report an association between treatment with insulin degludec (IDeg) and a reduction in glycated haemoglobin (HbA1c) and hypoglycaemia rates, compared with insulinglargine 300 units/mL (Gla‐300).The results are in marked contrast to previously reported comparisons of these insulins2, 3, 4 and a recent randomized controlled trial (RCT)5 comparing the same basal insulins in broadly the same patient population, which found equivalent effects on both HbA1c and hypoglycaemia over a 6‐month treatment period. Reasonably, we might have expected to see results that were similar at least on a ratio scale. Unfortunately, there are important flaws in the CONFIRM propensity score matching, which probably explain this discrepancy and confound any conclusions that may be drawn from the CONFIRM study.The matching of cohorts in terms of hypoglycaemia prior to insulin initiation is clearly suboptimal (Supporting Information Table S2) and it is this imbalance at baseline, rather than the effects of treatment (as reported), that drives the differences in the final results. Helpfully, Tibaldi et al. have provided the data required to undertake conventional statistical testing of baseline differences, although these tests are not reported in the manuscript. For example, baseline hypoglycaemia rates (events per patient year exposed [PPYE]) for the matched population (IDeg 0.26 events/PPYE vs Gla‐300 0.22 events/PPYE) were significantly different, with a P value of 0.007. Moreover, the population used in the hypoglycaemia analysis (a subgroup representing around two‐thirds of the 4056 supposedly matched patients) had baseline differences that were even greater (IDeg 0.301 events/PPYE vs Gla‐300 0.210 events/PPYE).By presenting the results as a ratio of the pre‐ and post‐treatment rates of hypoglycaemia, the authors effectively ignore the unmatched baseline values and obfuscate the fact that actual “on‐treatment” outcomes were the same (IDeg 0.391 events/PPYE vs Gla‐300 0.389 events/PPYE).Other key patient characteristics differed between the “matched” groups at baseline, including body mass index (P = 0.01) and HbA1c (P = 0.008), and this raises particular concerns about the validity of any conclusions on the primary endpoint. Missing data and analyses conducted on a subset of the matched populations are a major challenge in CONFIRM. We invite the authors to provide baseline data on the subgroup of patients available for the HbA1c analysis to allow proper assessment of their findings.It is completely plausible that these mismatches between the patient groups, rather than differences in the treatments received by the patients, explain the results presented and these imbalances can explain why the CONFIRM results differ from the RCT results.Real‐world studies, which provide data gathered from actual patient experiences and diverse patient populations, are valuable and encouraged.6, 7 However, while maximising external validity (generalisability), real‐world studies will lack internal validity (unlike RCTs) if matching is not successful. Unfortunately, adequate matching is not achieved in the study by Tibaldi et al.
CONFLICT OF INTEREST
NF has received funding for research, travel and consulting from Sanofi Aventis, AstraZeneca, Ipsen, PTC, Tesaro, Takeda, Akcea.SJ is an employee of Sanofi, Paris, France.
Authors: Rachel E Sherman; Steven A Anderson; Gerald J Dal Pan; Gerry W Gray; Thomas Gross; Nina L Hunter; Lisa LaVange; Danica Marinac-Dabic; Peter W Marks; Melissa A Robb; Jeffrey Shuren; Robert Temple; Janet Woodcock; Lilly Q Yue; Robert M Califf Journal: N Engl J Med Date: 2016-12-08 Impact factor: 91.245
Authors: Julio Rosenstock; Alice Cheng; Robert Ritzel; Zsolt Bosnyak; Christine Devisme; Anna M G Cali; Jochen Sieber; Peter Stella; Xiangling Wang; Juan P Frías; Ronan Roussel; Geremia B Bolli Journal: Diabetes Care Date: 2018-08-13 Impact factor: 19.112
Authors: Joseph Tibaldi; Martin Hadley-Brown; Andreas Liebl; Steffen Haldrup; Viktor Sandberg; Michael L Wolden; Helena W Rodbard Journal: Diabetes Obes Metab Date: 2019-01-08 Impact factor: 6.577
Authors: Sean D Sullivan; Timothy S Bailey; Ronan Roussel; Fang Liz Zhou; Zsolt Bosnyak; Ronald Preblick; Jukka Westerbacka; Rishab A Gupta; Lawrence Blonde Journal: Diabetes Obes Metab Date: 2018-06-25 Impact factor: 6.577