Angelo Del Parigi1, Wenbo Tang1, Dacheng Liu1, Christopher Lee2, Richard Pratley3. 1. Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, USA. 2. Boehringer Ingelheim GmbH & Co. KG, Ingelheim, Germany. 3. Florida Hospital Diabetes Institute, AdventHealth Translational Research Institute for Metabolism and Diabetes, 301 Princeton Ave, Orlando, FL, 32804, USA. Richard.Pratley@flhosp.org.
Abstract
INTRODUCTION: Outcomes in type 2 diabetes mellitus (T2DM) could be optimized by identifying which treatments are likely to produce the greatest improvements in glycemic control for each patient. OBJECTIVES: We aimed to identify patient characteristics associated with achieving and maintaining a target glycated hemoglobin (HbA1c) of ≤ 7% using machine learning methodology to analyze clinical trial data on combination therapy for T2DM. By applying a new machine learning methodology to an existing clinical dataset, the practical application of this approach was evaluated and the potential utility of this new approach to clinical decision making was assessed. METHODS: Data were pooled from two phase III, randomized, double-blind, parallel-group studies of empagliflozin/linagliptin single-pill combination therapy versus each monotherapy in patients who were treatment-naïve or receiving background metformin. Descriptive analysis was used to assess univariate associations between HbA1c target categories and each baseline characteristic. After the descriptive analysis results, a machine learning analysis was performed (classification tree and random forest methods) to estimate and predict target categories based on patient characteristics at baseline, without a priori selection. RESULTS: In the descriptive analysis, lower mean baseline HbA1c and fasting plasma glucose (FPG) were both associated with achieving and maintaining the HbA1c target. The machine learning analysis also identified HbA1c and FPG as the strongest predictors of attaining glycemic control. In contrast, covariates including body weight, waist circumference, blood pressure, or other variables did not contribute to the outcome. CONCLUSIONS: Using both traditional and novel data analysis methodologies, this study identified baseline glycemic status as the strongest predictor of target glycemic control attainment. Machine learning algorithms provide an hypothesis-free, unbiased methodology, which can greatly enhance the search for predictors of therapeutic success in T2DM. The approach used in the present analysis provides an example of how a machine learning algorithm can be applied to a clinical dataset and used to develop predictions that can facilitate clinical decision making.
RCT Entities:
INTRODUCTION: Outcomes in type 2 diabetes mellitus (T2DM) could be optimized by identifying which treatments are likely to produce the greatest improvements in glycemic control for each patient. OBJECTIVES: We aimed to identify patient characteristics associated with achieving and maintaining a target glycated hemoglobin (HbA1c) of ≤ 7% using machine learning methodology to analyze clinical trial data on combination therapy for T2DM. By applying a new machine learning methodology to an existing clinical dataset, the practical application of this approach was evaluated and the potential utility of this new approach to clinical decision making was assessed. METHODS: Data were pooled from two phase III, randomized, double-blind, parallel-group studies of empagliflozin/linagliptin single-pill combination therapy versus each monotherapy in patients who were treatment-naïve or receiving background metformin. Descriptive analysis was used to assess univariate associations between HbA1c target categories and each baseline characteristic. After the descriptive analysis results, a machine learning analysis was performed (classification tree and random forest methods) to estimate and predict target categories based on patient characteristics at baseline, without a priori selection. RESULTS: In the descriptive analysis, lower mean baseline HbA1c and fasting plasma glucose (FPG) were both associated with achieving and maintaining the HbA1c target. The machine learning analysis also identified HbA1c and FPG as the strongest predictors of attaining glycemic control. In contrast, covariates including body weight, waist circumference, blood pressure, or other variables did not contribute to the outcome. CONCLUSIONS: Using both traditional and novel data analysis methodologies, this study identified baseline glycemic status as the strongest predictor of target glycemic control attainment. Machine learning algorithms provide an hypothesis-free, unbiased methodology, which can greatly enhance the search for predictors of therapeutic success in T2DM. The approach used in the present analysis provides an example of how a machine learning algorithm can be applied to a clinical dataset and used to develop predictions that can facilitate clinical decision making.
Authors: Alan J Garber; Martin J Abrahamson; Joshua I Barzilay; Lawrence Blonde; Zachary T Bloomgarden; Michael A Bush; Samuel Dagogo-Jack; Ralph A DeFronzo; Daniel Einhorn; Vivian A Fonseca; Jeffrey R Garber; W Timothy Garvey; George Grunberger; Yehuda Handelsman; Irl B Hirsch; Paul S Jellinger; Janet B McGill; Jeffrey I Mechanick; Paul D Rosenblit; Guillermo E Umpierrez Journal: Endocr Pract Date: 2019-01 Impact factor: 3.443
Authors: Beverley Balkau; Françoise Calvi-Gries; Nick Freemantle; Maya Vincent; Valerie Pilorget; Philip D Home Journal: Diabetes Res Clin Pract Date: 2015-03-12 Impact factor: 5.602
Authors: Andrew Lewin; Ralph A DeFronzo; Sanjay Patel; Dacheng Liu; Renee Kaste; Hans J Woerle; Uli C Broedl Journal: Diabetes Care Date: 2015-01-29 Impact factor: 19.112
Authors: Jeffrey P Anderson; Jignesh R Parikh; Daniel K Shenfeld; Vladimir Ivanov; Casey Marks; Bruce W Church; Jason M Laramie; Jack Mardekian; Beth Anne Piper; Richard J Willke; Dale A Rublee Journal: J Diabetes Sci Technol Date: 2015-12-20