Literature DB >> 35022756

Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees.

Jean Feng1, Alexej Gossmann2, Berkman Sahiner2, Romain Pirracchio3.   

Abstract

OBJECTIVE: After deploying a clinical prediction model, subsequently collected data can be used to fine-tune its predictions and adapt to temporal shifts. Because model updating carries risks of over-updating/fitting, we study online methods with performance guarantees.
MATERIALS AND METHODS: We introduce 2 procedures for continual recalibration or revision of an underlying prediction model: Bayesian logistic regression (BLR) and a Markov variant that explicitly models distribution shifts (MarBLR). We perform empirical evaluation via simulations and a real-world study predicting Chronic Obstructive Pulmonary Disease (COPD) risk. We derive "Type I and II" regret bounds, which guarantee the procedures are noninferior to a static model and competitive with an oracle logistic reviser in terms of the average loss.
RESULTS: Both procedures consistently outperformed the static model and other online logistic revision methods. In simulations, the average estimated calibration index (aECI) of the original model was 0.828 (95%CI, 0.818-0.938). Online recalibration using BLR and MarBLR improved the aECI towards the ideal value of zero, attaining 0.265 (95%CI, 0.230-0.300) and 0.241 (95%CI, 0.216-0.266), respectively. When performing more extensive logistic model revisions, BLR and MarBLR increased the average area under the receiver-operating characteristic curve (aAUC) from 0.767 (95%CI, 0.765-0.769) to 0.800 (95%CI, 0.798-0.802) and 0.799 (95%CI, 0.797-0.801), respectively, in stationary settings and protected against substantial model decay. In the COPD study, BLR and MarBLR dynamically combined the original model with a continually refitted gradient boosted tree to achieve aAUCs of 0.924 (95%CI, 0.913-0.935) and 0.925 (95%CI, 0.914-0.935), compared to the static model's aAUC of 0.904 (95%CI, 0.892-0.916). DISCUSSION: Despite its simplicity, BLR is highly competitive with MarBLR. MarBLR outperforms BLR when its prior better reflects the data.
CONCLUSIONS: BLR and MarBLR can improve the transportability of clinical prediction models and maintain their performance over time. Published by Oxford University Press on behalf of the American Medical Informatics Association 2022. This work is written by US Government employees and is in the public domain in the US.

Entities:  

Keywords:  Bayesian model updating; clinical prediction models; machine learning; model recalibration

Mesh:

Year:  2022        PMID: 35022756      PMCID: PMC9006691          DOI: 10.1093/jamia/ocab280

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  17 in total

1.  A nonparametric updating method to correct clinical prediction model drift.

Authors:  Sharon E Davis; Robert A Greevy; Christopher Fonnesbeck; Thomas A Lasko; Colin G Walsh; Michael E Matheny
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

Review 2.  Clinical applications of continual learning machine learning.

Authors:  Cecilia S Lee; Aaron Y Lee
Journal:  Lancet Digit Health       Date:  2020-06

3.  Recalibrating our prediction models in the ICU: time to move from the abacus to the computer.

Authors:  Romain Pirracchio; Otavio T Ranzani
Journal:  Intensive Care Med       Date:  2014-02-14       Impact factor: 17.440

4.  A closed testing procedure to select an appropriate method for updating prediction models.

Authors:  Yvonne Vergouwe; Daan Nieboer; Rianne Oostenbrink; Thomas P A Debray; Gordon D Murray; Michael W Kattan; Hendrik Koffijberg; Karel G M Moons; Ewout W Steyerberg
Journal:  Stat Med       Date:  2016-11-28       Impact factor: 2.373

5.  Detection of calibration drift in clinical prediction models to inform model updating.

Authors:  Sharon E Davis; Robert A Greevy; Thomas A Lasko; Colin G Walsh; Michael E Matheny
Journal:  J Biomed Inform       Date:  2020-11-04       Impact factor: 6.317

6.  Improving patient prostate cancer risk assessment: Moving from static, globally-applied to dynamic, practice-specific risk calculators.

Authors:  Andreas N Strobl; Andrew J Vickers; Ben Van Calster; Ewout Steyerberg; Robin J Leach; Ian M Thompson; Donna P Ankerst
Journal:  J Biomed Inform       Date:  2015-05-16       Impact factor: 6.317

7.  The state of artificial intelligence-based FDA-approved medical devices and algorithms: an online database.

Authors:  Stan Benjamens; Pranavsingh Dhunnoo; Bertalan Meskó
Journal:  NPJ Digit Med       Date:  2020-09-11

8.  Approval policies for modifications to machine learning-based software as a medical device: A study of bio-creep.

Authors:  Jean Feng; Scott Emerson; Noah Simon
Journal:  Biometrics       Date:  2020-10-11       Impact factor: 2.571

9.  Updating methods improved the performance of a clinical prediction model in new patients.

Authors:  K J M Janssen; K G M Moons; C J Kalkman; D E Grobbee; Y Vergouwe
Journal:  J Clin Epidemiol       Date:  2007-11-26       Impact factor: 6.437

10.  Calibration drift in regression and machine learning models for acute kidney injury.

Authors:  Sharon E Davis; Thomas A Lasko; Guanhua Chen; Edward D Siew; Michael E Matheny
Journal:  J Am Med Inform Assoc       Date:  2017-11-01       Impact factor: 4.497

View more
  1 in total

Review 1.  Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare.

Authors:  Jean Feng; Rachael V Phillips; Ivana Malenica; Andrew Bishara; Alan E Hubbard; Leo A Celi; Romain Pirracchio
Journal:  NPJ Digit Med       Date:  2022-05-31
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.