Literature DB >> 34470057

Systematic Review of Approaches to Preserve Machine Learning Performance in the Presence of Temporal Dataset Shift in Clinical Medicine.

Lin Lawrence Guo1, Stephen R Pfohl2, Jason Fries2, Jose Posada2, Scott Lanyon Fleming2, Catherine Aftandilian3, Nigam Shah2, Lillian Sung1,4.   

Abstract

OBJECTIVE: The change in performance of machine learning models over time as a result of temporal dataset shift is a barrier to machine learning-derived models facilitating decision-making in clinical practice. Our aim was to describe technical procedures used to preserve the performance of machine learning models in the presence of temporal dataset shifts.
METHODS: Studies were included if they were fully published articles that used machine learning and implemented a procedure to mitigate the effects of temporal dataset shift in a clinical setting. We described how dataset shift was measured, the procedures used to preserve model performance, and their effects.
RESULTS: Of 4,457 potentially relevant publications identified, 15 were included. The impact of temporal dataset shift was primarily quantified using changes, usually deterioration, in calibration or discrimination. Calibration deterioration was more common (n = 11) than discrimination deterioration (n = 3). Mitigation strategies were categorized as model level or feature level. Model-level approaches (n = 15) were more common than feature-level approaches (n = 2), with the most common approaches being model refitting (n = 12), probability calibration (n = 7), model updating (n = 6), and model selection (n = 6). In general, all mitigation strategies were successful at preserving calibration but not uniformly successful in preserving discrimination.
CONCLUSION: There was limited research in preserving the performance of machine learning models in the presence of temporal dataset shift in clinical medicine. Future research could focus on the impact of dataset shift on clinical decision making, benchmark the mitigation strategies on a wider range of datasets and tasks, and identify optimal strategies for specific settings. Thieme. All rights reserved.

Entities:  

Mesh:

Year:  2021        PMID: 34470057      PMCID: PMC8410238          DOI: 10.1055/s-0041-1735184

Source DB:  PubMed          Journal:  Appl Clin Inform        ISSN: 1869-0327            Impact factor:   2.762


  25 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  The number needed to benefit: estimating the value of predictive analytics in healthcare.

Authors:  Vincent X Liu; David W Bates; Jenna Wiens; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

3.  A nonparametric updating method to correct clinical prediction model drift.

Authors:  Sharon E Davis; Robert A Greevy; Christopher Fonnesbeck; Thomas A Lasko; Colin G Walsh; Michael E Matheny
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

4.  The Proliferation of Reports on Clinical Scoring Systems: Issues About Uptake and Clinical Utility.

Authors:  Douglas W Challener; Larry J Prokop; Omar Abu-Saleh
Journal:  JAMA       Date:  2019-06-25       Impact factor: 56.272

5.  Calibration Drift Among Regression and Machine Learning Models for Hospital Mortality.

Authors:  Sharon E Davis; Thomas A Lasko; Guanhua Chen; Michael E Matheny
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

Review 6.  Methods for updating a risk prediction model for cardiac surgery: a statistical primer.

Authors:  Sabrina Siregar; Daan Nieboer; Michel I M Versteegh; Ewout W Steyerberg; Johanna J M Takkenberg
Journal:  Interact Cardiovasc Thorac Surg       Date:  2019-03-01

7.  Barriers to Achieving Economies of Scale in Analysis of EHR Data. A Cautionary Tale.

Authors:  Mark P Sendak; Suresh Balu; Kevin A Schulman
Journal:  Appl Clin Inform       Date:  2017-08-09       Impact factor: 2.342

8.  Improving patient prostate cancer risk assessment: Moving from static, globally-applied to dynamic, practice-specific risk calculators.

Authors:  Andreas N Strobl; Andrew J Vickers; Ben Van Calster; Ewout Steyerberg; Robin J Leach; Ian M Thompson; Donna P Ankerst
Journal:  J Biomed Inform       Date:  2015-05-16       Impact factor: 6.317

9.  EHRtemporalVariability: delineating temporal data-set shifts in electronic health records.

Authors:  Carlos Sáez; Alba Gutiérrez-Sacristán; Isaac Kohane; Juan M García-Gómez; Paul Avillach
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

10.  Changing how we think about healthcare improvement.

Authors:  Jeffrey Braithwaite
Journal:  BMJ       Date:  2018-05-17
View more
  3 in total

Review 1.  Advances in Machine Learning Approaches to Heart Failure with Preserved Ejection Fraction.

Authors:  Faraz S Ahmad; Yuan Luo; Ramsey M Wehbe; James D Thomas; Sanjiv J Shah
Journal:  Heart Fail Clin       Date:  2022-03-04       Impact factor: 3.179

2.  Evaluation of domain generalization and adaptation on improving model robustness to temporal dataset shift in clinical medicine.

Authors:  Lin Lawrence Guo; Stephen R Pfohl; Jason Fries; Alistair E W Johnson; Jose Posada; Catherine Aftandilian; Nigam Shah; Lillian Sung
Journal:  Sci Rep       Date:  2022-02-17       Impact factor: 4.379

3.  Open questions and research gaps for monitoring and updating AI-enabled tools in clinical settings.

Authors:  Sharon E Davis; Colin G Walsh; Michael E Matheny
Journal:  Front Digit Health       Date:  2022-09-02
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.