Literature DB >> 33936452

Is Deep Reinforcement Learning Ready for Practical Applications in Healthcare? A Sensitivity Analysis of Duel-DDQN for Hemodynamic Management in Sepsis Patients.

MingYu Lu1, Zachary Shahn2, Daby Sow2, Finale Doshi-Velez3, Li-Wei H Lehman1.   

Abstract

The potential of Reinforcement Learning (RL) has been demonstrated through successful applications to games such as Go and Atari. However, while it is straightforward to evaluate the performance of an RL algorithm in a game setting by simply using it to play the game, evaluation is a major challenge in clinical settings where it could be unsafe to follow RL policies in practice. Thus, understanding sensitivity of RL policies to the host of decisions made during implementation is an important step toward building the type of trust in RL required for eventual clinical uptake. In this work, we perform a sensitivity analysis on a state-of-the-art RL algorithm (Dueling Double Deep Q-Networks) applied to hemodynamic stabilization treatment strategies for septic patients in the ICU. We consider sensitivity of learned policies to input features, embedding model architecture, time discretization, reward function, and random seeds. We find that varying these settings can significantly impact learned policies, which suggests a need for caution when interpreting RL agent output. ©2020 AMIA - All rights reserved.

Entities:  

Mesh:

Year:  2021        PMID: 33936452      PMCID: PMC8075511     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  11 in total

1.  Doubly robust estimation in missing data and causal inference models.

Authors:  Heejung Bang; James M Robins
Journal:  Biometrics       Date:  2005-12       Impact factor: 2.571

Review 2.  Inotropes and vasopressors: review of physiology and clinical use in cardiovascular disease.

Authors:  Christopher B Overgaard; Vladimír Dzavík
Journal:  Circulation       Date:  2008-09-02       Impact factor: 29.690

3.  The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3).

Authors:  Mervyn Singer; Clifford S Deutschman; Christopher Warren Seymour; Manu Shankar-Hari; Djillali Annane; Michael Bauer; Rinaldo Bellomo; Gordon R Bernard; Jean-Daniel Chiche; Craig M Coopersmith; Richard S Hotchkiss; Mitchell M Levy; John C Marshall; Greg S Martin; Steven M Opal; Gordon D Rubenfeld; Tom van der Poll; Jean-Louis Vincent; Derek C Angus
Journal:  JAMA       Date:  2016-02-23       Impact factor: 56.272

4.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

5.  Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning.

Authors:  Xuefeng Peng; Yi Ding; David Wihl; Omer Gottesman; Matthieu Komorowski; Li-Wei H Lehman; Andrew Ross; Aldo Faisal; Finale Doshi-Velez
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

6.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

7.  The Artificial Intelligence Clinician learns optimal treatment strategies for sepsis in intensive care.

Authors:  Matthieu Komorowski; Leo A Celi; Omar Badawi; Anthony C Gordon; A Aldo Faisal
Journal:  Nat Med       Date:  2018-10-22       Impact factor: 53.440

8.  Combining Kernel and Model Based Learning for HIV Therapy Selection.

Authors:  Sonali Parbhoo; Jasmina Bogojeska; Maurizio Zazzi; Volker Roth; Finale Doshi-Velez
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26

9.  Recurrent Neural Networks for Multivariate Time Series with Missing Values.

Authors:  Zhengping Che; Sanjay Purushotham; Kyunghyun Cho; David Sontag; Yan Liu
Journal:  Sci Rep       Date:  2018-04-17       Impact factor: 4.379

10.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

View more
  3 in total

1.  Editorial: Clinical Application of Artificial Intelligence in Emergency and Critical Care Medicine, Volume II.

Authors:  Zhongheng Zhang; Rahul Kashyap; Nan Liu; Longxiang Su; Qinghe Meng
Journal:  Front Med (Lausanne)       Date:  2022-05-06

2.  Editorial: Clinical Application of Artificial Intelligence in Emergency and Critical Care Medicine, Volume I.

Authors:  Zhongheng Zhang; Nan Liu; Qinghe Meng; Longxiang Su
Journal:  Front Med (Lausanne)       Date:  2021-12-06

3.  Establishment and Implementation of Potential Fluid Therapy Balance Strategies for ICU Sepsis Patients Based on Reinforcement Learning.

Authors:  Longxiang Su; Yansheng Li; Shengjun Liu; Siqi Zhang; Xiang Zhou; Li Weng; Mingliang Su; Bin Du; Weiguo Zhu; Yun Long
Journal:  Front Med (Lausanne)       Date:  2022-04-14
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.