| Literature DB >> 32646412 |
Chao Yu1, Guoqi Ren2, Yinzhao Dong2.
Abstract
BACKGROUND: Reinforcement learning (RL) provides a promising technique to solve complex sequential decision making problems in healthcare domains. Recent years have seen a great progress of applying RL in addressing decision-making problems in Intensive Care Units (ICUs). However, since the goal of traditional RL algorithms is to maximize a long-term reward function, exploration in the learning process may have a fatal impact on the patient. As such, a short-term goal should also be considered to keep the patient stable during the treating process.Entities:
Keywords: Intensive care units; Inverse learning; Mechanical ventilation; Reinforcement learning; Sedative dosing
Mesh:
Substances:
Year: 2020 PMID: 32646412 PMCID: PMC7344039 DOI: 10.1186/s12911-020-1120-5
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1The framework of SAC algorithm
Fig. 2Learning dynamics in terms of Q values regarding ventilation using SAC and AC algorithms
Fig. 3The process of SAC and AC algorithms evaluating the accuracy of the test set on ventilation
The AR of learned strategies using SAC and AC algorithms on the test data set
| Strategy | Validation set | Expert data | Common single intubation | Multiple intubation |
|---|---|---|---|---|
| SAC | 99.55% | 99.57% | 99.51% | 99.55% |
| AC | 99.48% | 99.47% | 99.46% | 99.49% |
Fig. 4Learning dynamics in terms of Q values regarding sedative using SAC and AC algorithms
Fig. 5MSE reduction process of SAC and AC algorithms on the train set for sedative dosing
The AR and MSE of learned polices using SAC and AC algorithms on the test data set
| Strategy | MSE | AR |
|---|---|---|
| SAC | 2.49 | 41.5% |
| AC | 3.10 | 41.5% |
The AR and MSE of learned polices using SAC and AC algorithms on expert data, single intubation and multiple intubation
| Strategy | Expert data | Single intubation | Multiple intubation | |||
|---|---|---|---|---|---|---|
| MSE | AR | MSE | AR | MSE | AR | |
| SAC | 2.52 | 37% | 2.71 | 38% | 2.28 | 47% |
| AC | 3.01 | 34% | 3.15 | 35% | 3.08 | 41% |