| Literature DB >> 35083434 |
Chih-Hao Huang1, Feras A Batarseh2, Adel Boueiz3, Ajay Kulkarni1, Po-Hsuan Su1, Jahan Aman1.
Abstract
The quality of service in healthcare is constantly challenged by outlier events such as pandemics (i.e., Covid-19) and natural disasters (such as hurricanes and earthquakes). In most cases, such events lead to critical uncertainties in decision-making, as well as in multiple medical and economic aspects at a hospital. External (geographic) or internal factors (medical and managerial) lead to shifts in planning and budgeting, but most importantly, reduce confidence in conventional processes. In some cases, support from other hospitals proves necessary, which exacerbates the planning aspect. This paper presents three data-driven methods that provide data-driven indicators to help healthcare managers organize their economics and identify the most optimum plan for resources allocation and sharing. Conventional decision-making methods fall short in recommending validated policies for managers. Using reinforcement learning, genetic algorithms, traveling salesman, and clustering, we experimented with different healthcare variables and presented tools and outcomes that could be applied at health institutes. Experiments are performed; the results are recorded, evaluated, and presented.Entities:
Keywords: data normalization; decision-making; fitness function; genetic algorithms; reinforcement learning; resource allocation
Year: 2021 PMID: 35083434 PMCID: PMC8788986 DOI: 10.1017/dap.2021.29
Source DB: PubMed Journal: Data Policy ISSN: 2632-3249
Reinforcement Learning (RL) for decision-making
| Year | Author(s) | Description |
|---|---|---|
| 1982 | Boyan Jovanovic | Applied RL with the use of Bayesian Learning to study single firm dynamics |
| 2001 | John Moody and Matthew Saffell | Implemented an adaptive algorithm: Recurrent RL with the utilization of Q-Learning for optimizing portfolios and asset allocations |
| 2007 | Michael Schwind | Applied RL and combinatorial auctions to bidding decision problems |
| 2014 | Neal Hughes | Author introduced methods based on Q-iteration and a batch version of Q-Learning to solve economic problems |
| 2016 | Koichiro Ito and Mar Reguant | Use of RL to characterize strategic behavior in sequential markets under imperfect competition and restricted entry in arbitrage |
| 2016 | Yue Deng et al. | Implemented concepts from deep learning and RL for real-time financial signal representations and trading |
| 2017 | Han Cai et al. | Built a Markov Decision Process framework for learning the optimal bidding policy and optimize advertising |
| 2017 | Saud Almahdi and Steve Y. Yang | Developing risk-based RL portfolios for rebalancing and market condition stop-loss retraining mechanism |
| 2018 | Jun Zhao et al. | Applied deep RL for bidding optimization in online advertising |
| 2018 | Thomas Spooner et al. | Authors provided one of the solutions for the market making trading problem by designing a market making agent using temporal-difference RL |
| 2018 | Zhuoran Xiong et al. | Authors implemented Deep Deterministic Policy Gradient (DDPG) methods based on deep RL for finding the best trading strategy in complex and dynamic stock markets |
| 2019 | Haoran Wang and Xun Yu Zhou | Authors achieved the best tradeoff between exploration and exploitation using an entropy-regularized relaxed stochastic control problem using RL |
| 2019 | Olivier Guéant and Iuliia Manziuk | Authors proposed a new approach of implementing model-based approximations of optimal bid and ask quotes for bonds |
| 2019 | Xinyi Li et al. | A new DDPG technique which incorporates optimistic or pessimistic deep RL for the portfolio allocation tasks |
| 2019 | Yuming Li et al. | Deep Q-Network for decision-making |
| 2020 | Bastien Baldacci et al. | Authors designed approaches to approximate the financial market and other optimal controls for real-time decisions |
Figure 1.Resources management dashboard using Reinforcement Learning.
Resources management decisions using Reinforcement Learning (RL) (based on the Pandemic Severity Assessment Framework scale)
| RL model inputs | RL model outputs | ||||||
|---|---|---|---|---|---|---|---|
| Use case | Ratio for case hospitalization | Scaled measure of clinical severity | Scaled measure of transmissibility | Stage | Value | Action | Event |
| 1 | 0.1 | 2 | 2 | 1 | 0.81 | Idle | Normal |
| 2 | 1.71 | Idle | Normal | ||||
| 3 | 3.71 | Idle | Normal | ||||
| 2 | 0.1 | 7 | 5 | 1 | 2.84 | Idle | Normal |
| 2 | 5.98 | Idle | Normal | ||||
| 3 | 12.98 | Idle | Normal | ||||
| 3 | 0.5 | 2 | 2 | 1 | 0.4 | Idle | Normal |
| 2 | 1.2 | Share | Normal | ||||
| 3 | 2.8 | Idle | Normal | ||||
| 4 | 0.5 | 7 | 5 | 1 | 0.88 | Idle | Normal |
| 2 | 2.62 | Idle | Normal | ||||
| 3 | 9.62 | Idle | Normal | ||||
| 5 | 0.9 | 2 | 2 | 1 | 0.1 | Idle | Normal |
| 2 | 1.05 | Idle | Normal | ||||
| 3 | 2.15 | Idle | Normal | ||||
| 6 | 0.9 | 7 | 5 | 1 | 0.1 | Idle | Outlier |
| 2 | 1.05 | Share | Outlier | ||||
| 3 | 7.41 | Idle | Outlier | ||||
Top 10 hospitals in terms of resource sharing readiness using Genetic Algorithms
| Facility name | Rating | No. of beds | Death rate | cost | Decision 1 | FF1 | Decision 2 | FF2 |
|---|---|---|---|---|---|---|---|---|
| Centra | 4 | 39.1 | 10.3 | 63.6 | 1 | 29.2 | 1 | 7.3 |
| Inova Alexandria Hospital | 5 | 42.4 | 25.5 | 64.5 | 1 | 16.9 | 1 | 3.4 |
| Bon Secours St Mary’s Hospital | 3 | 68.5 | 26.1 | 77.6 | 1 | 15.7 | 1 | 5.2 |
| Mary Washington Hospital | 3 | 29.3 | 13.3 | 70.2 | 1 | 13.2 | 1 | 4.4 |
| Sentara Princess Anne Hospital | 5 | 16.3 | 15.2 | 66.5 | 1 | 11.4 | 1 | 2.3 |
| Inova Fair Oaks Hospital | 5 | 33.7 | 30.9 | 69.2 | 1 | 11.3 | 1 | 2.3 |
| Sentara Norfolk General Hospital | 3 | 64.1 | 34.5 | 54.8 | 0 | 11.0 | 1 | 3.7 |
| Winchester Medical Center | 4 | 51.1 | 43.6 | 50.3 | 0 | 9.4 | 0 | 2.4 |
| Virginia Hospital Center | 5 | 33.7 | 38.8 | 68.8 | 1 | 9.1 | 1 | 1.8 |
| Reston Hospital Center | 4 | 16.3 | 17.0 | 66.7 | 1 | 8.2 | 1 | 2.1 |
| Inova Loudoun Hospital | 5 | 23.9 | 30.9 | 49.7 | 1 | 8.2 | 1 | 1.6 |
Figure 2.Resource allocation inputs and outputs for Genetic Algorithms using (a) FF1 and (b) FF2.
Figure 3.Beds allocation for Veterans Affairs hospitals based on Method #2.
Fitness values with different fitness functions
| Algorithm | Fitness value |
|---|---|
| FF4—Medical Centers | 1.51 × 10−3 |
| FF4—State Centers | 4.74 × 10−3 |
| FF3—State Centers—Unnormalized | 1.91 × 10−8 |
| FF3—State Centers—Normalized | 1.04 |
| FF3—State Centers—Unnormalized— | 1.38 × 10−7 |
| FF3—State Centers—Normalized— | 1.64 |
Figure 4.Resource allocation routes using Traveling Salesman Problem with Genetic Algorithm. (a) Veterans Affairs medical centers using unnormalized data with FF3; (b) state centers using unnormalized with FF3; (c) state centers using unnormalized data with FF4; (d) state centers using normalized data with FF4; and (e) state centers using K-means with FF4 with normalized data.