| Literature DB >> 34664120 |
Shihan Wang1,2, Chao Zhang3,4, Ben Kröse5,6, Herke van Hoof5.
Abstract
Mobile health (mHealth) intervention systems can employ adaptive strategies to interact with users. Instead of designing such complex strategies manually, reinforcement learning (RL) can be used to adaptively optimize intervention strategies concerning the user's context. In this paper, we focus on the issue of overwhelming interactions when learning a good adaptive strategy for the user in RL-based mHealth intervention agents. We present a data-driven approach integrating psychological insights and knowledge of historical data. It allows RL agents to optimize the strategy of delivering context-aware notifications from empirical data when counterfactual information (user responses when receiving notifications) is missing. Our approach also considers a constraint on the frequency of notifications, which reduces the interaction burden for users. We evaluated our approach in several simulation scenarios using real large-scale running data. The results indicate that our RL agent can deliver notifications in a manner that realizes a higher behavioral impact than context-blind strategies.Entities:
Keywords: Adaptive agent; Human simulator; Just-in-time adaptive intervention; Mobile health intervention; Reinforcement learning
Mesh:
Year: 2021 PMID: 34664120 PMCID: PMC8523513 DOI: 10.1007/s10916-021-01773-0
Source DB: PubMed Journal: J Med Syst ISSN: 0148-5598 Impact factor: 4.460
Fig. 1The overview of our methodology, including the agent–environment interaction in the MDP model and three key components developed in both environment and agent. The approach optimizes the delivery of context-aware notifications from empirical data
Fig. 2Topological structure and transition probabilities of our dynamic Bayesian network
Fig. 3The simulation results shown the average reward of agents in the sliding windows of 500 episodes
Fig. 4Information of three episodes in the (R-REINFORCE) agent. Each circle represents one decision point, marked by hour and weekday. Black on the left side means ‘a notification’, and black on the right side means ‘a run’. The color of a circle represents the context desirability for running. While red and blue color correspond to the high and low desirability respectively, darker is more extreme