Literature DB >> 29225449

Action Centered Contextual Bandits.

Kristjan Greenewald1, Ambuj Tewari2, Predrag Klasnja3, Susan Murphy4.   

Abstract

Contextual bandits have become popular as they offer a middle ground between very simple approaches based on multi-armed bandits and very complex approaches using the full power of reinforcement learning. They have demonstrated success in web applications and have a rich body of associated theoretical guarantees. Linear models are well understood theoretically and preferred by practitioners because they are not only easily interpretable but also simple to implement and debug. Furthermore, if the linear model is true, we get very strong performance guarantees. Unfortunately, in emerging applications in mobile health, the time-invariant linear model assumption is untenable. We provide an extension of the linear model for contextual bandits that has two parts: baseline reward and treatment effect. We allow the former to be complex but keep the latter simple. We argue that this model is plausible for mobile health applications. At the same time, it leads to algorithms with strong performance guarantees as in the linear model setting, while still allowing for complex nonlinear baseline modeling. Our theory is supported by experiments on data gathered in a recently concluded mobile health study.

Entities:  

Year:  2017        PMID: 29225449      PMCID: PMC5719505     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  2 in total

1.  Microrandomized trials: An experimental design for developing just-in-time adaptive interventions.

Authors:  Predrag Klasnja; Eric B Hekler; Saul Shiffman; Audrey Boruvka; Daniel Almirall; Ambuj Tewari; Susan A Murphy
Journal:  Health Psychol       Date:  2015-12       Impact factor: 4.267

2.  Sample size calculations for micro-randomized trials in mHealth.

Authors:  Peng Liao; Predrag Klasnja; Ambuj Tewari; Susan A Murphy
Journal:  Stat Med       Date:  2015-12-28       Impact factor: 2.373

  2 in total
  4 in total

1.  Power Constrained Bandits.

Authors:  Jiayu Yao; Emma Brunskill; Weiwei Pan; Susan Murphy; Finale Doshi-Velez
Journal:  Proc Mach Learn Res       Date:  2021-08

2.  Adaptive learning algorithms to optimize mobile applications for behavioral health: guidelines for design decisions.

Authors:  Caroline A Figueroa; Adrian Aguilera; Bibhas Chakraborty; Arghavan Modiri; Jai Aggarwal; Nina Deliu; Urmimala Sarkar; Joseph Jay Williams; Courtney R Lyles
Journal:  J Am Med Inform Assoc       Date:  2021-06-12       Impact factor: 4.497

Review 3.  Personalization of Intervention Timing for Physical Activity: Scoping Review.

Authors:  Saurabh Chaudhari; Suparna Ghanvatkar; Atreyi Kankanhalli
Journal:  JMIR Mhealth Uhealth       Date:  2022-02-28       Impact factor: 4.947

4.  Notifications to Improve Engagement With an Alcohol Reduction App: Protocol for a Micro-Randomized Trial.

Authors:  Lauren Bell; Claire Garnett; Tianchen Qian; Olga Perski; Henry W W Potts; Elizabeth Williamson
Journal:  JMIR Res Protoc       Date:  2020-08-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.