Literature DB >> 31656388

Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes.

Taylor Killian1, Samuel Daulton2, George Konidaris3, Finale Doshi-Velez1.   

Abstract

We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.

Entities:  

Year:  2017        PMID: 31656388      PMCID: PMC6814194     

Source DB:  PubMed          Journal:  Adv Neural Inf Process Syst        ISSN: 1049-5258


  4 in total

1.  Dynamic multidrug therapies for hiv: optimal and sti control approaches.

Authors:  B M Adams; H T Banks; Hee-Dae Kwon; Hien T Tran
Journal:  Math Biosci Eng       Date:  2004-09       Impact factor: 2.080

2.  Informing sequential clinical decision-making through reinforcement learning: an empirical study.

Authors:  Susan M Shortreed; Eric Laber; Daniel J Lizotte; T Scott Stroup; Joelle Pineau; Susan A Murphy
Journal:  Mach Learn       Date:  2011-07-01       Impact factor: 2.940

3.  Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations.

Authors:  Finale Doshi-Velez; George Konidaris
Journal:  IJCAI (U S)       Date:  2016-07

4.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.