| Literature DB >> 31656388 |
Taylor Killian1, Samuel Daulton2, George Konidaris3, Finale Doshi-Velez1.
Abstract
We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.Entities:
Year: 2017 PMID: 31656388 PMCID: PMC6814194
Source DB: PubMed Journal: Adv Neural Inf Process Syst ISSN: 1049-5258