| Literature DB >> 28603402 |
Finale Doshi-Velez1, George Konidaris2.
Abstract
Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and introduce a semiparametric regression approach for learning its structure from data. We show that a learned HiP-MDP rapidly identifies the dynamics of new task instances in several settings, flexibly adapting to task variation.Entities:
Year: 2016 PMID: 28603402 PMCID: PMC5466173
Source DB: PubMed Journal: IJCAI (U S) ISSN: 1045-0823