Literature DB >> 28603402

Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations.

Finale Doshi-Velez1, George Konidaris2.   

Abstract

Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and introduce a semiparametric regression approach for learning its structure from data. We show that a learned HiP-MDP rapidly identifies the dynamics of new task instances in several settings, flexibly adapting to task variation.

Entities:  

Year:  2016        PMID: 28603402      PMCID: PMC5466173     

Source DB:  PubMed          Journal:  IJCAI (U S)        ISSN: 1045-0823


  2 in total

1.  Choice-selective sequences dominate in cortical relative to thalamic inputs to NAc to support reinforcement learning.

Authors:  Nathan F Parker; Avinash Baidya; Julia Cox; Laura M Haetzel; Anna Zhukovskaya; Malavika Murugan; Ben Engelhard; Mark S Goldman; Ilana B Witten
Journal:  Cell Rep       Date:  2022-05-17       Impact factor: 9.995

2.  Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes.

Authors:  Taylor Killian; Samuel Daulton; George Konidaris; Finale Doshi-Velez
Journal:  Adv Neural Inf Process Syst       Date:  2017-12
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.