Bayesian Nonparametric Methods for Partially-Observable Reinforcement Learning.

Finale Doshi-Velez, David Pfau, Frank Wood, Nicholas Roy.

Abstract

Making intelligent decisions from incomplete information is critical in many applications: for example, robots must choose actions based on imperfect sensors, and speech-based interfaces must infer a user's needs from noisy microphone inputs. What makes these tasks hard is that often we do not have a natural representation with which to model the domain and use for choosing actions; we must learn about the domain's properties while simultaneously performing the task. Learning a representation also involves trade-offs between modeling the data that we have seen previously and being able to make predictions about new data. This article explores learning representations of stochastic systems using Bayesian nonparametric statistics. Bayesian nonparametric methods allow the sophistication of a representation to scale gracefully with the complexity in the data. Our main contribution is a careful empirical evaluation of how representations learned using Bayesian nonparametric methods compare to other standard learning approaches, especially in support of planning and control. We show that the Bayesian aspects of the methods result in achieving state-of-the-art performance in decision making with relatively few samples, while the nonparametric aspects often result in fewer computations. These results hold across a variety of different techniques for choosing actions given a representation.
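As a rough illustration of the claim that a nonparametric representation grows with the data, the sketch below simulates a Chinese restaurant process, a standard Bayesian nonparametric prior over partitions. It is not the authors' model or code: the function name `crp_table_counts`, the concentration value `alpha=1.0`, and the seed are assumptions chosen only for this example. The point is that the number of occupied tables (clusters, or latent states) is not fixed in advance and tends to increase slowly as more observations arrive.

```python
import random


def crp_table_counts(n_customers, alpha, seed=0):
    """Seat customers by a Chinese restaurant process; return table sizes.

    Hypothetical helper for illustration only, not taken from the paper.
    """
    rng = random.Random(seed)
    tables = []  # tables[k] = number of customers seated at table k
    for n in range(n_customers):
        # With n customers already seated, open a new table with
        # probability alpha / (n + alpha); otherwise join table k with
        # probability tables[k] / (n + alpha).
        r = rng.random() * (n + alpha)
        if r < alpha:
            tables.append(1)
        else:
            r -= alpha
            for k, size in enumerate(tables):
                if r < size:
                    tables[k] += 1
                    break
                r -= size
            else:
                tables[-1] += 1  # guard against floating-point round-off
    return tables


# The number of occupied tables is learned from the data, not fixed up front.
for n in (10, 100, 1000):
    print(n, len(crp_table_counts(n, alpha=1.0)))
```

Because each customer consumes exactly one random draw, runs with the same seed share a common prefix, so the table count can only stay the same or grow as `n_customers` increases.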

Year: 2015 PMID: 26353250 DOI: 10.1109/TPAMI.2013.191

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN: 0098-5589 Impact factor: 6.226

Cited

4 in total

1. Addiction beyond pharmacological effects: The role of environment complexity and bounded rationality.

Authors: Dimitri Ognibene; Vincenzo G Fiore; Xiaosi Gu
Journal: Neural Netw Date: 2019-05-08

2. Reinforcement Learning Model With Dynamic State Space Tested on Target Search Tasks for Monkeys: Extension to Learning Task Events.

Authors: Kazuhiro Sakamoto; Hinata Yamada; Norihiko Kawaguchi; Yoshito Furusawa; Naohiro Saito; Hajime Mushiake
Journal: Front Comput Neurosci Date: 2022-06-02 Impact factor: 3.387

3. Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop.

Authors: Martin Biehl; Christian Guckelsberger; Christoph Salge; Simón C Smith; Daniel Polani
Journal: Front Neurorobot Date: 2018-08-30 Impact factor: 2.650

4. Reinforcement Learning Model With Dynamic State Space Tested on Target Search Tasks for Monkeys: Self-Determination of Previous States Based on Experience Saturation and Decision Uniqueness.

Authors: Tokio Katakura; Mikihiro Yoshida; Haruki Hisano; Hajime Mushiake; Kazuhiro Sakamoto
Journal: Front Comput Neurosci Date: 2022-02-04 Impact factor: 2.380
