Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Online Planning Algorithms for POMDPs.

Literature DB >> 19777080

Online Planning Algorithms for POMDPs.

Stéphane Ross¹, Joelle Pineau, Sébastien Paquet, Brahim Chaib-Draa.

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during the execution. Online algorithms generally consist of a lookahead search to find the best action to execute at each time step in an environment. Our objectives here are to survey the various existing online POMDP methods, analyze their properties and discuss their advantages and disadvantages; and to thoroughly evaluate these online approaches in different environments under various metrics (return, error bound reduction, lower bound improvement). Our experimental results indicate that state-of-the-art online heuristic search methods can handle large POMDP domains efficiently.

Entities: Chemical Disease Gene Species

Year: 2008 PMID： 19777080 PMCID： PMC2748358

Source DB: PubMed Journal: J Artif Intell Res ISSN： 1076-9757 Impact factor: 2.776

1 in total

1. Theoretical Analysis of Heuristic Search Methods for Online POMDPs.

Authors: Stéphane Ross; Joelle Pineau; Brahim Chaib-Draa
Journal: Adv Neural Inf Process Syst Date: 2008

1 in total

8 in total

Review 1. Emotion and decision-making: affect-driven belief systems in anxiety and depression.

Authors: Martin P Paulus; Angela J Yu
Journal: Trends Cogn Sci Date: 2012-08-13 Impact factor: 20.229

2. Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.

Authors: Alec Solway; Matthew M Botvinick
Journal: Psychol Rev Date: 2012-01 Impact factor: 8.934

3. When Optimal Feedback Control Is Not Enough: Feedforward Strategies Are Required for Optimal Control with Active Sensing.

Authors: Sang-Hoon Yeo; David W Franklin; Daniel M Wolpert
Journal: PLoS Comput Biol Date: 2016-12-14 Impact factor: 4.475

Online Planning Algorithms for POMDPs.

1. Theoretical Analysis of Heuristic Search Methods for Online POMDPs.

Review 1. Emotion and decision-making: affect-driven belief systems in anxiety and depression.

2. Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.

3. When Optimal Feedback Control Is Not Enough: Feedforward Strategies Are Required for Optimal Control with Active Sensing.

4. Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards.

5. Improving counterfactual reasoning with kernelised dynamic mixing models.

6. Learning State-Variable Relationships in POMCP: A Framework for Mobile Robots.

7. Artificial intelligence-informed planning for the rapid response of hazard-impacted road networks.

8. Prospective Optimization with Limited Resources.