Literature DB >> 19777080

Online Planning Algorithms for POMDPs.

Stéphane Ross1, Joelle Pineau, Sébastien Paquet, Brahim Chaib-Draa.   

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during the execution. Online algorithms generally consist of a lookahead search to find the best action to execute at each time step in an environment. Our objectives here are to survey the various existing online POMDP methods, analyze their properties and discuss their advantages and disadvantages; and to thoroughly evaluate these online approaches in different environments under various metrics (return, error bound reduction, lower bound improvement). Our experimental results indicate that state-of-the-art online heuristic search methods can handle large POMDP domains efficiently.

Entities:  

Year:  2008        PMID: 19777080      PMCID: PMC2748358     

Source DB:  PubMed          Journal:  J Artif Intell Res        ISSN: 1076-9757            Impact factor:   2.776


  1 in total

1.  Theoretical Analysis of Heuristic Search Methods for Online POMDPs.

Authors:  Stéphane Ross; Joelle Pineau; Brahim Chaib-Draa
Journal:  Adv Neural Inf Process Syst       Date:  2008
  1 in total
  8 in total

Review 1.  Emotion and decision-making: affect-driven belief systems in anxiety and depression.

Authors:  Martin P Paulus; Angela J Yu
Journal:  Trends Cogn Sci       Date:  2012-08-13       Impact factor: 20.229

2.  Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.

Authors:  Alec Solway; Matthew M Botvinick
Journal:  Psychol Rev       Date:  2012-01       Impact factor: 8.934

3.  When Optimal Feedback Control Is Not Enough: Feedforward Strategies Are Required for Optimal Control with Active Sensing.

Authors:  Sang-Hoon Yeo; David W Franklin; Daniel M Wolpert
Journal:  PLoS Comput Biol       Date:  2016-12-14       Impact factor: 4.475

4.  Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards.

Authors:  Ady-Daniel Mezei; Levente Tamás; Lucian Buşoniu
Journal:  Sensors (Basel)       Date:  2020-04-27       Impact factor: 3.576

5.  Improving counterfactual reasoning with kernelised dynamic mixing models.

Authors:  Sonali Parbhoo; Omer Gottesman; Andrew Slavin Ross; Matthieu Komorowski; Aldo Faisal; Isabella Bon; Volker Roth; Finale Doshi-Velez
Journal:  PLoS One       Date:  2018-11-12       Impact factor: 3.240

6.  Learning State-Variable Relationships in POMCP: A Framework for Mobile Robots.

Authors:  Maddalena Zuccotto; Marco Piccinelli; Alberto Castellini; Enrico Marchesini; Alessandro Farinelli
Journal:  Front Robot AI       Date:  2022-07-19

7.  Artificial intelligence-informed planning for the rapid response of hazard-impacted road networks.

Authors:  Li Sun; John Shawe-Taylor; Dina D'Ayala
Journal:  Sci Rep       Date:  2022-09-29       Impact factor: 4.996

8.  Prospective Optimization with Limited Resources.

Authors:  Joseph Snider; Dongpyo Lee; Howard Poizner; Sergei Gepshtein
Journal:  PLoS Comput Biol       Date:  2015-09-14       Impact factor: 4.475

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.