Literature DB >> 24049244

Batch Mode Reinforcement Learning based on the Synthesis of Artificial Trajectories.

Raphael Fonteneau1, Susan A Murphy, Louis Wehenkel, Damien Ernst.   

Abstract

In this paper, we consider the batch mode reinforcement learning setting, where the central problem is to learn from a sample of trajectories a policy that satisfies or optimizes a performance criterion. We focus on the continuous state space case for which usual resolution schemes rely on function approximators either to represent the underlying control problem or to represent its value function. As an alternative to the use of function approximators, we rely on the synthesis of "artificial trajectories" from the given sample of trajectories, and show that this idea opens new avenues for designing and analyzing algorithms for batch mode reinforcement learning.

Entities:  

Keywords:  Artificial Trajectories; Function Approximators; Optimal Control; Reinforcement Learning

Year:  2013        PMID: 24049244      PMCID: PMC3773886          DOI: 10.1007/s10479-012-1248-5

Source DB:  PubMed          Journal:  Ann Oper Res        ISSN: 0254-5330            Impact factor:   4.854


  2 in total

1.  Reinforcement learning versus model predictive control: a comparison on a power system problem.

Authors:  Damien Ernst; Mevludin Glavic; Florin Capitanescu; Louis Wehenkel
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  2008-12-16

2.  Marginal Mean Models for Dynamic Regimes.

Authors:  S A Murphy; M J van der Laan; J M Robins
Journal:  J Am Stat Assoc       Date:  2001-12-01       Impact factor: 5.033

  2 in total
  2 in total

1.  Lipschitzness is all you need to tame off-policy generative adversarial imitation learning.

Authors:  Lionel Blondé; Pablo Strasser; Alexandros Kalousis
Journal:  Mach Learn       Date:  2022-04-04       Impact factor: 5.414

2.  Experience Replay Using Transition Sequences.

Authors:  Thommen George Karimpanal; Roland Bouffanais
Journal:  Front Neurorobot       Date:  2018-06-21       Impact factor: 2.650

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.