An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes
From MaRDI portal
Publication:2892321
DOI: 10.1287/ijoc.1050.0155
zbMath: 1241.90173
Wikidata: Q114967841
Scholia: Q114967841
MaRDI QID: Q2892321
Jiaqiao Hu, Steven I. Marcus, Vahid Reza Ramezani, Michael C. Fu
Publication date: 18 June 2012
Published in: INFORMS Journal on Computing
Full work available at URL: https://doi.org/10.1287/ijoc.1050.0155
90C59: Approximation methods and heuristics in mathematical programming
90C39: Dynamic programming
90C40: Markov and semi-Markov decision processes
Related Items
Optimal Online Learning for Nonlinear Belief Models Using Discrete Priors
A variable neighborhood search based algorithm for finite-horizon Markov decision processes
Cites Work