A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
From MaRDI portal
Publication:2887630
DOI: 10.1007/s11768-011-0313-y
zbMath: 1249.90306
OpenAlex: W2044287460
Wikidata: Q115144927 (Scholia: Q115144927)
MaRDI QID: Q2887630
Publication date: 1 June 2012
Published in: Journal of Control Theory and Applications
Full work available at URL: https://doi.org/10.1007/s11768-011-0313-y
Learning and adaptive systems in artificial intelligence (68T05)
Approximation methods and heuristics in mathematical programming (90C59)
Dynamic programming (90C39)
Related Items (9)
- Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage
- Potential-based least-squares policy iteration for a parameterized feedback control system
- New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces
- Perspectives of approximate dynamic programming
- Convergence of deep fictitious play for stochastic differential games
- Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands
- Temporal difference-based policy iteration for optimal control of stochastic systems
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs
Uses Software
Cites Work
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Markov chains and stochastic stability
- Simulation-based algorithms for Markov decision processes.
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Generalized polynomial approximations in Markovian decision processes
- Stochastic optimal control. The discrete time case
- Recursive estimation of regression functions by local polynomial fitting
- Kernel-based reinforcement learning
- Practical issues in temporal difference learning
- \({\mathcal Q}\)-learning
- Simulation-based optimization: Parametric optimization techniques and reinforcement learning
- Feature-based methods for large scale dynamic programming
- Some results on Tchebycheffian spline functions and stochastic processes
- doi:10.1162/153244303768966102
- Functional Approximations and Dynamic Programming
- Approximations of Dynamic Programs, I
- An analysis of temporal-difference learning with function approximation
- The policy iteration algorithm for average reward Markov decision processes with general state space
- doi:10.1162/1532443041827907
- Policy Iterations on the Hamilton–Jacobi–Isaacs Equation for $H_{\infty}$ State Feedback Control With Input Saturation
- Approximate Dynamic Programming
- The Kernel Recursive Least-Squares Algorithm
- Neuro-Dynamic Programming: An Overview and Recent Results
- The elements of statistical learning. Data mining, inference, and prediction