A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces
DOI: 10.3934/jdg.2016014 · zbMATH Open: 1346.93402 · OpenAlex: W2507723477 · MaRDI QID: Q330284
Authors: Óscar Vega-Amaya, Joaquín López-Borbón
Publication date: 25 October 2016
Published in: Journal of Dynamics and Games
Full work available at URL: https://doi.org/10.3934/jdg.2016014
Recommendations
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs.
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- Complexity bounds for approximately solving discounted MDPs by value iterations
- Value iteration for average cost Markov decision processes in Borel spaces
- Value iteration in average cost Markov control processes on Borel spaces
- On the existence of fixed points for approximate value iteration and temporal-difference learning
- The convergence of value iteration in discounted Markov decision processes
Classifications: Approximation methods and heuristics in mathematical programming (90C59) · Markov and semi-Markov decision processes (90C40) · Optimal stochastic control (93E20)
Cites Work
- Adaptive Markov control processes
- A survey of Markov decision models for control of networks of queues
- A Survey of Applications of Markov Decision Processes
- Approximate Dynamic Programming
- The approximation of continuous functions by positive linear operators
- Approximate policy iteration: a survey and some new methods
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
- Approximate dynamic programming via direct search in the space of value function approximations
- Application of average dynamic programming to inventory systems
- On the existence of fixed points for approximate value iteration and temporal-difference learning
- An approximate dynamic programming algorithm for monotone value functions
- Perspectives of approximate dynamic programming
- Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes
- Approximation of infinite horizon discounted cost Markov decision processes
- Performance Loss Bounds for Approximate Value Iteration with State Aggregation
- Performance Bounds in $L_p$-norm for Approximate Value Iteration
- Continuous state dynamic programming via nonexpansive approximation
Cited In (7)
- Approximation of discounted minimax Markov control problems and zero-sum Markov games using Hausdorff and Wasserstein distances
- Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces
- Approximate dynamic programming via direct search in the space of value function approximations
- Analyzing approximate value iteration algorithms
- A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs
- Performance Loss Bounds for Approximate Value Iteration with State Aggregation
- Performance Bounds in $L_p$-norm for Approximate Value Iteration