Estimate and approximate policy iteration algorithm for discounted Markov decision models with bounded costs and Borel spaces

From MaRDI portal

Revision as of 22:51, 3 February 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:3119642

Jump to:navigation, search

DOI10.3233/RDA-160116zbMath1409.90218OpenAlexW2621122412WikidataQ114943784 ScholiaQ114943784MaRDI QIDQ3119642

Ma. Teresa Robles Alcaraz, Óscar Vega-Amaya, J. Adolfo Minjárez-Sosa

Publication date: 12 March 2019

Published in: Risk and Decision Analysis (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.3233/rda-160116

zbMATH Keywords

Markov decision processes density estimation discounted criterion approximate policy iteration

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Related Items

Approximation of discounted minimax Markov control problems and zero-sum Markov games using Hausdorff and Wasserstein distances, A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs, Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3119642&oldid=16211675"