Simulation-based algorithms for Markov decision processes.

From MaRDI portal

Publication:870662

Jump to:navigation, search

zbMath1155.90002MaRDI QIDQ870662

Jiaqiao Hu, Steven I. Marcus, Michael C. Fu, Hyeong Soo Chang

Publication date: 13 March 2007

Published in: Communications and Control Engineering (Search for Journal in Brave)

zbMATH Keywords

algorithm simulation discounting optimal policy Markov decisin process

Mathematics Subject Classification ID

Approximation methods and heuristics in mathematical programming (90C59) Markov and semi-Markov decision processes (90C40) Research exposition (monographs, survey articles) pertaining to operations research and mathematical programming (90-02)

Related Items (23)

Approximate policy iteration: a survey and some new methods ⋮ A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications ⋮ Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm ⋮ Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures ⋮ New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system ⋮ Adaptive aggregation for reinforcement learning in average reward Markov decision processes ⋮ Optimization of Markov decision processes under the variance criterion ⋮ Simulation-based optimization of Markov decision processes: an empirical process theory approach ⋮ Risk-Sensitive Reinforcement Learning via Policy Gradient Search ⋮ Sleeping experts and bandits approach to constrained Markov decision processes ⋮ The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems ⋮ Mean field Markov decision processes ⋮ CONIC TRADING IN A MARKOVIAN STEADY STATE ⋮ A \(Sarsa(\lambda)\) algorithm based on double-layer fuzzy reasoning ⋮ Approximation of Markov decision processes with general state space ⋮ Strategic capacity decision-making in a stochastic manufacturing environment using real-time approximate dynamic programming ⋮ Approximation of discounted minimax Markov control problems and zero-sum Markov games using Hausdorff and Wasserstein distances ⋮ Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques ⋮ Sampled fictitious play for approximate dynamic programming ⋮ Computable approximations for average Markov decision processes in continuous time ⋮ Coupling based estimation approaches for the average reward performance potential in Markov chains ⋮ What you should know about approximate dynamic programming ⋮ Stochastic approximations of constrained discounted Markov decision processes

This page was built for publication: Simulation-based algorithms for Markov decision processes.

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:870662&oldid=12819397"