Policy set iteration for Markov decision processes
From MaRDI portal
Publication:2350853
DOI10.1016/J.AUTOMATICA.2013.09.010zbMATH Open1315.93073OpenAlexW1978639828MaRDI QIDQ2350853FDOQ2350853
Authors: Hyeong Soo Chang
Publication date: 25 June 2015
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.automatica.2013.09.010
Recommendations
- Value set iteration for Markov decision processes
- Policy iteration for robust nonstationary Markov decision processes
- A note on policy algorithms for discounted Markov decision problems
- On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40) Stochastic systems in control theory (general) (93E03)
Cited In (11)
- Multi-policy iteration with a distributed voting.
- Policy iteration type algorithms for recurrent state Markov decision processes
- Set coverage and robust policy
- Title not available (Why is that?)
- Interval iteration algorithm for MDPs and IMDPs
- Value set iteration for Markov decision processes
- Random search for constrained Markov decision processes with multi-policy improvement
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- A new policy iteration scheme for Markov decision processes using Schweitzer's formula
- Policy iteration for robust nonstationary Markov decision processes
- Reduced complexity dynamic programming based on policy iteration
This page was built for publication: Policy set iteration for Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2350853)