(Approximate) iterated successive approximations algorithm for sequential decision processes
DOI: 10.1007/S10479-012-1073-X; zbMATH Open: 1274.90469; OpenAlex: W2014981566; MaRDI QID: Q378751; FDO: Q378751
Authors: Pelin G. Canbolat, Uriel G. Rothblum
Publication date: 12 November 2013
Published in: Annals of Operations Research
Full work available at URL: https://doi.org/10.1007/s10479-012-1073-x
Recommendations
- Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes
- scientific article; zbMATH DE number 934465
- scientific article; zbMATH DE number 3950237
- Technical Note—Successive Approximations in Value Determination for a Markov Decision Process
- Convergence Properties of Policy Iteration
Keywords: successive approximations; Markov decision chains; modified policy iteration; sequential decision processes
Approximation methods and heuristics in mathematical programming (90C59); Markov and semi-Markov decision processes (90C40)
Cites Work
- Discounted Markov games: Generalized policy iteration method
- Approximations of Dynamic Programs, I
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Q-learning and enhanced policy iteration in discounted dynamic programming
- Contraction Mappings in the Theory Underlying Dynamic Programming
- Truncated policy iteration methods
- Block-successive approximation for a discounted Markov decision model
- Affine Structure and Invariant Policies for Dynamic Programs
- Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- A set of successive approximation methods for discounted Markovian decision problems
- On the Convergence of Policy Iteration in Stationary Dynamic Programming
- Some Bounds for Discounted Sequential Decision Processes
Cited In (5)