Uniform convergence of value iteration policies for discounted Markov decision processes
From MaRDI portal
Publication:2467010
Recommendations
- The convergence of value iteration in discounted Markov decision processes
- On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case
- On convergence of value iteration for a class of total cost Markov decision processes
- Pointwise approximations of discounted Markov decision processes to optimal policies
- The value iteration method for countable state Markov decision processes
Cited in (18)
- Identification of optimal policies in Markov decision processes
- Convergence of value functions for finite horizon Markov decision processes with constraints
- On convergence of value iteration for a class of total cost Markov decision processes
- A mixed value and policy iteration method for stochastic control with universally measurable policies
- On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case
- Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
- An empirical study of policy convergence in Markov decision process value iteration
- A Note on the Convergence of Policy Iteration in Markov Decision Processes with Compact Action Spaces
- Pointwise approximations of discounted Markov decision processes to optimal policies
- A Lyapunov-based version of the value iteration algorithm formulated as a discrete-time switched affine system
- The convergence of value iteration in discounted Markov decision processes
- Regular policies in abstract dynamic programming
- Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes
- Convergence Properties of Policy Iteration
- Convergence in unconstrained discrete-time differential dynamic programming
- Suboptimality of the value iteration policies in discounted linear-quadratic models
- A stopping rule for discounted Markov decision processes with finite action sets
- scientific article; zbMATH DE number 790730