A K-step look-ahead analysis of value iteration algorithms for Markov decision processes
From MaRDI portal
Publication:1266643
Recommendations
Cites work
- scientific article; zbMATH DE number 3628710 (Why is no real title available?)
- scientific article; zbMATH DE number 3632247 (Why is no real title available?)
- scientific article; zbMATH DE number 522741 (Why is no real title available?)
- A set of successive approximation methods for discounted Markovian decision problems
- A simple technique in Markovian control with applications to resource allocation to resource allocation in communication networks
- Accelerating Procedures of the Value Iteration Algorithm for Discounted Markov Decision Processes, Based on a One-Step Lookahead Analysis
- Action Elimination Procedures for Modified Policy Iteration Algorithms
- Bounds and Transformations for Discounted Finite Markov Decision Chains
- Branch-and-Bound Strategies for Dynamic Programming
- Computational comparison of value iteration algorithms for discounted Markov decision processes
- Computing Optimal Policies for Controlled Tandem Queueing Systems
- Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes
- Discrete versions of an algorithm due to Varaiya
- Iterative solution of the functional equations of undiscounted Markov renewal programming
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
- Some Bounds for Discounted Sequential Decision Processes
- Technical Note—Accelerated Computation of the Expected Discounted Return in a Markov Chain
- Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations
- The convergence of value iteration in discounted Markov decision processes
- Truncated policy iteration methods
Cited in
(2)
This page was built for publication: A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1266643)