A set of successive approximation methods for discounted Markovian decision problems
From MaRDI portal
Publication:4133441
Cites work
- scientific article; zbMATH DE number 3148886 (Why is no real title available?)
- scientific article; zbMATH DE number 3333895 (Why is no real title available?)
- A modified dynamic programming method for Markovian decision problems
- Contraction Mappings in the Theory Underlying Dynamic Programming
- Discounted Dynamic Programming
- Discounted semi-Markov decision processes: linear programming and policy iteration
- Discrete Dynamic Programming
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- Linear Programming Solutions for Separable Markovian Decision Problems
- Some Bounds for Discounted Sequential Decision Processes
- Technical Note—Elimination of Suboptimal Actions in Markov Decision Problems
Cited in
(16)- A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes
- Discounted Markov games: Generalized policy iteration method
- MARKOV DECISION PROCESSES
- The numerical exploitation of periodicity in Markov decision processes
- On theory and algorithms for Markov decision problems with the total reward criterion
- Action-dependent stopping times and Markov decision process with unbounded rewards
- A successive approximation algorithm for an undiscounted Markov decision process
- Markov programming by successive approximations with respect to weighted supremum norms
- Solving linear systems by methods based on a probabilistic interpretation
- The method of value oriented successive approximations for the average reward Markov decision process
- Optimization of STEOR networks via Markov renewal programming
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
- GPI-based design for partially unknown nonlinear two-player zero-sum games
- On integral generalized policy iteration for continuous-time linear quadratic regulations
- (Approximate) iterated successive approximations algorithm for sequential decision processes
This page was built for publication: A set of successive approximation methods for discounted Markovian decision problems
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4133441)