Some Bounds for Discounted Sequential Decision Processes

Cited in (32)

Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung
A method of bisection for discounted Markov decision problems
Computational comparison of value iteration algorithms for discounted Markov decision processes
A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes
A decision exclusion algorithm for a class of Markovian Decision Processes
Discounted Markov games: Generalized policy iteration method
Block-successive approximation for a discounted Markov decision model
Bounds on the fixed point of a monotone contraction operator
Solving infinite horizon discounted Markov decision process problems for a range of discount factors
MARKOV DECISION PROCESSES
Replacement process decomposition for discounted Markov renewal programming
Estimates for finite-stage dynamic programs
Variational characterizations in Markov decision processes
Serial and parallel value iteration algorithms for discounted Markov decision processes
Bounds for the renewal function
A natural extension of the MacQueen extrapolation
Conditions for characterizing the structure of optimal strategies in infinite-horizon dynamic programs
The method of value oriented successive approximations for the average reward Markov decision process
Using adaptive learning in credit scoring to estimate take-up probability distribution
Block-scaling of value-iteration for discounted Markov renewal programming
Computation techniques for large scale undiscounted Markov decision processes
Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
A superharmonic approach to solving infinite horizon partially observable Markov decision problems
A set of successive approximation methods for discounted Markovian decision problems
Some basic concepts of numerical treatment of Markov decision models
Error bounds for stochastic shortest path problems
(Approximate) iterated successive approximations algorithm for sequential decision processes
An Heuristic for Multi-Dimensional Markov Decision Processes
Markov decision processes
Nonstationary Markov decision problems with converging parameters
Discounted Markov games; successive approximation and stopping times
Contraction mappings underlying undiscounted Markov decision problems

This page was built for publication: Some Bounds for Discounted Sequential Decision Processes
