Approximation of average cost optimal policies for general Markov decision processes with unbounded costs

DOI10.1007/BF01193864MaRDI QIDQ1362682zbMATH OpenOpenAlexFDO

Authors Evgueni Gordienko, Raúl Montes-de-Oca, Adolfo Minjárez-sosa

Publication date 5 August 1997

Published in Mathematical Methods of Operations Research (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1007/bf01193864

value iteration Markov decision processes geometrical convergence average cost criterion Borel state space approximation of optimal policy Lyapunov-like ergodicity conditions

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40)

Recommendations

Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
On strong average optimality of Markov decision processes with unbounded costs
Average cost Markov decision processes with weakly continuous transition probabilities
Average cost optimal policies for Markov control processes with Borel state space and unbounded costs

Cites work

scientific article; zbMATH DE number 3648459 (Why is no real title available?)
scientific article; zbMATH DE number 3320878 (Why is no real title available?)
A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs
Adaptive Markov control processes
Adaptive Strategies for Certain Classes of Controlled Markov Processes
An estimate of the stability of optimal control of certain stochastic and deterministic systems
Average cost Markov control processes with weighted norms: existence of canonical policies
Average cost Markov control processes with weighted norms: value iteration
Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
Dynamic programming, Markov chains, and the method of successive approximations
Equivalence of Lyapunov stability criteria in a class of Markov decision processes
General Irreducible Markov Chains and Non-Negative Operators
Inequalities in Theorems of Ergodicity and Stability for Markov Chains with Common Phase Space. II
Infinite-horizon Markov control processes with undiscounted cost criteria: from average to overtaking optimality
Markov chains and stochastic stability
Measurable selection theorems for optimization problems
Perturbation and stability theory for Markov control problems
Perturbation theory for Markov reward processes with applications to queueing systems
Perturbation theory for unbounded Markov reward processes with applications to queueing
Sensitive Optimality Criteria in Countable State Dynamic Programming
Stochastic optimal control. The discrete time case
The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model
Value iteration in a class of average controlled Markov chains with unbounded costs: necessary and sufficient conditions for pointwise convergence
Value iteration in average cost Markov control processes on Borel spaces
Value iteration in countable state average cost Markov decision processes with unbounded costs

Cited in

(8)

Average cost Markov control processes: Stability with respect to the Kantorovich metric
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
scientific article; zbMATH DE number 513084 (Why is no real title available?)
Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities
Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds
Unbounded cost Markov decision processes with limsup and liminf average criteria: new conditions
Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion.
Exact finite approximations of average-cost countable Markov decision processes

This page was built for publication: Approximation of average cost optimal policies for general Markov decision processes with unbounded costs

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1362682)