Value iteration in countable state average cost Markov decision processes with unbounded costs

unbounded costs countable state Markov decision processes expected average cost optimal stationary policy finite action sets undiscounted value iteration variable arrival parameter variable service rates

Mathematics Subject Classification ID

Queues and service in operations research (90B22) Dynamic programming (90C39) Queueing theory (aspects of probability theory) (60K25) Computational methods for problems pertaining to operations research and mathematical programming (90-08) Markov and semi-Markov decision processes (90C40)

Recommendations

The value iteration method for countable state Markov decision processes
Value iteration in a class of average controlled Markov chains with unbounded costs: necessary and sufficient conditions for pointwise convergence
Value iteration for average cost Markov decision processes in Borel spaces
Value Iteration in a Class of Communicating Markov Decision Chains with the Average Cost Criterion
Value iteration in average cost Markov control processes on Borel spaces

Cites work

scientific article; zbMATH DE number 3906232 (Why is no real title available?)
A new condition for the existence of optimal stationary policies in average cost Markov decision processes
Adaptive Markov control processes
Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems
Dynamic programming, Markov chains, and the method of successive approximations
Hitting times of Markov chains, with application to state-dependent queues
Some Conditions for Ergodicity and Recurrence of Markov Chains
The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model

Cited in

(18)

Constrained Discounted Markov Decision Chains
Structures of optimal policies in MDPs with unbounded jumps: the state of our art
SOJOURN TIMES IN NON-HOMOGENEOUS QBD PROCESSES WITH PROCESSOR SHARING
A note on the convergence rate of the value iteration scheme in controlled Markov chains
A pause control approach to the value iteration scheme in average Markov decision processes
On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters
The value iteration method for countable state Markov decision processes
Incompleteness of results for the slow-server problem with an unreliable fast server
scientific article; zbMATH DE number 970660 (Why is no real title available?)
On convergence of value iteration for a class of total cost Markov decision processes
Value iteration in average cost Markov control processes on Borel spaces
An optimal control approach to day-to-day congestion pricing for stochastic transportation networks
Isotone policies for the value iteration method for Markov decision processes
Value iteration for average cost Markov decision processes in Borel spaces
Value Iteration in a Class of Communicating Markov Decision Chains with the Average Cost Criterion
Approximation of average cost optimal policies for general Markov decision processes with unbounded costs
A value iteration method for undiscounted multichain Markov decision processes

This page was built for publication: Value iteration in countable state average cost Markov decision processes with unbounded costs

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q806687)