Value iteration in countable state average cost Markov decision processes with unbounded costs
From MaRDI portal
Publication:806687
DOI10.1007/BF02055585zbMath0729.90088MaRDI QIDQ806687
Publication date: 1991
Published in: Annals of Operations Research (Search for Journal in Brave)
unbounded costscountable state Markov decision processesexpected average cost optimal stationary policyfinite action setsundiscounted value iterationvariable arrival parametervariable service rates
Queueing theory (aspects of probability theory) (60K25) Queues and service in operations research (90B22) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40) Computational methods for problems pertaining to operations research and mathematical programming (90-08)
Related Items
Approximation of average cost optimal policies for general Markov decision processes with unbounded costs, Value iteration in average cost Markov control processes on Borel spaces, An optimal control approach to day-to-day congestion pricing for stochastic transportation networks, A pause control approach to the value iteration scheme in average Markov decision processes, A note on the convergence rate of the value iteration scheme in controlled Markov chains, SOJOURN TIMES IN NON-HOMOGENEOUS QBD PROCESSES WITH PROCESSOR SHARING, Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters, Constrained Discounted Markov Decision Chains, The value iteration method for countable state Markov decision processes, Incompleteness of results for the slow-server problem with an unreliable fast server
Cites Work
- Unnamed Item
- A new condition for the existence of optimal stationary policies in average cost Markov decision processes
- Adaptive Markov control processes
- Dynamic programming, Markov chains, and the method of successive approximations
- Average Cost Semi-Markov Decision Processes and the Control of Queueing Systems
- Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
- The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model
- Hitting times of Markov chains, with application to state-dependent queues
- The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
- Some Conditions for Ergodicity and Recurrence of Markov Chains