A new condition for the existence of optimal stationary policies in average cost Markov decision processes (Q1076617): Difference between revisions

The author considers a discrete time, countable state Markov decision processes with finite decision sets and bounded costs. He obtains conditions under which (possibly) unbounded solution to the average cost equation for the optimal value exists and yields an optimal stationary policy. In the special case in which every stationary policy induces an ergodic Markov chain, he obtains a new form for the optimality equation and gives a sufficient condition for the existence of an optimal stationary policy. The results are illustrated by some examples.

0 references

reviewed by

Shmuel Gal

0 references

zbMATH Keywords

stochastic dynamic programming

0 references

discrete time, countable state Markov decision processes

0 references

finite decision sets

0 references

bounded costs

0 references

optimal stationary policy

0 references

ergodic Markov chain

0 references

optimality equation

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/0167-6377(86)90095-7

0 references

cites work

Denumerable State Markovian Decision Processes-Average Cost Criterion

0 references

On the Stochastic Matrices Associated with Certain Queuing Processes

0 references

Q3313617

0 references

Some Conditions for Ergodicity and Recurrence of Markov Chains

0 references

Non-Discounted Denumerable Markovian Decision Models

0 references

Q3683893

0 references

Technical Note—Mean Drifts and the Non-Ergodicity of Markov Chains

0 references

Technical Note—An Equivalence Between Continuous and Discrete Time Markov Decision Processes

0 references

Criteria for strong ergodicity of Markov chains

0 references

The existence of moments for stationary Markov chains

0 references

Optimal control of service rates in networks of queues

0 references

Identifiers

zbMATH Open document ID

0593.90083

0 references

DOI

10.1016/0167-6377(86)90095-7

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1076617

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1016/0167-6377(86)90095-7
+Normal rank
@@ Property / OpenAlex ID @@
+W2033474053
@@ Property / OpenAlex ID: W2033474053 / rank @@
+Normal rank
@@ Property / cites work @@
+Denumerable State Markovian Decision Processes-Average Cost Criterion
+Normal rank
@@ Property / cites work @@
+On the Stochastic Matrices Associated with Certain Queuing Processes
+Normal rank
@@ Property / cites work @@
+Q3313617
@@ Property / cites work: Q3313617 / rank @@
+Normal rank
@@ Property / cites work @@
+Some Conditions for Ergodicity and Recurrence of Markov Chains
+Normal rank
@@ Property / cites work @@
+Non-Discounted Denumerable Markovian Decision Models
+Normal rank
@@ Property / cites work @@
+Q3683893
@@ Property / cites work: Q3683893 / rank @@
+Normal rank
@@ Property / cites work @@
+Technical Note—Mean Drifts and the Non-Ergodicity of Markov Chains
+Normal rank
@@ Property / cites work @@
+Technical Note—An Equivalence Between Continuous and Discrete Time Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Criteria for strong ergodicity of Markov chains
@@ Property / cites work: Criteria for strong ergodicity of Markov chains / rank @@
+Normal rank
@@ Property / cites work @@
+The existence of moments for stationary Markov chains
+Normal rank
@@ Property / cites work @@
+Optimal control of service rates in networks of queues
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:1076617