Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains (Q1103532)

From MaRDI portal
 
Cites

    Q4606219
    Two competing queues with linear costs and geometric service requirements: the <i>μc</i>-rule is often optimal
    Necessary conditions for the optimality equation in average-reward Markov decision processes
    Existence of optimal stationary policies in average reward Markov decision processes with a recurrent state
    A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices
    Q5599448
    Q4771778
    Q4131338
    Q5615108
    A new condition for the existence of optimal stationary policies in average cost Markov decision processes


Language: English
Label: Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains
Description: scientific article

    Statements

    Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains (English)
    1988
    Consider a discrete time Markov decision process with countable state space S. In addition to the standard assumptions of compact action sets and continuous transition probabilities, suppose that the Markov chain determined by each stationary policy f has a single positive recurrent class R(f), which is entered with probability one and which contains at least one member of a fixed, finite subset G of S. The main theorem gives, under these assumptions, five necessary and sufficient conditions (including a simultaneous Doeblin condition with set G) for the average reward optimality equation to have a bounded measurable solution for an arbitrary bounded measurable reward function. The establishment of necessity is an uncommon feature; sufficient conditions are discussed in \textit{L. C. Thomas} [``Connectedness conditions for denumerable state Markov decision processes'', in: Recent developments in Markov decision processes, R. Hartley, L. C. Thomas, D. J. White (eds.), Academic Press (1980; Zbl 0547.90064)].
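    For context, the average reward optimality equation referred to in the review can be displayed in its standard form; the notation below is the customary one for such models and is an assumption on our part, not taken from the paper under review:

    ```latex
    % Average reward optimality equation (standard form):
    %   g    : the optimal average reward (a constant)
    %   h    : a bounded measurable relative value function on S
    %   A(s) : the compact action set at state s
    %   r    : a bounded measurable reward function
    \[
      g + h(s) \;=\; \sup_{a \in A(s)} \Big[\, r(s,a) + \sum_{s' \in S} p(s' \mid s, a)\, h(s') \,\Big],
      \qquad s \in S .
    \]
    ```

    A bounded solution \((g, h)\) certifies \(g\) as the optimal average reward, and a stationary policy choosing a maximizing action in the bracket at each state is then average optimal.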
    optimal stationary policies
    discrete time Markov decision process
    countable state space
    simultaneous Doeblin condition
    average reward optimality equation
    bounded measurable reward function

    Identifiers