Average optimality for continuous-time Markov decision processes with a policy iteration approach (Q2465179): Difference between revisions

The paper deals with the average expected reward criterion for continuous-time Markov decision processes in general state and action spaces. The transition rates of underlying continuous-time jump Markov processes are allowed to be unbounded, and the reward rates may have neither upper nor lower bounds. The author gives conditions on the system's primitive data and under which he proves the existence of the average reward optimality equation and an average optimal stationary policy. Also, under proposed conditions the author ensures the existence of \(\epsilon\)-average optimal stationary polices. Moreover, the author studies some properties of average optimal stationary polices. The author not only establishes another average optimality equation on an average optimal stationary policy, but also presents an interesting ``martingale characterization'' of such a policy. The approach presented in the paper is based on the policy iteration algorithm and is different from those (``vanishing discounting factor approach'', ``optimality inequality approach'') usually used in the literature. References contain \(31\) items.

0 references

reviewed by

Wiesław Kotarski

0 references

zbMATH Keywords

continuous-time Markov decision processes

0 references

policy iteration algorithm

0 references

average criterion

0 references

optimality equation

0 references

optimal stationary policy

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/j.jmaa.2007.06.071

0 references

Identifiers

zbMATH Open document ID

1156.90023

0 references

DOI

10.1016/j.jmaa.2007.06.071

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2465179

@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.jmaa.2007.06.071
+Normal rank
@@ Property / OpenAlex ID @@
+W2073471284
@@ Property / OpenAlex ID: W2073471284 / rank @@
+Normal rank