On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities (Q2264001): Difference between revisions

This paper deals with average cost Markov decision processes with Borel state and control spaces, possibly unbounded costs and non-compact action subsets under the assumption of weak continuity of the transition law. It provides a simplified and somewhat elementary proof of the existence of average cost for stationary optimal policies under the same assumptions as in the paper by \textit{E. A. Feinberg} et al. [Math. Oper. Res. 37, No. 4, 591--607 (2012; Zbl 1297.90173)]. The prove is based on the concept of lower semicontinuous envelope of functions and an elementary result on the interchange of limits and minima in lieu of a Fatou's lemma for varying measures.

0 references

reviewed by

Wiesław Kotarski

0 references

zbMATH Keywords

Markov decision processes

0 references

average cost criterion

0 references

vanishing discount factor approach

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/j.jmaa.2015.02.007

0 references

Identifiers

zbMATH Open document ID

1322.90110

0 references

DOI

10.1016/j.jmaa.2015.02.007

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2264001

@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.jmaa.2015.02.007
+Normal rank
@@ Property / OpenAlex ID @@
+W2028232213
@@ Property / OpenAlex ID: W2028232213 / rank @@
+Normal rank