On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities (Q2264001): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1016/j.jmaa.2015.02.007 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2028232213 / rank | |||
Normal rank |
Revision as of 21:00, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities |
scientific article |
Statements
On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities (English)
0 references
19 March 2015
0 references
This paper deals with average cost Markov decision processes with Borel state and control spaces, possibly unbounded costs and non-compact action subsets under the assumption of weak continuity of the transition law. It provides a simplified and somewhat elementary proof of the existence of average cost for stationary optimal policies under the same assumptions as in the paper by \textit{E. A. Feinberg} et al. [Math. Oper. Res. 37, No. 4, 591--607 (2012; Zbl 1297.90173)]. The prove is based on the concept of lower semicontinuous envelope of functions and an elementary result on the interchange of limits and minima in lieu of a Fatou's lemma for varying measures.
0 references
Markov decision processes
0 references
average cost criterion
0 references
vanishing discount factor approach
0 references