Average cost Markov decision processes with weakly continuous transition probabilities

From MaRDI portal
Publication:2925348

DOI10.1287/MOOR.1120.0555zbMATH Open1297.90173arXiv1202.4122OpenAlexW1966208686MaRDI QIDQ2925348FDOQ2925348


Authors: Eugene A. Feinberg, Pavlo O. Kasyanov, Nina V. Zadoianchuk Edit this on Wikidata


Publication date: 21 October 2014

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Abstract: This paper presents sufficient conditions for the existence of stationary optimal policies for average-cost Markov Decision Processes with Borel state and action sets and with weakly continuous transition probabilities. The one-step cost functions may be unbounded, and action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of stationary discount-optimal and average-cost optimal policies and descriptions of properties of value functions and sets of optimal actions, (ii) a sufficient condition for the average-cost optimality of a stationary policy in the form of optimality inequalities, and (iii) approximations of average-cost optimal actions by discount-optimal actions.


Full work available at URL: https://arxiv.org/abs/1202.4122




Recommendations




Cites Work


Cited In (51)





This page was built for publication: Average cost Markov decision processes with weakly continuous transition probabilities

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2925348)