Markov decision processes with incomplete information and semiuniform Feller transition probabilities

From MaRDI portal
Publication:5097394

DOI10.1137/21M1442152zbMATH Open1498.90245arXiv2108.09232OpenAlexW3164777109WikidataQ113779024 ScholiaQ113779024MaRDI QIDQ5097394FDOQ5097394


Authors: Eugene A. Feinberg, Pavlo O. Kasyanov, Michael Z. Zgurovsky Edit this on Wikidata


Publication date: 23 August 2022

Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)

Abstract: This paper deals with control of partially observable discrete-time stochastic systems. It introduces and studies Markov Decision Processes with Incomplete Information and with semi-uniform Feller transition probabilities. The important feature of these models is that their classic reduction to Completely Observable Markov Decision Processes with belief states preserves semi-uniform Feller continuity of transition probabilities. Under mild assumptions on cost functions, optimal policies exist, optimality equations hold, and value iterations converge to optimal values for these models. In particular, for Partially Observable Markov Decision Processes the results of this paper imply new and generalize several known sufficient conditions on transition and observation probabilities for weak continuity of transition probabilities for Markov Decision Processes with belief states, the existence of optimal policies, validity of optimality equations defining optimal policies, and convergence of value iterations to optimal values.


Full work available at URL: https://arxiv.org/abs/2108.09232




Recommendations




Cites Work


Cited In (12)





This page was built for publication: Markov decision processes with incomplete information and semiuniform Feller transition probabilities

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5097394)