Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games

From MaRDI portal
Publication:766176

DOI10.1007/S00236-011-0140-0zbMATH Open1242.91016arXiv1004.4005OpenAlexW2020204292MaRDI QIDQ766176FDOQ766176

Markus Rabe, Sven Schewe

Publication date: 23 March 2012

Published in: Acta Informatica (Search for Journal in Brave)

Abstract: We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-time Markov games. Furthermore, we show that optimal control does not only exist, but has a surprisingly simple structure: The optimal schedulers from our proofs are deterministic and timed-positional, and the bounded time can be divided into a finite number of intervals, in which the optimal strategies are positional. That is, we demonstrate the existence of finite optimal control. Finally, we show that these pleasant properties of Markov decision processes extend to the more general class of continuous-time Markov games, and that both early and late schedulers show this behaviour.


Full work available at URL: https://arxiv.org/abs/1004.4005




Recommendations




Cites Work


Cited In (6)





This page was built for publication: Finite optimal control for time-bounded reachability in CTMDPs and continuous-time Markov games

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q766176)