Runtime monitors for Markov decision processes

DOI: 10.1007/978-3-030-81688-9_26; zbMATH Open: 1493.68214; arXiv: 2105.12322; OpenAlex: W3184761245; MaRDI QID: Q832291; FDO: Q832291

Authors: Sebastian Junges, Hazem Torfah, Sanjit A. Seshia

Publication date: 25 March 2022

Abstract: We investigate the problem of monitoring partially observable systems with nondeterministic and probabilistic dynamics. In such systems, every state may be associated with a risk, e.g., the probability of an imminent crash. During runtime, we obtain partial information about the system state in the form of observations. The monitor uses this information to estimate the risk of the (unobservable) current system state. Our results are threefold. First, we show that extensions of state estimation approaches do not scale due to the combination of nondeterminism and probabilities. While convex hull algorithms improve the practical runtime, they do not prevent an exponential memory blowup. Second, we present a tractable algorithm based on model checking conditional reachability probabilities. Third, we provide prototypical implementations and demonstrate the applicability of our algorithms on a range of benchmarks. The results highlight the possibilities and boundaries of our novel algorithms.
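
Illustration (not part of the publication): the state estimation idea mentioned in the abstract can be sketched on a toy model. The sketch below is a simplified illustration under stated assumptions, not the authors' algorithm: the three-state model, the action names, the risk values, and the helpers `update`, `monitor`, and `worst_case_risk` are all hypothetical, and it assumes that each step's action is chosen independently of the hidden state. It tracks one belief (a distribution over hidden states) per way of resolving the nondeterminism, which is why the tracked set can grow with the length of the observation trace.

```python
from fractions import Fraction

# Hypothetical toy model (not from the paper): 3 states, 2 actions.
# TRANSITIONS[state][action] maps successor state -> probability.
H = Fraction(1, 2)
TRANSITIONS = {
    0: {"a": {0: H, 1: H}, "b": {1: H, 2: H}},
    1: {"a": {1: H, 2: H}, "b": {0: H, 2: H}},
    2: {"a": {2: Fraction(1)}, "b": {0: H, 2: H}},
}
# States 1 and 2 emit the same observation, so the monitor cannot tell them apart.
OBSERVATION = {0: "safe", 1: "warn", 2: "warn"}
# Assumed per-state risk, e.g. the probability of an imminent crash.
RISK = {0: Fraction(0), 1: Fraction(1, 10), 2: Fraction(9, 10)}


def update(belief, action, observation):
    """One Bayesian filtering step; returns None if the observation is impossible."""
    successor = {}
    for state, weight in belief.items():
        for nxt, prob in TRANSITIONS[state][action].items():
            if OBSERVATION[nxt] == observation:
                successor[nxt] = successor.get(nxt, Fraction(0)) + weight * prob
    total = sum(successor.values())
    if total == 0:
        return None
    return {s: w / total for s, w in successor.items()}


def monitor(initial_belief, trace):
    """Track every belief consistent with the trace, one per action sequence."""
    beliefs = {frozenset(initial_belief.items())}
    for observation in trace:
        next_beliefs = set()
        for frozen in beliefs:
            for action in ("a", "b"):
                updated = update(dict(frozen), action, observation)
                if updated is not None:
                    next_beliefs.add(frozenset(updated.items()))
        beliefs = next_beliefs
    return beliefs


def worst_case_risk(beliefs):
    """Largest expected risk over all tracked beliefs."""
    return max(sum(p * RISK[s] for s, p in b) for b in beliefs)


if __name__ == "__main__":
    tracked = monitor({0: Fraction(1)}, ["warn"])
    print(len(tracked), worst_case_risk(tracked))
```

Starting in state 0 and observing `warn` once, this toy monitor tracks two distinct beliefs and reports a worst-case expected risk of 1/2. Exact `Fraction` arithmetic is used so that identical beliefs are deduplicated reliably; the set of beliefs is still the quantity that can blow up on longer traces.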

Full work available at URL: https://arxiv.org/abs/2105.12322

Mathematics Subject Classification ID

Probability in computer science (algorithm analysis, random structures, phase transitions, etc.) (68Q87)
Specification and verification (program logics, model checking, etc.) (68Q60)
Markov and semi-Markov decision processes (90C40)

Cited In (10)
