Reachability in recursive Markov decision processes (Q924718): Difference between revisions

A class of infinite-state Markov decision processes generated by stateless pushdown automata is considered. This class corresponds to 1 1/2-player games over graphs generated by BPA systems or (equivalently) 1-exit recursive state machines. An extended reachability objective is specified by two sets \(S\) and \(T\) of safe and terminal stack configurations, where the membership to \(S\) and \(T\) depends just on the top-of-the-stack symbol. The question is whether there is a suitable strategy such that the probability of hitting a terminal configuration by a path leading only through safe configurations is equal to (or different from) a given \(x\) in \((0,1)\). It is shown that the qualitative extended reachability problem is decidable in polynomial time, and that the set of all configurations for which there is a winning strategy is effectively regular. More precisely, this set can be represented by a deterministic finite-state automaton with a fixed number of control states. This result is a generalization of a recent theorem by Etessami and Yannakakis which says that the qualitative termination for 1-exit RMDPs (which exactly correspond to our 1 1/2-player BPA games) is decidable in polynomial time. Interestingly, the properties of winning strategies for the extended reachability objectives are quite different from the ones for termination, and new observations are needed to obtain the result. As an application, the EXPTIME-completeness of the model-checking problem is obtained for 1 1/2-player BPA games and qualitative PCTL formulae.

0 references

reviewed by

Giacomo Bonanno

0 references

zbMATH Keywords

Markov decision processes

0 references

Temporal logics

0 references

Stochastic games

0 references

describes a project that uses

PRISM

0 references

MaRDI profile type

MaRDI publication profile

0 references

full work available at URL

https://doi.org/10.1016/j.ic.2007.09.002

0 references

cites work

Q5137353

0 references

Model checking of probabilistic and nondeterministic systems

0 references

Reachability analysis of pushdown automata: Application to model-checking

0 references

STACS 2005

0 references

Model checking LTL with regular valuations for pushdown systems

0 references

Automata, Languages and Programming

0 references

Efficient Qualitative Analysis of Classes of Recursive Markov Decision Processes and Simple Stochastic Games

0 references

Handbook of Markov decision processes. Methods and applications

0 references

Q5538132

0 references

A logic for reasoning about time and reliability

0 references

Optimal control of diffusion processes with reflection

0 references

Pushdown processes: Games and model-checking

0 references

Identifiers

zbMATH Open document ID

1145.91011

0 references

DOI

10.1016/j.ic.2007.09.002

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:924718

@@ Property / cites work @@
+Q5137353
@@ Property / cites work: Q5137353 / rank @@
+Normal rank
@@ Property / cites work @@
+Model checking of probabilistic and nondeterministic systems
+Normal rank
@@ Property / cites work @@
+Reachability analysis of pushdown automata: Application to model-checking
+Normal rank
@@ Property / cites work @@
+STACS 2005
@@ Property / cites work: STACS 2005 / rank @@
+Normal rank
@@ Property / cites work @@
+Model checking LTL with regular valuations for pushdown systems
+Normal rank
@@ Property / cites work @@
+Automata, Languages and Programming
@@ Property / cites work: Automata, Languages and Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Efficient Qualitative Analysis of Classes of Recursive Markov Decision Processes and Simple Stochastic Games
+Normal rank
@@ Property / cites work @@
+Handbook of Markov decision processes. Methods and applications
+Normal rank
@@ Property / cites work @@
+Q5538132
@@ Property / cites work: Q5538132 / rank @@
+Normal rank
@@ Property / cites work @@
+A logic for reasoning about time and reliability
@@ Property / cites work: A logic for reasoning about time and reliability / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal control of diffusion processes with reflection
+Normal rank
@@ Property / cites work @@
+Pushdown processes: Games and model-checking
@@ Property / cites work: Pushdown processes: Games and model-checking / rank @@
+Normal rank