Optimistic and topological value iteration for simple stochastic games

From MaRDI portal
Publication:6160920

DOI10.1007/978-3-031-19992-9_18zbMATH Open1522.68295arXiv2207.14417OpenAlexW4312411454MaRDI QIDQ6160920FDOQ6160920


Authors: Muqsit Azeem, Alexandros Evangelidis, Jan Křetínský, Alexander Slivinskiy, Maximilian Weininger Edit this on Wikidata


Publication date: 2 June 2023

Published in: Automated Technology for Verification and Analysis (Search for Journal in Brave)

Abstract: While value iteration (VI) is a standard solution approach to simple stochastic games (SSGs), it suffered from the lack of a stopping criterion. Recently, several solutions have appeared, among them also "optimistic" VI (OVI). However, OVI is applicable only to one-player SSGs with no end components. We lift these two assumptions, making it available to general SSGs. Further, we utilize the idea in the context of topological VI, where we provide an efficient precise solution. In order to compare the new algorithms with the state of the art, we use not only the standard benchmarks, but we also design a random generator of SSGs, which can be biased towards various types of models, aiding in understanding the advantages of different algorithms on SSGs.


Full work available at URL: https://arxiv.org/abs/2207.14417




Recommendations



Cites Work


Cited In (6)





This page was built for publication: Optimistic and topological value iteration for simple stochastic games

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6160920)