On a continuous solution to the Bellman-Poisson equation in stochastic games (Q983723)

Zero-sum semi-Markov games are considered with a state set \(S=\{s\}\), action sets \(A=\{a\}\) and \(B=\{b\}\) of the two players, weakly continuous transition probabilities and the expected average payoff criterion. Assuming \(V\)-geometric ergodicity of the embedded Markov chain, it is proved that there exists a continuous solution to the Bellman-Poisson equation \[ \begin{aligned} \hat{h}(s)&=\max_{a} \min_{b} \left[r(s,a,b)- \hat{\xi} \tau(s,a,b) +\int_S \hat{h}(y) q(dy|s,a,b)\right]\\ &=\min_{b} \max_{a} \left[r(s,a,b)- \hat{\xi} \tau(s,a,b) +\int_S \hat{h}(y) q(dy|s,a,b)\right] \end{aligned} \] for all \(s \in S\). Here \(r(s,a,b)\) and \(\tau(s,a,b)\) are, respectively, the reward of player 1 and the expected occupation time of the process in state \(s\) given that actions \(a\) and \(b\) were chosen, \(q(\cdot|s,a,b)\) is the transition law of the embedded Markov chain, and \(\hat{\xi}\) is the value of the game. Both players possess optimal stationary strategies.
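
To make the equation concrete, the following is a minimal sketch that checks it numerically on a toy model. It assumes finite sets of states and actions (so the integral becomes a sum) and a saddle point in pure actions; the reviewed paper works in a far more general setting. All function names, the nested-list data layout, and the two-state example are hypothetical and chosen only for illustration.

```python
def bracket(h, xi, r, tau, q, s, a, b):
    """r(s,a,b) - xi*tau(s,a,b) + sum_y h(y) q(y|s,a,b) on a finite state set."""
    expected_h = sum(q[s][a][b][y] * h[y] for y in range(len(h)))
    return r[s][a][b] - xi * tau[s][a][b] + expected_h


def maxmin_and_minmax(h, xi, r, tau, q, s):
    """Return (max_a min_b, min_b max_a) of the bracket over pure actions at s."""
    n_a, n_b = len(r[s]), len(r[s][0])
    table = [[bracket(h, xi, r, tau, q, s, a, b) for b in range(n_b)]
             for a in range(n_a)]
    maxmin = max(min(row) for row in table)
    minmax = min(max(table[a][b] for a in range(n_a)) for b in range(n_b))
    return maxmin, minmax


def residuals(h, xi, r, tau, q):
    """Per state: (maxmin - h(s), minmax - h(s)); both vanish at a solution."""
    out = []
    for s in range(len(h)):
        maxmin, minmax = maxmin_and_minmax(h, xi, r, tau, q, s)
        out.append((maxmin - h[s], minmax - h[s]))
    return out


# Hypothetical toy model (an assumption, not taken from the paper).
# State 0: a 2x2 matrix game with a pure saddle point, then a sure move to state 1.
# State 1: one action per player, zero reward, then a sure move back to state 0.
r = [[[2.0, 3.0], [1.0, 4.0]], [[0.0]]]
tau = [[[1.0, 1.0], [1.0, 1.0]], [[1.0]]]
q = [
    [[[0.0, 1.0], [0.0, 1.0]], [[0.0, 1.0], [0.0, 1.0]]],
    [[[1.0, 0.0]]],
]

xi = 1.0          # candidate value: reward 2 earned over a cycle of length 2
h = [1.0, 0.0]    # candidate solution of the equation

print(residuals(h, xi, r, tau, q))
# Shifting h by a constant leaves the residuals unchanged, because
# q(.|s,a,b) is a probability distribution.
print(residuals([v + 5.0 for v in h], xi, r, tau, q))
```

In this toy model both residuals vanish at every state for the candidate pair, and they still vanish after adding a constant to every entry of `h`, which illustrates that a solution of the equation is determined at most up to an additive constant.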

reviewed by

Alex V. Kolnogorov

zbMATH Keywords

zero-sum semi-Markov games

optimality equations

optimal strategies

Identifiers

zbMATH Open document ID

1198.91034

DOI

10.1007/s10957-010-9698-x

Mathematics Subject Classification ID

Sitelinks

Mathematics (1 entry)

mardi Publication:983723
