Stochastic limit-average games are in EXPTIME (Q933752): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: New results on quantifier elimination over real closed fields and applications to constraint databases / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms in real algebraic geometry / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the menbership problem for functional and multivalued dependencies in relational databases / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Asymptotic Theory of Stochastic Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Big Match / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4382287 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Nonterminating Stochastic Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Number of quantifiers is better than number of tape cells / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Games with Perfect Information and Time Average Payoff / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4437142 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4298260 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An orderfield property for stochastic games when one player controls transition probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Algorithms for stochastic games ? A survey / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuity of the value of competitive Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5807665 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The complexity of mean payoff games on graphs / rank
 
Normal rank

Latest revision as of 13:46, 28 June 2024

scientific article
Language Label Description Also known as
English
Stochastic limit-average games are in EXPTIME
scientific article

    Statements

    Stochastic limit-average games are in EXPTIME (English)
    0 references
    0 references
    0 references
    0 references
    25 July 2008
    0 references
    A repeated zero-sum stochastic game \(G=(S,A,\Gamma_1,\Gamma_2, \delta,r)\) played by two players is considered. Here \(S\) and \(A\) are finite space of states and set of actions respectively, \(\Gamma_i(s) \subseteq A\), \(i=1,2\), \(s \in S\), describe possible sets of actions of players at each state. If in the state \(s\) players choose actions \(a\) and \(b\) respectively then \(\delta(s,a,b)(t)\) is a probability to reach the state \(t\) and \(r(s,a,b)\) is a real-valued reward of the first player. Strategies \(\sigma\) and \(\pi\) of players can depend on all known history of the game. The limit-average for the player 1 reward of a pair of strategies \(\sigma\) and \(\pi\) and a starting state \(s\) is defined as \[ v_1(s,\sigma,\pi)=E_s^{\sigma, \pi}\left[ \liminf_{n \to \infty} n^{-1} \sum_{i=1}^n r(X_i,\Theta_{i,1}, \Theta_{i.2}) \right], \] where \(X_i\), \(\Theta_{i,1}\), \(\Theta_{i.2}\) stand for the state and chosen actions at instant of time \(i\). In the article the computational complexity of approximating the value of the game is characterized. It is shown that for any given real \(\varepsilon >0\) the value of a game \(G\) at a state \(s\) can be computed to within \(\varepsilon\)-precision in time bounded by an exponential in a polynomial in the size of the game \(G\) times a polynomial function of \(\log 1/\varepsilon\). The main technique is the characterization of values as semi-algebraic quantities.
    0 references
    0 references
    stochastic games
    0 references
    limit-average payoff
    0 references
    computational complexity
    0 references
    0 references
    0 references