Discounting and averaging in games across time scales (Q2909220)

From MaRDI portal





scientific article; zbMATH DE number 6073961
Language Label Description Also known as
default for all languages
No label defined
    English
    Discounting and averaging in games across time scales
    scientific article; zbMATH DE number 6073961

      Statements

      0 references
      0 references
      30 August 2012
      0 references
      games on graphs
      0 references
      discounted objectives
      0 references
      mean-payoff objectives
      0 references
      Discounting and averaging in games across time scales (English)
      0 references
      Two-level discounted and mean-payoff games played by two players on a perfect-information stochastic game graph are studied. The upper level game is a discounted or mean-payoff game and the lower level game is a (undiscounted) reachability game. Two-level games model hierarchical and sequential decision making under uncertainty across different time scales. For both discounted and mean-payoff two-level games, it is shown that there exist pure memoryless optimal strategies for both players and an ordered field property. It is shown that if there is only one player (Markov decision processes), then the values can be computed in polynomial time. It follows that whether the value of a player is equal to a given rational constant in two-level discounted or mean-payoff games can be decided in NP \(\cap \) coNP.
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references