Independent learning in stochastic games (Q6200215)

From MaRDI portal
scientific article; zbMATH DE number 7822596
Language Label Description Also known as
English
Independent learning in stochastic games
scientific article; zbMATH DE number 7822596

    Statements

    Independent learning in stochastic games (English)
    0 references
    0 references
    0 references
    0 references
    22 March 2024
    0 references
    Summary: Reinforcement learning (RL) has recently achieved tremendous successes in many artificial intelligence applications. Many of the forefront applications of RL involve \textit{multiple agents}, e.g., playing chess and Go games, autonomous driving, and robotics. Unfortunately, the framework upon which classical RL builds is inappropriate for multiagent learning, as it assumes an agent's environment is stationary and does not take into account the adaptivity of other agents. In this review paper, we present the model of \textit{stochastic games} due to [\textit{L. S. Shapley}, Proc. Natl. Acad. Sci. USA 39, 1095--1100 (1953; Zbl 0051.35805)] for multiagent learning in \textit{dynamic} environments. We focus on the development of \textit{simple} and \textit{independent} learning dynamics for stochastic games: each agent is myopic and chooses best-response type actions to other agents' strategy without any coordination with her opponent. There has been limited progress on developing convergent best-response type independent learning dynamics for stochastic games. We present our recently proposed simple and independent learning dynamics that guarantee convergence in zero-sum stochastic games, together with a review of other contemporaneous algorithms for dynamic multiagent learning in this setting. Along the way, we also reexamine some classical results from both the game theory and RL literature, to situate both the conceptual contributions of our independent learning dynamics, and the mathematical novelties of our analysis. We hope this review paper serves as an impetus for the resurgence of studying independent and natural learning dynamics in game theory, for the more challenging settings with a dynamic environment. For the entire collection see [Zbl 07816361].
    0 references
    stochastic games
    0 references
    learning in games
    0 references
    reinforcement learning
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references