Independent learning in stochastic games (Q6200215)

From MaRDI portal

Jump to:navigation, search

scientific article; zbMATH DE number 7822596

Language	Label	Description	Also known as
English	Independent learning in stochastic games	scientific article; zbMATH DE number 7822596

Statements

scholarly article

0 references

Independent learning in stochastic games (English)

0 references

Asuman Ozdaglar

0 references

Muhammed O. Sayin

0 references

0 references

International Congress of Mathematicians

0 references

publication date

22 March 2024

0 references

full work available at URL

https://arxiv.org/abs/2111.11743

0 references

Summary: Reinforcement learning (RL) has recently achieved tremendous successes in many artificial intelligence applications. Many of the forefront applications of RL involve \textit{multiple agents}, e.g., playing chess and Go games, autonomous driving, and robotics. Unfortunately, the framework upon which classical RL builds is inappropriate for multiagent learning, as it assumes an agent's environment is stationary and does not take into account the adaptivity of other agents. In this review paper, we present the model of \textit{stochastic games} due to [\textit{L. S. Shapley}, Proc. Natl. Acad. Sci. USA 39, 1095--1100 (1953; Zbl 0051.35805)] for multiagent learning in \textit{dynamic} environments. We focus on the development of \textit{simple} and \textit{independent} learning dynamics for stochastic games: each agent is myopic and chooses best-response type actions to other agents' strategy without any coordination with her opponent. There has been limited progress on developing convergent best-response type independent learning dynamics for stochastic games. We present our recently proposed simple and independent learning dynamics that guarantee convergence in zero-sum stochastic games, together with a review of other contemporaneous algorithms for dynamic multiagent learning in this setting. Along the way, we also reexamine some classical results from both the game theory and RL literature, to situate both the conceptual contributions of our independent learning dynamics, and the mathematical novelties of our analysis. We hope this review paper serves as an impetus for the resurgence of studying independent and natural learning dynamics in game theory, for the more challenging settings with a dynamic environment. For the entire collection see [Zbl 07816361].

0 references

zbMATH Keywords

stochastic games

0 references

learning in games

0 references

reinforcement learning

0 references

MaRDI profile type

MaRDI publication profile

0 references

Decentralized Q-Learning for Stochastic Teams and Games

0 references

0 references

Stochastic Approximations and Differential Inclusions

0 references

Fictitious play in \(2\times n\) games

0 references

Learning in games with strategic complementarities revisited

0 references

Distributed dynamic programming

0 references

0 references

REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES

0 references

Stochastic approximation. A dynamical systems viewpoint.

0 references

10.1162/153244303765208377

0 references

0 references

Prediction, Learning, and Games

0 references

0 references

Nash equilibrium and the evolution of preferences

0 references

0 references

Adaptive game playing using multiplicative weights

0 references

Learning mixed equilibria

0 references

Consistency and cautious fictitious play

0 references

0 references

On the rate of convergence of continuous-time fictitious play

0 references

On the Global Convergence of Stochastic Fictitious Play

0 references

Stochastic approximation methods for constrained and unconstrained systems

0 references

Individual <i>Q</i>-Learning in Normal Form Games

0 references

Best-response dynamics in zero-sum stochastic games

0 references

The weighted majority algorithm

0 references

A Theory of Dynamic Oligopoly, I: Overview and Quantity Competition with Large Fixed Costs

0 references

A Theory of Dynamic Oligopoly, II: Price Competition, Kinked Demand Curves, and Edgeworth Cycles

0 references

Quantal response equilibria for normal form games

0 references

Adaptive and sophisticated learning in normal form games

0 references

A \(2 \times 2\) game without the fictitious play property

0 references

Fictitious play property for games with identical interests

0 references

Potential games

0 references

Asynchronous stochastic approximation with differential inclusions

0 references

An iterative method of solving a game

0 references

Intrinsic robustness of the price of anarchy

0 references

Fictitious play in ''one-against-all'' multi-player games

0 references

Online Learning and Online Convex Optimization

0 references

Stochastic Games

0 references

0 references

Composable and efficient mechanisms

0 references

0 references

Asynchronous stochastic approximation and Q-learning

0 references

A WEAKENED FORM OF FICTITIOUS PLAY IN TWO-PERSON ZERO-SUM GAMES

0 references

\({\mathcal Q}\)-learning

0 references

Multi-agent reinforcement learning: a selective overview of theories and algorithms

0 references

Identifiers

0 references

10.4171/icm2022/152

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:6200215

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q6200215&oldid=37650870"