Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis

From MaRDI portal

Publication:5018896

Jump to:navigation, search

DOI10.1137/20M1360700zbMath1479.49088arXiv2002.04131OpenAlexW3208215078MaRDI QIDQ5018896

Xin Guo, Haotian Gu, Xiaoli Wei, Renyuan Xu

Publication date: 27 December 2021

Published in: SIAM Journal on Mathematics of Data Science (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2002.04131

zbMATH Keywords

cooperative games Q-learning mean-field control dynamic programming principle multi-agent reinforcement learning

Mathematics Subject Classification ID

Noncooperative games (91A10) Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40) Mean field games and control (49N80)

Related Items

Unified reinforcement Q-learning for mean field game and control problems, Graphon mean-field control for cooperative multi-agent reinforcement learning, Model-free mean-field reinforcement learning: mean-field MDP and mean-field Q-learning, Reinforcement learning and stochastic optimisation, Exploratory LQG mean field games with entropy regularization

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5018896&oldid=19484809"