Mean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity Analysis

From MaRDI portal
Publication:5018896

DOI10.1137/20M1360700zbMath1479.49088arXiv2002.04131OpenAlexW3208215078MaRDI QIDQ5018896

Xin Guo, Haotian Gu, Xiaoli Wei, Renyuan Xu

Publication date: 27 December 2021

Published in: SIAM Journal on Mathematics of Data Science (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/2002.04131



Related Items



Cites Work