Dynamic Programming Principles for Mean-Field Controls with Learning
From MaRDI portal
Publication:6192781
DOI10.1287/opre.2022.2395arXiv1911.07314OpenAlexW4315786096MaRDI QIDQ6192781
Haotian Gu, Xiaoli Wei, Renyuan Xu, Xin Guo
Publication date: 12 March 2024
Published in: Operations Research (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1911.07314
reinforcement learningcooperative gameQ-learningdynamic programming principlemulti-agent reinforcement learningmean-field controls