Fully asynchronous policy evaluation in distributed reinforcement learning over networks
Publication:2063869
DOI: 10.1016/j.automatica.2021.110092; zbMath: 1480.93027; arXiv: 2003.00433; MaRDI QID: Q2063869
Tamer Başar, Keyou You, Jiaqi Zhang, Xingyu Sha, Kaiqing Zhang
Publication date: 3 January 2022
Published in: Automatica
Full work available at URL: https://arxiv.org/abs/2003.00433
policy evaluation; multi-agent networks; distributed reinforcement learning; fully asynchronous updates
68T05: Learning and adaptive systems in artificial intelligence
68W15: Distributed algorithms
93A16: Multi-agent systems
93B70: Networked control