Distributed consensus-based multi-agent temporal-difference learning

From MaRDI portal

Publication:6164031

Jump to:navigation, search

DOI10.1016/j.automatica.2023.110922zbMath1520.93516MaRDI QIDQ6164031

Srdjan S. Stanković, Marko Beko, Miloš S. Stanković

Publication date: 30 June 2023

Published in: Automatica (Search for Journal in Brave)

Mathematics Subject Classification ID

Decentralized systems (93A14) Markov and semi-Markov decision processes (90C40) Multi-agent systems (93A16) Consensus (93D50)

Related Items

Multi-agent off-policy actor-critic algorithm for distributed multi-task reinforcement learning

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:6164031&oldid=35640720"