On-policy concurrent reinforcement learning

Publication:4670596

DOI: 10.1080/09528130412331297956; MaRDI QID: Q4670596

Authors Bikramjit Banerjee, Sandip Sen, Jing Peng

Publication date 22 April 2005

Published in Journal of Experimental & Theoretical Artificial Intelligence

Full work available at URL https://doi.org/10.1080/09528130412331297956

zbMATH Keywords

multi-agent learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)
Theory of software (68N99)

Cited in

(1)

Concurrent Q-learning: Reinforcement learning for dynamic goals and environments
