Monotonic value function factorisation for deep multi-agent reinforcement learning
From MaRDI portal
Publication:5148965
Authors: Tabish Rashid, Mikayel Samvelyan, Christian Schröder de Witt, Gregory Farquhar, Jakob N. Foerster, Shimon Whiteson
Publication date: 5 February 2021
Full work available at URL: https://arxiv.org/abs/2003.08839
Recommendations
- Cournot policy model: rethinking centralized training in multi-agent reinforcement learning
- On centralized critics in multi-agent reinforcement learning
- Improving coordination in small-scale multi-agent deep reinforcement learning through memory-driven communication
- Deep multi-agent reinforcement learning: a survey
Cites Work
- Lenient learning in independent-learner stochastic cooperative games
- Optimal and approximate Q-value functions for decentralized POMDPS
- Title not available (Why is that?)
- Approximate dynamic programming with a fuzzy parameterization
- Title not available (Why is that?)
- The Hanabi challenge: a new frontier for AI research
- Title not available (Why is that?)
- Incorporating functional knowledge in neural networks
- A concise introduction to decentralized POMDPs
- Title not available (Why is that?)
- Deep reinforcement learning for swarm systems
Cited In (12)
- Scalable Online Planning for Multi-Agent MDPs
- Cournot policy model: rethinking centralized training in multi-agent reinforcement learning
- Safe multi-agent reinforcement learning for multi-robot control
- A deep multi-agent reinforcement learning approach to solve dynamic job shop scheduling problem
- A sufficient statistic for influence in structured multiagent environments
- Title not available (Why is that?)
- A leader-following paradigm based deep reinforcement learning method for multi-agent cooperation games
- Hierarchical method for cooperative multiagent reinforcement learning in Markov decision processes
- On centralized critics in multi-agent reinforcement learning
- Deep reinforcement learning for swarm systems
- Multi-agent reinforcement learning: a selective overview of theories and algorithms
- A collaboration of multi-agent model using an interactive interface
Uses Software
This page was built for publication: Monotonic value function factorisation for deep multi-agent reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5148965)