Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task
From MaRDI portal
Recommendations
Cites work
- scientific article; zbMATH DE number 2038829 (Why is no real title available?)
- A new approach to the design of reinforcement schemes for learning automata
- Near-optimal reinforcement learning in polynomial time
- Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
- Reinforcement learning. An introduction
- \({\mathcal Q}\)-learning
This page was built for publication: Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q505118)