Multiagent learning using a variable learning rate
From MaRDI portal
Publication:1605410
DOI10.1016/S0004-3702(02)00121-2zbMath0995.68075OpenAlexW2120327309MaRDI QIDQ1605410
Michael Bowling, Manuela M. Veloso
Publication date: 15 July 2002
Published in: Artificial Intelligence (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/s0004-3702(02)00121-2
Related Items (32)
Belief and truth in hypothesised behaviours ⋮ FUZZY STATE AGGREGATION AND POLICY HILL CLIMBING FOR STOCHASTIC ENVIRONMENTS ⋮ EAQR: a multiagent Q-learning algorithm for coordination of multiple agents ⋮ Autonomous agents modelling other agents: a comprehensive survey and open problems ⋮ AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents ⋮ A general criterion and an algorithmic framework for learning in multi-agent systems ⋮ Introduction to the special issue on learning and computational game theory ⋮ GENERAL PROOF OF CONVERGENCE OF THE NASH-Q-LEARNING ALGORITHM ⋮ Exploration-exploitation in multi-agent learning: catastrophe theory meets game theory ⋮ Continuous learning methods in two-buyer pricing problem ⋮ Learning equilibrium in bilateral bargaining games ⋮ Model Checking for Safe Navigation Among Humans ⋮ Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning ⋮ Multi-agent machine learning in self-organizing systems ⋮ Learning efficient Nash equilibria in distributed systems ⋮ $Q$-Learning in a Stochastic Stackelberg Game between an Uninformed Leader and a Naive Follower ⋮ Decentralized reinforcement learning of robot behaviors ⋮ SOLVING CONSTRAINED OPTIMIZATION PROBLEMS USING PROBABILITY COLLECTIVES AND A PENALTY FUNCTION APPROACH ⋮ On-policy concurrent reinforcement learning ⋮ Single-leader-multiple-follower games with boundedly rational agents ⋮ A distributed algorithm to obtain repeated games equilibria with discounting ⋮ Sharing in teams of heterogeneous, collaborative learning agents ⋮ Negotiating team formation using deep reinforcement learning ⋮ When autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tape ⋮ Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games ⋮ Unnamed Item ⋮ COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS ⋮ Perspectives on multiagent learning ⋮ Learning with policy prediction in continuous state-action multi-agent decision processes ⋮ An adjustment scheme for nonlinear pricing problem with two buyers ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms ⋮ A Probability Collectives Approach for Multi-Agent Distributed and Cooperative Optimization with Tolerance for Agent Failure
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- On-line learning and the metrical task system problem
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Two-person nonzero-sum games and quadratic programming
- An iterative method of solving a game
- Equilibrium points in n -person games
- Stochastic Games
This page was built for publication: Multiagent learning using a variable learning rate