Multiagent learning using a variable learning rate

Publication:1605410

DOI: 10.1016/S0004-3702(02)00121-2 · zbMath: 0995.68075 · OpenAlex: W2120327309 · MaRDI QID: Q1605410

Michael Bowling, Manuela M. Veloso

Publication date: 15 July 2002

Published in: Artificial Intelligence

Full work available at URL: https://doi.org/10.1016/s0004-3702(02)00121-2

zbMATH Keywords

game theory; reinforcement learning; multiagent learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Related Items (32)

Belief and truth in hypothesised behaviours ⋮ FUZZY STATE AGGREGATION AND POLICY HILL CLIMBING FOR STOCHASTIC ENVIRONMENTS ⋮ EAQR: a multiagent Q-learning algorithm for coordination of multiple agents ⋮ Autonomous agents modelling other agents: a comprehensive survey and open problems ⋮ AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents ⋮ A general criterion and an algorithmic framework for learning in multi-agent systems ⋮ Introduction to the special issue on learning and computational game theory ⋮ GENERAL PROOF OF CONVERGENCE OF THE NASH-Q-LEARNING ALGORITHM ⋮ Exploration-exploitation in multi-agent learning: catastrophe theory meets game theory ⋮ Continuous learning methods in two-buyer pricing problem ⋮ Learning equilibrium in bilateral bargaining games ⋮ Model Checking for Safe Navigation Among Humans ⋮ Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning ⋮ Multi-agent machine learning in self-organizing systems ⋮ Learning efficient Nash equilibria in distributed systems ⋮ $Q$-Learning in a Stochastic Stackelberg Game between an Uninformed Leader and a Naive Follower ⋮ Decentralized reinforcement learning of robot behaviors ⋮ SOLVING CONSTRAINED OPTIMIZATION PROBLEMS USING PROBABILITY COLLECTIVES AND A PENALTY FUNCTION APPROACH ⋮ On-policy concurrent reinforcement learning ⋮ Single-leader-multiple-follower games with boundedly rational agents ⋮ A distributed algorithm to obtain repeated games equilibria with discounting ⋮ Sharing in teams of heterogeneous, collaborative learning agents ⋮ Negotiating team formation using deep reinforcement learning ⋮ When autonomous agents model other agents: an appeal for altered judgment coupled with mouths, ears, and a little more tape ⋮ Analysis of Hannan consistent selection for Monte Carlo tree search in simultaneous move games ⋮ Unnamed Item ⋮ COOPERATIVE LEARNING BY POLICY-SHARING IN MULTIPLE AGENTS ⋮ Perspectives on multiagent learning ⋮ Learning with policy prediction in continuous state-action multi-agent decision processes ⋮ An adjustment scheme for nonlinear pricing problem with two buyers ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms ⋮ A Probability Collectives Approach for Multi-Agent Distributed and Cooperative Optimization with Tolerance for Agent Failure
