10.1162/1532443041827880

Publication:4825999

DOI: 10.1162/1532443041827880; zbMath: 1094.68076; OpenAlex: W4230078047; Wikidata: Q62636510; Scholia: Q62636510; MaRDI QID: Q4825999

Junling Hu, Michael P. Wellman

Publication date: 5 November 2004

Published in: CrossRef Listing of Deleted DOIs

Full work available at URL: https://doi.org/10.1162/1532443041827880



Related Items

Belief and truth in hypothesised behaviours
EAQR: a multiagent Q-learning algorithm for coordination of multiple agents
Scalable Reinforcement Learning for Multiagent Networked Systems
GENERAL PROOF OF CONVERGENCE OF THE NASH-Q-LEARNING ALGORITHM
A game-theoretic analysis of deep neural networks
Stochastic and adaptive optimal control of uncertain interconnected systems: a data-driven approach
Learning Intelligent Controls in High Speed Networks: Synergies of Computational Intelligence with Control and Q-Learning Theories
Defensive deception against reactive jamming attacks in remote state estimation
Bounding fixed points of set-based Bellman operator and Nash equilibria of stochastic games
Coordinated learning by model difference identification in multiagent systems with sparse interactions
Defense and security planning under resource uncertainty and multi‐period commitments
Optimal power control for sensors and DoS attackers over a fading channel network
Value functions for depth-limited solving in zero-sum imperfect-information games
Evaluation and learning in two-player symmetric games via best and better responses
A jointly optimal design of control and scheduling in networked systems under denial-of-service attacks
Robust moving target defense against unknown attacks: a meta-reinforcement learning approach
Entropy regularized actor-critic based multi-agent deep reinforcement learning for stochastic games
Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
Provably efficient reinforcement learning in decentralized general-sum Markov games
Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
Memory-two strategies forming symmetric mutual reinforcement learning equilibrium in repeated prisoners' dilemma game
Event-triggered resilient control for cyber-physical system under denial-of-service attacks
Stochastic Dynamic Information Flow Tracking Game with Reinforcement Learning
Solving for Best Responses and Equilibria in Extensive-Form Games with Reinforcement Learning Methods
Deep Reinforcement Learning: A State-of-the-Art Walkthrough
Multi-sensor transmission power control for remote estimation through a SINR-based communication channel
Application of reinforcement learning to the game of Othello
Symmetric equilibrium of multi-agent reinforcement learning in repeated prisoner's dilemma
Cooperative and non-cooperative behaviour in the exploitation of a common renewable resource with environmental stochasticity
A multi-channel transmission schedule for remote state estimation under DoS attacks
Single-leader-multiple-follower games with boundedly rational agents
Nash Q-learning agents in Hotelling's model: reestablishing equilibrium
A Bayesian optimization approach to find Nash equilibria
Resilient strategy design for cyber-physical system under active eavesdropping attack
If multi-agent learning is the answer, what is the question?
Perspectives on multiagent learning
Multi-agent learning for engineers
Reinforcement Learning in Robust Markov Decision Processes
Reinforcement learning and stochastic optimisation
Multi-agent reinforcement learning: a selective overview of theories and algorithms
A game-theoretic perspective of deep neural networks
A unifying learning framework for building artificial game-playing agents
Deep Q-Learning for Nash Equilibria: Nash-DQN