10.1162/153244303765208377
Publication:3044104
DOI: 10.1162/153244303765208377 ⋮ zbMath: 1088.68694 ⋮ OpenAlex: W1505937442 ⋮ MaRDI QID: Q3044104
Ronen I. Brafman, Moshe Tennenholtz
Publication date: 10 August 2004
Published in: CrossRef Listing of Deleted DOIs
Full work available at URL: http://jmlr.csail.mit.edu/papers/v3/brafman02a.html
Stochastic Games ⋮ Markov Decision Processes ⋮ Reinforcement Learning ⋮ Learning in Games ⋮ Provably Efficient Learning
Related Items
Robust Control for Dynamical Systems with Non-Gaussian Noise via Formal Abstractions ⋮ Belief and truth in hypothesised behaviours ⋮ Guiding exploration by pre-existing knowledge without modifying reward ⋮ AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents ⋮ Unnamed Item ⋮ A synthesis of automated planning and reinforcement learning for efficient, robust decision-making ⋮ Relational reinforcement learning with guided demonstrations ⋮ Fictitious Play in Zero-Sum Stochastic Games ⋮ Reducing reinforcement learning to KWIK online regression ⋮ Stochastic feature mapping for PAC-Bayes classification ⋮ Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time ⋮ Unnamed Item ⋮ Knows what it knows: a framework for self-aware learning ⋮ Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning ⋮ Unnamed Item ⋮ Bayesian optimistic Kullback-Leibler exploration ⋮ Reinforcement Learning, Bit by Bit ⋮ Recent advances in reinforcement learning in finance ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Independent learning in stochastic games ⋮ Dimension reduction and its application to model-based exploration in continuous spaces ⋮ Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains ⋮ Unnamed Item ⋮ Decentralized reinforcement learning of robot behaviors ⋮ An analysis of model-based interval estimation for Markov decision processes ⋮ Statistical estimation with bounded memory ⋮ Efficient exploration through active learning for value function approximation in reinforcement learning ⋮ A joint Gaussian process model for active visual recognition with expertise estimation in crowdsourcing ⋮ Robust Algorithms via PAC-Bayes and Laplace Distributions ⋮ Bayesian Exploration for Approximate Dynamic Programming ⋮ R-MAX ⋮ Cooperative learning with joint state value approximation for multi-agent systems ⋮ If multi-agent learning is the answer, what is the question? ⋮ Perspectives on multiagent learning ⋮ Reinforcement Learning in Robust Markov Decision Processes ⋮ Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies ⋮ Unnamed Item ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms ⋮ Induction and Exploitation of Subgoal Automata for Reinforcement Learning ⋮ Unnamed Item ⋮ Model-based Reinforcement Learning: A Survey ⋮ Efficient learning equilibrium