10.1162/153244303765208377

From MaRDI portal

Publication:3044104

Jump to:navigation, search

DOI10.1162/153244303765208377zbMath1088.68694OpenAlexW1505937442MaRDI QIDQ3044104

Ronen I. Brafman, Moshe Tennenholtz

Publication date: 10 August 2004

Published in: CrossRef Listing of Deleted DOIs (Search for Journal in Brave)

Full work available at URL: http://jmlr.csail.mit.edu/papers/v3/brafman02a.html

zbMATH Keywords

Stochastic Games Markov Decision Processes Reinforcement Learning Learning in Games Provably Efficient Learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Related Items

Robust Control for Dynamical Systems with Non-Gaussian Noise via Formal Abstractions ⋮ Belief and truth in hypothesised behaviours ⋮ Guiding exploration by pre-existing knowledge without modifying reward ⋮ AWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents ⋮ Unnamed Item ⋮ A synthesis of automated planning and reinforcement learning for efficient, robust decision-making ⋮ Relational reinforcement learning with guided demonstrations ⋮ Fictitious Play in Zero-Sum Stochastic Games ⋮ Reducing reinforcement learning to KWIK online regression ⋮ Stochastic feature mapping for PAC-Bayes classification ⋮ Explicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial time ⋮ Unnamed Item ⋮ Knows what it knows: a framework for self-aware learning ⋮ Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning ⋮ Unnamed Item ⋮ Bayesian optimistic Kullback-Leibler exploration ⋮ Reinforcement Learning, Bit by Bit ⋮ Recent advances in reinforcement learning in finance ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ Independent learning in stochastic games ⋮ Dimension reduction and its application to model-based exploration in continuous spaces ⋮ Adaptive-resolution reinforcement learning with polynomial exploration in deterministic domains ⋮ Unnamed Item ⋮ Decentralized reinforcement learning of robot behaviors ⋮ An analysis of model-based interval estimation for Markov decision processes ⋮ Statistical estimation with bounded memory ⋮ Efficient exploration through active learning for value function approximation in reinforcement learning ⋮ A joint Gaussian process model for active visual recognition with expertise estimation in crowdsourcing ⋮ Robust Algorithms via PAC-Bayes and Laplace Distributions ⋮ Bayesian Exploration for Approximate Dynamic Programming ⋮ R-MAX ⋮ Cooperative learning with joint state value approximation for multi-agent systems ⋮ If multi-agent learning is the answer, what is the question? ⋮ Perspectives on multiagent learning ⋮ Reinforcement Learning in Robust Markov Decision Processes ⋮ Controller exploitation-exploration reinforcement learning architecture for computing near-optimal policies ⋮ Unnamed Item ⋮ Multi-agent reinforcement learning: a selective overview of theories and algorithms ⋮ Induction and Exploitation of Subgoal Automata for Reinforcement Learning ⋮ Unnamed Item ⋮ Model-based Reinforcement Learning: A Survey ⋮ Efficient learning equilibrium

This page was built for publication: 10.1162/153244303765208377

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3044104&oldid=16088517"