10.1162/153244303765208377

From MaRDI portal
Publication:3044104

DOI10.1162/153244303765208377zbMath1088.68694OpenAlexW1505937442MaRDI QIDQ3044104

Ronen I. Brafman, Moshe Tennenholtz

Publication date: 10 August 2004

Published in: CrossRef Listing of Deleted DOIs (Search for Journal in Brave)

Full work available at URL: http://jmlr.csail.mit.edu/papers/v3/brafman02a.html




Related Items

Robust Control for Dynamical Systems with Non-Gaussian Noise via Formal AbstractionsBelief and truth in hypothesised behavioursGuiding exploration by pre-existing knowledge without modifying rewardAWESOME: a general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponentsUnnamed ItemA synthesis of automated planning and reinforcement learning for efficient, robust decision-makingRelational reinforcement learning with guided demonstrationsFictitious Play in Zero-Sum Stochastic GamesReducing reinforcement learning to KWIK online regressionStochastic feature mapping for PAC-Bayes classificationExplicit explore, exploit, or escape \((E^4)\): near-optimal safety-constrained reinforcement learning in polynomial timeUnnamed ItemKnows what it knows: a framework for self-aware learningLearning to compete, coordinate, and cooperate in repeated games using reinforcement learningUnnamed ItemBayesian optimistic Kullback-Leibler explorationReinforcement Learning, Bit by BitRecent advances in reinforcement learning in financeFrom Reinforcement Learning to Deep Reinforcement Learning: An OverviewIndependent learning in stochastic gamesDimension reduction and its application to model-based exploration in continuous spacesAdaptive-resolution reinforcement learning with polynomial exploration in deterministic domainsUnnamed ItemDecentralized reinforcement learning of robot behaviorsAn analysis of model-based interval estimation for Markov decision processesStatistical estimation with bounded memoryEfficient exploration through active learning for value function approximation in reinforcement learningA joint Gaussian process model for active visual recognition with expertise estimation in crowdsourcingRobust Algorithms via PAC-Bayes and Laplace DistributionsBayesian Exploration for Approximate Dynamic ProgrammingR-MAXCooperative learning with joint state value approximation for multi-agent systemsIf multi-agent learning is the answer, what is the question?Perspectives on multiagent learningReinforcement Learning in Robust Markov Decision ProcessesController exploitation-exploration reinforcement learning architecture for computing near-optimal policiesUnnamed ItemMulti-agent reinforcement learning: a selective overview of theories and algorithmsInduction and Exploitation of Subgoal Automata for Reinforcement LearningUnnamed ItemModel-based Reinforcement Learning: A SurveyEfficient learning equilibrium




This page was built for publication: 10.1162/153244303765208377