Online Markov Decision Processes
Publication:3169063
DOI: 10.1287/moor.1090.0396
zbMath: 1218.90207
MaRDI QID: Q3169063
Yishay Mansour, Sham M. Kakade, Eyal Even-Dar
Publication date: 27 April 2011
Published in: Mathematics of Operations Research
Full work available at URL: https://semanticscholar.org/paper/5e838b1d5a0dc3454dad082b2ca6b9bf301bd25c
68Q32: Computational learning theory
68T05: Learning and adaptive systems in artificial intelligence
90C40: Markov and semi-Markov decision processes
Related Items
Online regret bounds for Markov decision processes with deterministic transitions
Online spatio-temporal matching in stochastic and dynamic domains
Approachability in Stackelberg stochastic games with vector costs
Reinforcement Learning in Robust Markov Decision Processes
Chasing Ghosts: Competing with Stateful Policies