Online Markov Decision Processes

From MaRDI portal

Publication:3169063

Jump to:navigation, search

DOI10.1287/moor.1090.0396zbMath1218.90207MaRDI QIDQ3169063

Yishay Mansour, Sham M. Kakade, Eyal Even-Dar

Publication date: 27 April 2011

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://semanticscholar.org/paper/5e838b1d5a0dc3454dad082b2ca6b9bf301bd25c

zbMATH Keywords

Markov decision process; no-regret algorithms

Mathematics Subject Classification ID

68Q32: Computational learning theory

68T05: Learning and adaptive systems in artificial intelligence

90C40: Markov and semi-Markov decision processes

Related Items

Online regret bounds for Markov decision processes with deterministic transitions, Online spatio-temporal matching in stochastic and dynamic domains, Approachability in Stackelberg stochastic games with vector costs, Reinforcement Learning in Robust Markov Decision Processes, Chasing Ghosts: Competing with Stateful Policies

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3169063&oldid=16254857"