Simple regret optimization in online planning for Markov decision processes
DOI: 10.1613/JAIR.4432
zbMATH Open: 1366.90216
arXiv: 1206.3382
OpenAlex: W2157136665
MaRDI QID: Q2921080
FDO: Q2921080
Authors: Zohar Feldman, Carmel Domshlak
Publication date: 30 September 2014
Published in: The Journal of Artificial Intelligence Research (JAIR)
Full work available at URL: https://arxiv.org/abs/1206.3382
Recommendations
- Regret in online combinatorial optimization
- Online regret bounds for Markov decision processes with deterministic transitions
- Online Regret Bounds for Markov Decision Processes with Deterministic Transitions
- Computationally efficient algorithms for on-line optimization of Markov decision processes
- Online planning algorithms for POMDPs
- Regret in the on-line decision problem
- Online Markov decision processes
- Online speedup learning for optimal planning
- Potential-Based Online Policy Iteration Algorithms for Markov Decision Processes
- Online Markov Decision Processes Under Bandit Feedback
Online algorithms; streaming algorithms (68W27)
Approximation methods and heuristics in mathematical programming (90C59)
Markov and semi-Markov decision processes (90C40)
Cited In (3)