Bayesian optimistic Kullback-Leibler exploration
Publication:2425228
DOI: 10.1007/S10994-018-5767-4; zbMATH Open: 1493.68304; OpenAlex: W2903743315; Wikidata: Q128709924; Scholia: Q128709924; MaRDI QID: Q2425228; FDO: Q2425228
Daniel D. Lee, Kee-Eung Kim, Pedro A. Ortega, Geon-Hyeong Kim, Kanghoon Lee
Publication date: 26 June 2019
Published in: Machine Learning
Full work available at URL: https://doi.org/10.1007/s10994-018-5767-4
Statistical aspects of information-theoretic topics (62B10); Bayesian inference (62F15); Learning and adaptive systems in artificial intelligence (68T05)
Cites Work
- 10.1162/153244303765208377
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Near-optimal regret bounds for reinforcement learning
- An analysis of model-based interval estimation for Markov decision processes
- Near-optimal reinforcement learning in polynomial time
- Title not available
- Title not available
Cited In (2)