Bayesian optimistic Kullback-Leibler exploration
From MaRDI portal
Publication:2425228
Recommendations
- Bayesian Reinforcement Learning with Exploration
- Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
- Exploration in relational domains for model-based reinforcement learning
- Dual control for approximate Bayesian reinforcement learning
- Using trajectory data to improve Bayesian optimization for reinforcement learning
Cites work
- scientific article; zbMATH DE number 2107836 (Why is no real title available?)
- scientific article; zbMATH DE number 5685899 (Why is no real title available?)
- 10.1162/153244303765208377
- An analysis of model-based interval estimation for Markov decision processes
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Near-optimal regret bounds for reinforcement learning
- Near-optimal reinforcement learning in polynomial time
Cited in
(6)- Bayesian Reinforcement Learning with Exploration
- Using trajectory data to improve Bayesian optimization for reinforcement learning
- Exploration in relational domains for model-based reinforcement learning
- Dual control for approximate Bayesian reinforcement learning
- scientific article; zbMATH DE number 6276176 (Why is no real title available?)
- Optimistic reinforcement learning by forward Kullback-Leibler divergence optimization
This page was built for publication: Bayesian optimistic Kullback-Leibler exploration
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2425228)